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Abstract :  The  authors  present  a  modelling 
method  allowing  the  study  of  voltage  inverter 
fed  slotless  permanent  magnet  machines.  The 
suggested  method  is  based  on  an  analytical 
field  calculation  inside  the  magnetic 
structure  which  is  associated  with  a  numerical 
solution  of  the  global  electric  equation. 

INTRODUCTION: 

For  special  applications  needing  the 
lowest  vibration  level  possible,  the  slotless 
permanent  magnet  machine  fed  by  a  pulse  width 
modulation  controlled  voltage  inverter,  seems 
to  be  a  most  efficient  solution  Cl).  In  this 
article,  a  noddling  and  simulation  method 
adapted  to  such  a  drive  systea  is  presented. 
In  order  to  take  into  account  the  power 
subdivision  required  by  high  powered  drive, 
the  general  case  of  a  multistar  and  polyphased 
systea  is  considered. 

I  -  GENERAL  ELECTRIC  EQUATION: 

The  general  electric  scheme  of  the 
studied  structure,  which  is  made  up  of  a  e- 
star  c-phased  armature,  is  given  on  Figure  1. 


The  different  stars,  electrically 
independent  and  magnetically  coupled  are  fed 
by  e  inverters  including  a  phases.  The 
switches  of  a  saae  inverter  leg  are 
unidirectional  for  voltage  and  bidirectional 
for  current. 

Neglecting  the  voltage  switching 
duration,  the  applied  voltages  on  the  machine 
different  phases  are  entirely  defined  by  the 
inverter  working  sequences.  In  the  case  of  the 
mecanic  steady  state  (constant  rotating  speed) 
the  systea  behaviour  is  entirely  described  by 
the  electric  equation  which  is  given  in  a 
aatrical  fora  for  all  the  cachine  phases  by; 

IV)  -  [R)  [IJ  +  ^  (<D)  (1) 

The  unicoluan  vectors  [V],  (I)  and  [<&], 
including  e.a  components,  respectively 
correspond  to  voltage,  current  and  flux  in  the 
stator  araature.  The  diagonax  matrix  iRj, 
corresponds  to  the  winding  resistance. 

The  determination  of  the  currents  fro? 
Equation  (1)  knowing  the  applied  voltages,, 
requires  the  calculation  of  flux  («!>].  An 
expression  of  (<!>}  can  be  obtained  from  an 
analytical  field  calculation  in  the  machine 
magnetic  structure. 

II  -  MAGNETIC  STRUCTURE  MODELLING: 

In  the  case  of  a  smooth  pole  and  slotless 
stator  machine,  the  magnetic  structure  can  be 
modelled  by  a  two-dimensional  analytical  field 
calculation  method  in  a  (r, 8)  co-ordinates 
system.  As  shown  on  Figure  2,  the  studied 
structure  can  be  divided  into  several, 
concentric  layers  corresponding  to  the 
magnetic  materials  and  the  field  source^ 
(magnets  or  currents) . 


Stator  yoke 
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Figure  2  -  Magnetic  structure. 
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Figure  1  -  General  electric  scheme. 
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To  the  extent  that  the  thickness, and  the 
permeability  of  these  layers  are  constant,  an 
analytical  solution  of  Poisson’s  equation  can 
be  obtained  in  the  different  (Sk)  zones.  The 
development  into  Fourier's  series  of  the 
fonctions  defining  the  magnet  and  current 
spatial  distribution  allows  the  determination 
of  the  vector  potential  expression  in  every 
zone  [2] . 

The  vector  potential  non  nil  component, 
for  a  suffix  layer, .is  given  by: 

+oo 

ftk  -  Zrx9krWP«%rWP^«r,Jei^  (2) 

*=•<» 

where  the  coefficient  Xgk,  Ygk  and  the  Zgk 
functions  are  calculated  from  the  structure's 
geometrical  and  electronagnetic 
characteristics,  p  is  the  pole  pair  number. 

Elaborated  magnet  arrangements  and 
winding  structures  can  be  taken  into  account 
in  a  quite  simple  way  by  this  method: 
multiblock  poles  with  a  magnetization  either 
radial  or  parallel  to  any  direction,  multistar 
polyphased  windings  with  complex  distributions 
of  conductors... 

Ill  -  FLUX  DETERMINATION: 

The  different-  electromagnetic  quantities 
defining  the  working  of  the  machine  can  be 
analytically  determined  by  the  above  vector 
potential  expression.  Particularly  "  flux  in 
any  winding  phase  (nth  phase  of  the  hth  star) 
is  given  by: 

<Dhn(t>  -  L.  J  A4(r,0,t)  .Chn(O)rd0dr  (3) 

(S4) 

where  A4,  function  of  the  space  variables  r,  0 
and  of  the  tine  t,  represents  the  vector 
potential  norm  in  the  currents'  layer.  Chn(0) 
function  represents  the  angular  distribution 
of  the  considered  phase  conductors,  L  is  the 
axial  length  of  the  machine. 

By  substituting  the  Fourier's  series 
defining  A4  and  Chn  in  Equation  (2) ,  and  after 
integration,  the  flux  can  be  put  in  the  form: 
o  m 

<Dhn(t>  *  0Ahn(t)+S  X  Mhnii.Iu(t)  (4) 

i-1  1-1 

This  flux  is  divided  into  a  no  load  flux 
0Ahn(t),  corresponding  to  the  magnet 
contribution,  and  an  armature  reaction  flux, 
Mhnii  representing  the  mutual  inductance 
between  the  considered  phase  and  the  1th  phase 
of  the  ith  star,  in  which  ..he  In(t)  current 
flows.  The  ®khTi(t)  and  Mhnil  quantities  are 
developed  into  Fourier's  series  of  which 
coefficients  depend  on  the  geometric  and 
magnetic  structure  characteristics. 

Considering  all  the  machine  phases,  the 
flux  vector  ultimately  becomes: 

[d>l  -  tOA)+mjm  +  lbsJUl  <s> 


where  the  no  load  flux  (4>A)  and  the  inductance 
matrix  [MJ  are  analytically  defined  from  the 
magnetic  structure  modelling. 'The  [L3]  matrix 
makes  it  possible  to  take  intOj account  the 
magnetic  leakages  in  the*  winding  overhangs, 
which  are  not  considered  by  the  two  dimensions 
modelization.  Consequently,  the  general 
electric  equation  (1)  which  governs  the 
evolution  of  the  -currents  in  the  machine 
windings  is  perfectly  defined  and  becomes: 

iyi-^[4>Al  -  JBJ  (6) 

Knowing  the  voltages  applied  by  the 
inverter,  and  the  electromotive  force 
determined  oy  its  analytical  expression. 
Equation  (5)  solution  makes  it  possible  tc 
determine  the  currents,  in  the  machine- and  to 
deduct  the  space-time  evolution  of-  the 
different  electromagnetic  quantities  (flux 
density,  torque,  force  density  in  the 
winding...)  from  the  modelization  of  the 
magnetic  structure. 

IV  -  ELECTRIC  EQUATION  SOLUTION  METHOD: 

Equation  (b)' can  be  pur  in  state  form: 

jrm  -  (Aim  +  [bhui  n> 

with: 

(X)-(Il,  [U1-[V)-“[<J>A],  [Bl-ILs+M)"1. 

(A)  —  (B).(R) 

To  the  extent  that  (A)  and  (Bl  matrix  are 
independent  of  time  ana  of  vectors  (X)  and 
(U),  a  suitable  method  for  the  solution  of 
Equation  (6)  is  the  matrix  exponential  method 
(3).  The  general  solution  of  Equation  (6)  then 
takes  the  form  of: 

t 

[X(t)  I-eIA|  •***«’ .  IX <t0)  ]  +  Je,A1  (t",) .  (B) .  IU(T)  Jdt 


which,  if  the  calculation  step  is  chosen  in 
such  a  way  that  the  vector  (U)  remains 
constant  during  this  period,  leads  to  the 
exact  recurrent  analytical  expression  of  the 
solution: 

(X(to+h)  J"  e|A,h  [X<t0>) 

+  |A)_1.  <e|A,h-|l] )  .  IB) .  IU(to) )  (9) 

where  (1)  is  the  unitary  matrix. 

According  to  the  inverter  control  type 
adopted,  the  calculation  step  can  be  chosen 
either  variable  or  constant.  So,  if  the 
inverter  command  is  a  priori  known,  it  is 
interesting  to  place  the  calculation  points  at 
the  supply  voltage  switching  instants.  If 
these  instants  are  near  enough,  the  e.m.f 
variations  are  negligible  on  that  time  scale 
and  the  command  vector  (U)  can  be  considered 
as  constant  between  two  switching  instants. 
This  solution  allows  the  minimization  of  the 
number  of  the  calculation  points,  but  requires 
an  exponential  matrix  calculation  for  every 
new  step. 
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If  the  switching  instants  are'^'a  'priori 
unknown,  ^wh'icif  happens  in  ^particular  'when  the 
daverter  command  conies  froa'.a  regulation  of 
the  currents  in  the  machine#  the  calculation 
step' needs  to  be  chosen  sufficiently,  fine  to 
limit  the  switching' ^errors .  The  time 
calculation  rise  due  to  the  increase  of  the 
calculation  points  is  limited'by  the  adoption 
of  a  constant  calculation  step  which  requires 
only  one  matrix  exponential  calculation. 

Jr 

V  -  EXAMPLE:' 

'To  illustrate  the  proposed  method,  the 
case  of  a  high  powered  four-star  three-phased 
sinusoidal  waveform  structure  has  been 
considered.  The  four  -PWM  three-phased 
inverters  .being  controlled  by  a  sinusoidal 
modulation,  the  current  waveforms  obtained  in 
two  different  configurations  of  the  structure 
has  been  studied. 

The  first  configuration  corresponds  to  a 
in-phase  four-star  winding  and  to  a 
synchronized  control  of  the  four  inverters.  As. 
shown  on  Figure  3,  the  phase  current  ripple  is 
rather  weak,  in  as  much  as  the  frequency 
modulation  here  is  100  times-higher  than  the 
f  !:r.  dimes  tnl-  frequency. 


The  second  configuration  corresponds  to 
the  case  where  the  four  stars  are  spatialy 
shifted  from  one  another.  The  fundamental 
currents  of  the  different  stars  having  to  be 
shifted  too,  tile  control  of  the  four  Inverters 
here  is  such  that  the  voltage  switchings  in 
the  different  stars  are  not  simultaneous.  So, 
during  a  star  voltage  switching,  the  other 
stars,  connected  to  the  direct  voltage  source, 
act  as  an  amortisseur  winding.  The  transient 
inductance  which  corresponds  to  a  leakage 
inductance  between  the  different  stars  is 
noticeably  weaker  than  the  synchronous 
inductance  involved  in  the  previous 
configuration.  As  shown  on  Figure  4,  the 
current  ripple  obtained  is  far  higher. 


,  CONCLUSION: 

In  this  paper,  a  modelling  and  simulation 
method  based  on  an  analytical  field 
calculation  associated  with  a  matrix 
exponential  solution  of  the  electric  equation 
has  been  presented.  The  proposed  method,  which 
considers  the  general  case  of  a  multistar 
polyphased  system,  allows  the  taking  into 
account  of  quite  complex  inductor  and  winding 
structures.  By  adopting  a  constant  or  variable 
calculation  step  according  to  the  inverter 
working  type,  the  matrix  exponential  solution 
technique  enables  the  minimization  of  the 
calculation  tine?.  Moreover,  thanks  to  its 
analytical  nature,  by  shoving  the  influence  of 
the  different  design  parameters,  the  proposed 
nodelization  method  has  turned  out  to  be  a 
particularly  performing  optimization  tool. 
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ABSTRACT  The  induction  generator  excited  by  a  current 
source  inverter  is  analysed.  Tuis  system  can  be  seen  like  a  DC 
machinc^o  the  sliding  contact  circuit  theory  is  used  to  develop  a 
new  dynamical  mold. 

Using  that  model,  the  steady-state  and  transient  behaviours  are 
presented.  Experimental  results  are  ut  agree  with  the  theoretical 
ones. 

I.  INTR0PUCT1QN 

The  induction  generator  excited  by  a  current  source  inverter  is 
a  system  proposed  by  Sato  and  al  in  reference  i 1  ]. 

There  are  two  ways  to  study  that  system :  I)  the  machine  with 
its  equations  is  constrained  to  the  conditions  given  by  the 
electronic  circuits,  so  the  overall  model  results  of  the 
compatibility  of  the  relations  which  govern  the  components.This 
was  done  in  reference  (I),  where  the  steady-state  study  was 
presented ,  2)  the  system  is  considered  a  new  electrical  machine 
for  which  it  is  possible  to  apply  the  equation  of  sliding  contact 
circuits  [2]. 

In  this  paper,  a  new  dynamical  model  is  obtained  with  the 
second  method.  The  steady-state  as  well  as  some  transient 
behaviours  arc  presented.  The  theoretical  results  arc  in  agree  with 
the  experimental  ones.  So  the  mode!  is  suitable  for  analysis  and 
synthesis. 

n.  SYSTEM  MODELLING 

The  experimental  system  is  presented  in  FIG.  1.  The  induction 
generator  is  excited  by  a  current  source  inverter.  The  load  is  a 
diode  bridge  supplying  a  DC  circuit. 


The  system  presented  inFIG.l  can  be  seen  like  a  commutation 
machine.  The  current  source  inverter  and  the  diodr  bridge  are  two 
JDC  inputs,  sliding  on  a  commutator, FIG.2  The  rotor.windings 
are  represented  by  a,  (5  circuits. 


FIG.2*. Equivalent  system. 

Using  th<,  sliding  contact  circuit  theory,  the  stator  equations 
arc  given  by: 


U 


is " 


. . •> 


0-0.. 

>1. 


5Wcn 

60„ 


0) 


O-0J1 


Uj,*rj.*2.  +  -3p  + 


60„ 


(2) 


Where:  \Vcm  is  Ihe  system  co-energy 

Vl ,  and  are  the  stator  (luxes. 

The  fictitious  brushes  of  the  commutation  circuits  are  not 
fixed.  Their  variation  laws  depend  of  convert  physic. 

The  position  of  CD  brushes  (current  inverter)  on  the 
commutator  is  fixed  by  the  control  circuit  (3)t 


i|,»llref)  (3) 

The  AB  brushes  are  related  with  the  diode  bridge,  so  they  are 
placed  on  a  commutator  position  of  maximum  voltage.  To  obtain 
this  condition  wc  use  a  test  circuit  v-ith  voltage  U’21  and  position 


O', 2  HI: 


(4) 
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The  rotor  behaviour  is  represented  by  two  quadrature 
.windings  a$>\ 

.  df,<.  * 

(5) 


°‘,«i«+-dr 


°mVt 


(6) 


Where;  Va  and  ^  are  the  rotor  fluxes. 

The  inverter  control  is  done  as  FIG.3  shows,  so  in  this  case 
equation  (3)  ,i  e,  the  inverter  frequency  is  given  by.  * 


(7) 


FIG.3-ControI  circuit! 

In  normal  operation  voltage  Uj*  is  closely  zero  and  the 
position  of  CD  brushes  is  near  90*,  the  stator  windings  are  not  in 
quadrature  to  permit  the  inverter  DC  current  control  (excitation). 
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Fig.6  -  Output  current  evolution  afler  a  load  perturbation  (ARj) 
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In  the  following  figures,  we  present  some  results  for  the 
steady-state  and  the  transient  behaviour.  The  theoretical  results 
were  obtained  with  the  developed  model.  These  and  the 
experimental  data  are  concerned  with  standard  2  kW  induction 
machine  squirrel-  cage. 


IV.  CONCLUSIONS 

The  equation  of  sliding  contact  circuits  has  been  used  to  study 
the  commutation  machines.  Also  in  this  paper,  this  theory  was 
used  to  obtained  a  dynamical  model  of  the  induction  generator 
with  current  source  inverter  exciter. 

To  validate  that  model,  an  experimental  system  was  built.  The 
theoretical  results  are  in  agree  with  the  experimental  ones.  So  the 
developed  model  is  suitable  for  the  analysis  and  synthesis  of 
experimental  systems. 
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ABSTRACT—  For  the  analysis  and  design  of  the  tubular 
actuator  (TA),  an  equivalent  model  derived  from  the  similar 
model  used  for  the  low  speed  flat  linear  actuator,  is  presented.  To 
test  the  theory,  two  experimental  TAs  were  designed  and  built  by 
the  author.  The  results  are  presented  and  discussed.  Using  the 
maximum  value  of  force  per  power  input  at  standstill,  as  an 
optimum  design  criterion,  the  optimized  design  formulas  are 
proposed. 

LIST  OF  SYMBOLS 

D  •  mean  diameter  of  the  rotor  aluminium  sheet,  m 

De «  maximum  diameter  of  co-axial  coils,  m 

Di  •  minimum  diameter  of  co-axial  coils,  m 

dc  -  standard  wire  copper  diameter,  m 

g  -  actual  airgap  length,  m 

It  -  r.m.s.  phase  current,  A 

Nf  number  of  turns  per  phase 

p  •  number  of  pole  pairs 

Rj  -  stator  winding  resistance,  ft 

R2  -  rotor  aluminium  resistance  referred  to  the  stator,  ft 

s-  slot  width,  m 

t «  rotor  aluminium  sheet  thickness,  m 

Xm  -  magnetizing  reactance,  ft 

Zc*  number  of  turns  per  coil 

p  -  volume  resistivity  of  the  aluminium,  ft.m 

pc  •  volume  resistivity  of  cooper,  ft.m 

Tp  *  pole  pitch,  m 

t4*  slot  pitch,  m 

to  <■  angular  supply  frequency,  1/s 

J.. INTRODUCTION 

The  small  TA  Is  an  electric  piston,  and  compared  with 
compressed  air  and  hydraulic  systems,  is  more  ^ad van tageous 
because  no  compressors  nor  conducts  are  necessary.  Besides,  the 
TA  is  very  reliable  and  without  maintenance.  The  actual 
applications  of  TA  are  [1,  2, 3];  Pumps,  valves,  bottle  transfers, 
oscillators  and  lifts.  Other  applications  arc  proposed  [4]:  Sliding 
doors  and  curtain  operators,  robotics,  weaver's  looms,  switch  • 
pointer  drives,  nuclear  reactors  and  aeronautics. 

2,  .EQUIVALENT-MODEL 

The  TA  steady  -  state  operational  characteristics  may  be  derived 
and  analysed  from  a  simplified  equivalent  model  per  phase 


shown  in  ftg.  !.  The  core  losses’and  leakage  flux  have  been 
neglected.  The  topology  of  this  equivalent  circuit  is  similar  to 
those  proposed  by  Nix  and  Laithwaite  ( 1 ),  Nasi  r  and  Boldea  [2], 
Davis  [5],  and  Groot  and  Heuvelman  (6],  but  in  Ref.  [2,6)  the 
stator  leakage  flux  is  considered. 


Fig.  1  -  Equivalent  circuit  per  phase  at  standstill,  neglecting 
core  losses  and  leakage  flux 

However,  the  expressions  for  TA  parameters  Rj,  Xm  and  Rj  are 
different  from  author  to  author.  These  parameters  may  be 


determined  by  the  following  equations  [2,4); 

R,-2pt(l/dt!)(Dc+D,)N, 

(1) 

X„-(6pcl0)/ii)(TI,D/pKeg)N; 

(2) 

R:-(67tD/Tpp)(p/t)Ni 

(3) 

The  expression  of  standstill  force,  or  thrust,  derived  from  the 
equivalent  circuit  as  well  as  from  Maxwell's  equations,  is  given 
by  1 1,4, 7), 


F-  18s2pD(p/i)(l!/o>)(^/r/[QV(l  +Qj)]k 

where  Q  Is  the  Laithwaite’s  goodness  factor  [1]  and  Kc  is  the 
Carter's  coefficient  (8).  Q  and  Kc  are  expressed  as, 

Q-(r'p0ojt)/(nlpKeg)  (4a) 

Kc-t,/[t,.sV(s  +  58)]  (4b) 

The  inclusion  of  Kcis  important  and  necessary  In  the  calculation 
of  force  [6,7]. 

The  force  correction  factor  K[  1,7)  is  a  function  of  pole  number 
only,  and  becomes  simply  equal  to  1  for  even-pole  TAs 
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Figs.  2,  3  show  the  comparison  between  measured  results  and 
the  theoretical  curves,  obtained  in  the  test  of  prototypes  TAI  and 
TAII  (see  Appendix).  The  degrees  of  error  that  result  of  this 
comparison  are:  1  -  Power  factor  -  TAI:  9%  including  Kcand 
10%  not  including  Kd  TAII:  2%  including  Kcand  18%  not 
including  K-  2  -  Standstill  force  -  TAI:  12%  including  Kcand 
50%  not  including  K^ TAII:  8%  including  K«  and  40%  not 
including  Kc.  Consequently,  the  importance  of  Kc  in  the 
calculation  of  force  is  corroborated  by  ,  these  results.  A 
comparison  between  experimental  and  theoretical  values, 
including  Kc,  shows  a  good  agreement,  confirming  the  validity 
of  equivalent  circuit. 


Fig.  2  -  Power  factor  against  r.m.s.  phase  current  at 
standstill  (SO  Hz) 


Ijl  A)  IjlA) 

Fig.  3  •  Standstill  force  against  r.m  s .  phase 
current  (SO  Hz) 


For  the  following  imposed  data:  t-0.00 15  m,  g-0.0020  m,  Kc* 
1.5,  Q  -  1,  <o-314  1/s,  from  (4a),  rp-0.039  m  [4J.  Thus,  the 
optimized  design  formulas  becomes: 

R,-3.4.10',(l/d*)(D«+P,)N, 

(5) 

-x„-r2-io-:W)n? 

(6) 

F-  9.I0'4(iyp)(N,II)2K 

(7) 

This  optimum  design  criterion  is  based  in  fact  that  the  maximum 
value  of  force  per  power  input  is  maximum  at  standstill  when 


Q*1  [1, 2, 4, 5]. 

^CONCLUSIONS 

From  analytical  equivalent  circuit,  whose  validity  was  tested 

experimentally  by  means  two  prototypes,  a  simple  design 

formulas  provide  a  reliable  method  to  the  optimized  design  of 

TAs  for  standstill  applications. 
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APPENDIX.  Details  of  experimental  tubular  actuators  TAI  and 

TAII: 


TAI 

TAII 

TAI 

TAII 

2p 

4 

3 

D,  mm 

17.5 

43.3 

zc 

400 

400 

De,mm 

60 

90 

Nl 

1600 

1200 

I>„mm 

28 

54 

tp,  mm 

43.5 

43.5 

g,  mm 

4.50 

4.25 

s,  mm 

13 

13 

t,  mm 

1.5 

1.3 
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Abstract  -  This  paper  investigates  the  use  of 
two  different  simulation  methods  of  converter 
fed  electrical  machines.  The  first  one  called 
global  simulation  method  needs  only  the 
knowledge  of  the  converter  diagram,  the 
different  firing  sequences  and  the  machine 
parameters.  The  second  one,  based  on 
sequential  analysis  of  the  converter  operating 
sequences  uses  an  automatic  generator  of 
numerical  simulation  program  of, electro¬ 
mechanical  process. 

I.  INTRODUCTION 

An  analytical  study  of. power  electronic 
system  including  static  converter  can  be  done 
when-the  different  working  sequences  are 
known,. and  when  their  successions  in  time 
domain  are  predicted.  Otherwise,  numerical 
simulation  tools  become  helpful  mean  to 
investigate  the  behaviour  of  such  systems. 
These  programs^can  be  classified  into  two 
groups.  In  the  first  one,  the  simulation 
method  discovers  the  .functionning  of  the 
converter,  step  by  step,  from  the  only 
knowledge  of  the  converter  diagram  and  the 
semiconductors  firing  sequences.  In  the  second 
one,  the  different  working  sequences  are 
previously  known  in  order  to  limit  tests. on 
succession  of  them.  The  two  packages  which  are 
presented  in  this  paper  belong  to  these  two 
groups:  SACSO-Machine  Cl),  an  automatic 
program  for  the  simulation  of  converter  fed 
machine  is  a  part  of  a  family  (SACSO,  SCRIPT) 
still  develcpped  actually:  GASPE  (2),  a 
simulator,  generates  automatically,  from  the 
previous  knowledge  of  the  different  operating 
sequences,  a  simulation  program  of  any 
control-converter-machine  assembly. 

After  the  presention  of  these  two  packages, 
simulation  results  of  the  functionning  of  an 
inverter  fed  synchronous  motor  (Fig.  i)  show 
the  advantages  and  the  drawbacks  of  employed 
simulation  methods.  These  comparisons  lead  to 
point  out  the  best  using,  field  of  both 
packages. 

II.  SACSO-MACHINE  PACKAGE 

SACSO-Machine  is  composed  of  three ‘main 
parts:  a  data  unit,  where  is  described  the 
converter  topology,  the  control  of  the 
semiconductors  and  the  machine  parameters,  a 
simulation  processor  of  the  machine  converter 
set  and  a  result  post  processor.  The 
semiconductors  are  considered  as  resistances 
whose  values  depend  on  if  they  are  blocked  or 
conducting.'  Each  sequence  of  the  converter  is 
described  by  a  state  equation  which  depend  on 
the  topology  and  the  states  of  the  semi¬ 
conductors.  Combining  this  equation  with  the 
electrical  machine  equations  expressed  in  a 
Park  reference  frame  leads  to  an  equation 
which  represents  the  functionning  of  the  whole 
system 
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Fig.  1.  Circuit  diagram. 

dX/dt  =  A  .  X  (1) 

in  which  the  X  vector  is  composed  of  the 
converter  state  variables,  of  the  currents  in 
the  machine  coils  (in  the  Park  reference 
frame)  and  of  a  term  representing  the 
supplying  sources  of  the  machine-converter 
set.  Equation  (1)  is  solved  in  a  form  of  a 
recurrent  equation 

X(t+T)  =  Exp(A.T)  .  X(t)  (2) 

In  order  to  reduce  the  simulation  time  which 
essentially  depends  on  the  matrix  exponential 
calculation,  one  can  transform  equation  (1)  in 

dZ/dt  =  D  .  Z  (3) 

D  being  a  diagonal  matrix,  the  exponential 
matrix  is  then  obtained  analytically  from  the 
exponential  of  the  eigenvalues  of  A. 

In  order  to  accelerate  the  transient  state,  a 
computation  of  the  steady  state  solution  based 
on  the  Newton  Raphson  method  is  implemented  in 
the  SACSO-Machine  package. 

III.  GASPE  METHODOLOGY 

GASPE  generator  has  been  designed  in  order  to 
study  power  electronic  systems  which  are 
generally  composed  of  an  association  of  a 
static  converter,  an  electric  machine  and 
control  circuits.  Actually,  it  generates  a 
simulator  which  >8  made  of  a  main  program 
body,  a  simulation  management  system  and  a 
block  procedure  to  detect  all  occurences  of 
events  and  updating  variables  (time,  currents, 
semiconductor  states...). 

After  analysing  the  system  to  be  simulated, 
the  first  step  consists  to  subdivide  the 
system  into  blocks  with  respect  to  its 
function.  For  the  considered  problem  there  are 
a  Motor  Block  for  the  synchronous  motor  (hero 
a  non  salient  one  modelling  with  the  Park 
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transformation) ,,  a  Source  Block  to. represent, 
the  current .source  which  is  assumed  here  to  be 
constant  and  a  Thyristor  Control  Block  to' 
generate  the  firing  signals  cf  the  thyristors 
according  to. the  rotor  position. 

In  order  to  use  GASPE  methodology,  a  set  of 
informations  in- a  specific  language  has  to  be 
written.  The  Grammar  File,  which:specif ies^the 
type  of  variables  and  the  nature  of. events  for 
each  block. 

For  the  simulation  of  current  inverter  fed 
synchronous  motor,  a  fast  convergence < process 
is  implemented  in  the  simulator  in  order  to 
reach  rapidly  the  steady  state. 

IV.  SIMULATION  RESULTS 

Simulation  results  of  steady  state  operation 
of  the  inverter  fed  machine  are  presented  in 
Fig.  2.  The  research  of  the  steady  state 
obtained  from  the  two  programs'  is  shown*  in 
Fig.  3.  The  steady  state  is  obtained  after  one 
period  with  GASPE  due  to  the* particular 
implemented  method. 

In  SACSO-Machine  package,  the  resolution 
method  is  based  on  the  calculation  of  a  matrix 
exponential.  This  method  has  the  advantage  to 
allow  an  analytical  solution  in  the  form  of  a 
recurrent  equation  where  the  calculation  step 
does  not  depend  on  the  time  constants  of  the 
system.  Nevertheless,  this  method  is  not 
really  compatible  with  the  study  of  mechanical 
transient  operation,  neither  the  modelling  of 
a  salient  pole  machine.  However  this 
simulation  method  allows  to  study  unusual  or 
unpredictable  working  sequences  (Fig.  4). 

In  the  simulator  generated  by  GASPE,  the 
functionning  of  the  converter  has  to  be 
defined  previously.  Then,  unusual  working 
sequences  cannot  be  studied.  However,  this 
package  takes  into  account  salient  pole 
machine  (Fig.  4)  and  sophisticated  control 
loops. 


V.  CONCLUSION 

This  paper  investigates  the  use  of  two 
simulation  methods  to  study  the  behaviour  of 
synchronous  machine  supplied  by  a  current 
source  inverter.  The  advantages  of  the  two 
methods  have  been  point  out.  SACSO-Machine 
program  becomes  necessary  when  the. working 
sequences  are  unusual  or  Unpredictable,  either 
in  transient  state  or  following  a  fault. 
Programs  developped  from  the  GASPE  methodology 
do  not  use  a  so  sophisticated  resolution 
method  and  then  allow  to  study  salient  pole 
machines  or  complicated  control  loops 
including  mechanical  transient  functionning. 
However  these  two  packages  are  complementary, 
the  advantages  of  one  make  up  for  the 
drawbacks  of  the  other. 
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Fig.  3.  Steady  state  research,  SACSO-Machine 
(left)  and  GASPE  (right). 


Fig.  4.  Missing  of  a  firing  pulse  of  a 
thyristor  with  SACSO-Machine  (left)  and 
salient  pole  machine  with  GASTE  (right). 
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Abstract  -  This  paper  discuss,  for  FM  synchronous 
actuators,  the  liaits  of  application  of  the  control 
eethod  which  keep  the  direct  axis  eoaponent  of  the 
armature  current  equal  to  zero,  and  describes  two 
control  algorithas  which  maintain  the  power  factor ^as 
close  as  possible  to  unity.  These  algorithas  have  been 
evolved  with  a  view  to  an  easy  hardware  implementation. 
The  perforoances  of  the  proposed  control  algorithas  have 
been  validated  by  digital  simulation. 

1.  INTRODUCTION 

For  persanent  magnet  synchronous  actuators,  the  control 
eethod  consisting  of  keeping  the  direct  axis  component 
of  the  arsature  current  equal  to  zero  allows  to  use  very 
simple  control  algorithas  11,2).  This  eethod  is  the  cost 
cocoonly  used,  but  can  strongly  deteriorate  the  power 
factor  if  applied  to  sose  type  of  buried  cagnet 
synchronous  actuators. 

Control  cethods  which  ioprove  the  power  factor  are  core 
intricate  because  they  need  a  non  linear  control  law. 
However  these  control  cethods  reduce  the  stator  voltage 
allowing  to  use  a  lower  rated  voltage  inverter.  On  the 
other  hand,  as  oodern  percanent  cagnet  actuators  can 
reach  tens  of  kV  (3),  to  keep  a  good  power  factor  allows 
to  ioprove  the  efficiency. 


In  these  equations 

-  l6  and  la^»re  the  self  inductances  of  the  d  and  q 
2 mature  equivalent  windings 

-  R4  is  the  resistance  of  the  araature  dq  windings 

-  K,  the  torque  constant 

-  0a  the  angular  position  of  the  rotor  aad  $a  its 
velocity 

-  s  the  Laplace  operator. 


a  b 

Fig.  1.  Schematic  representation  of  PM  synchronous 
actuators. 


3.  THEORETICAL  STUDY 


This  paper  determines,  in  accordance  with  the  rotor 
saliency,  the  Units  of  application  of  the  control 
eethod  which  keep  the  direct  axis  eoaponent  of  the 
araature  current  equal  to  zero,  and  describes  two 
control  algorithas  which  aaintain  the  power  factor  as 
close  as  possible  to  unity.  These  algorithas  have  been 
evolved  with  a  view  to  an  easy  hardware  iapleaentation. 
The  perforoances  of  the  proposed  control  algorithas  have 
been  validated  by  digital  siaulation. 


The  control  strategy  and  the  rotor  geoaetry  affect  the 
perforoances  of  the  systea  and  the  size  of  the  inverter 
feeding  the  actuator.  Three  control  strategies  are 
investigated  :  the  first  one  consists  of  keeping  id-0, 
the  second  one  of  keeping  cos**l  and  the  third  one  of 
keeping  the  RMS  value  of  the  stator  voltage  proportional 
to  the  rotor  speed.  Sooe  other  theoretical 
considerations  which  don't  appear  in  this  study  are 
given  in  reference  (5). 


2.  ACTUATOR  MODELLING 

Permanent  cagnet  synchronous  actuators  came  in  two 
configurations  according  to  their  rotor  geoaetry  (figure 

0  m  : 

-  PM  synchronous  actuators  with  surface  counted 
magnets  which  schematic  representation  is  shown  in 
figure  l.a  have  no  significant  saliency  effects. 
Their  saliency  coefficient  p  *  lJld  is  nearly  equal 
to  unity. 

-  Buried  PM  synchronous  actuators  which  scheaatic 
representation  is  shown  in  figure  l.b  have  a  saliency 
coefficient  which  can  reach  values  as  high  as  5. 

For  a  two  pole  actuator  without  damper  windings,  the 
Park  equations  reduce  to  : 

va  _  -h,  l, 

V,  ‘  i,  Ld  R,U,  s 

T*.  ■  K,  i,  ♦  (Ld-l,)id  i„ 


In  order  to  simplify  the  theoretical  study,  the  currents 
ta  and  i  are  replaced  by  the  variables  I,  and  0 
according  to  equations  (2). 


i.  -  JTT7" 


0  -  arctg  (-  — ) 


(Z-a) 

(Z-b) 


0 

vd 

♦ 

a) 

v„ 

K,  3. 

.it.. 

I,  is  the  RMS  value  of  the  stator  current  (multiplied  by 
43/2  for  a  three  phase  actuator)  and  0  is  its  leading 
angle  with  respect  to  the  rotor  q  axis. 

With  these  variables,  equations  (1)  become  in  steady- 
state  : 


K  I, 


■KK 

R, 


I,  sln0 
I,  cos0 


(3) 


2  is  obtained. 
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Fig.  2.  Vector  diagras. 

The  electrccragsetfc  torque  becoses 

T„  -  I,  [Kj.cosfi  +  |  (P-1)  l„  I,  Jin2JJ 

where,  p  -  —  is  the  saliency  coefficient. 
The  co tor  power  factor  is 


I^-order  tc  Steep  t be  pater  fader  always  equal  to  eaity, 
it  is  necessary  to  achieve  the  follcwicg  relatioa 
betweea  1*  aad  fi. 

'l».  ■  tia* -  ,8, 

J  sin*0  +  p  cos*£ 

Ooe  drawback  of  this  netbod  is  that  the  relation  between 
the  tc rqa&iad  the  current  is  a  eon  linear  coe,  as  shown 
in  ffgsre  4. a. 

This  figure  shows  also  that  the  torque  decreases  beyond 
a  certain  value  of, I*.  Kevertheless  a  lot  of  actuators 
have  a  rated  current  lower  than  that  value  of  I*. 
Hwever  it  is  possible  to  saintafn  the  torque- current 
relation  roughly  linear  by  keeping  fi  constant  beyond  a 
critical- current  judiciously  chosen.  This  is  achieved 
u\  a  scall  deterioration  of  the  power  factor,  but  it 

recains  better  than  with  the  control  keeping  i^O  for 
actuator  with  p  >  1. 

Figure  4,b  shows  the  power  factor  and  the  torque 
characteristics  when  0  is  kept  constant  beyond  a 
critical  value  of  the  current  ij. 


cos*  •  co~,{6-0) 


tg  j  ^  -  {5) 

_  \  V*  cos£  "  K-  L<J  K  +  Kr  K 
These  equations  are  normalized  as  follows 

.  ki-.  t»  .  i-is. 

*  ‘  K? 

By  neglecting  the  voltage  drop  on  the  amature 
resistance  R,  in  comparison  with  the  back  enf.  we  can 
rewrite  (4)  and  (5)  in  function  of  the  normalized 
variables  I"  and  T*  : 


T"  •  I!  [cos/1  *  |  (p-1)  i;  sin2« 


p  ■:  cos/i 
1  -  i;  sin/J 


In  this  control  cethod  the  direct  axis  cocponent  of  the 
stator  current  id  doesn't  exist.  The  relation  between  the 
torque  and  the  current  is  linear  and  this  allows  to 
evolve  very  siople  control  algorltha  (1,2). 

Figure  3  shows  that  the  power  factor  decreases  strongly 
with  the  load  for  actuator  with  a  saliency  coefficient 
greater  than  the  unity. 


Fig.  4.  Torque  and  power  factor  with  the  cos*»l 
control  cethod. 


To  keep  the  RHS  value  of  the  stator  voltage  constant  and 
equal  to  the  back  eaf,  it  is  necessary  to  achieve 
relation  (9). 

r- .  — -  (9) 

$in70  +  p 7  cos70 


Figure  5  shows  the  power  factor  and  the  torque 
characteristics  for  this  control  cethod. 


Fig.  5.  Torque  and  power  factor  of  the  control  with  RHS 
Fig.  3.  Power  factor  with  the  id»0  control  cethod.  value  of  the  stator  voltage  proportional  to  the  speed 
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Fig.  6.  General  overview  of  the  proposed  controller. 


The  torque- current  relation  Is  practically  linear 
whereas  the  power  factor  reaains  acceptable. 

4.  SYNTHESIS  OF  THE  DIGITAL  CONTROLLER 


The  control  with  cost-I  and  the  control  keeping  the  RHS 
value  of  the  stator  voltage  proportional  to  the  speed 
can  both  be  achieved  by  the  controller  shown  in  figure 
6. 


The  difference  between  the  two  control  eethods  is  in  the 
content  of  the  block  called  ’coaputing  of  id,  iq*. 

For  the  control  with  cosi-1,  this  block  deterames  if  I* 
is  smaller  or  bigger  than  the  chosen  critical  value  lc 
of  I*.  Then  it  cosputes  id  and  ij.  The  algoritha  is  the 
following. 

If  I,  *  Ic»  the  solution  of  (8)  : 


sin£ 


I  -  Jl  -  4  g(l-p)i;  p 
2  i;o-/» 


(10) 


If  I*  >  Ie,  p  reaains  constant  and  corresponds  to  the 
value  cosputed  froa  (10)  when  l'«Ie. 

For  the  control  with  RHS  value  of  the  stator  voltage 
proportional  to  the  speed  $  is  determined  by  solving 
(9).  This  yields  (11). 


sinj3 


I  -  Jh!(1V)I!  />a 
IIO-P1) 


do 


In  both  cases,  id  and  i*  are  cosputed  froa  I*  and  /J. 
id  -  -I*  sfn£ 

.  02) 
•  I*  cos0 

To  reduce  the  coaputing  tiae,  it  is  possible  to  tabulate 

the  values  of  i!  and  i*  as  a  function  of  I*  according  to 
equation  (10)  or  (11).  * 

id  and  iq  are  the  references  values  for  two  current 
regulators  which  are  here  two  PI  regulators.  The 
synthesis  of  these  two  regulators  is  easy  if  a 


decoupling  between  the  two  axes  is  realized  through  an 
appropriate  state  feedback.  The  values  of  Vd  and  Vq  are 
obtained  by  adding  to  the  outputs  of  the  current 
regulators  supplesentary  terns  cosputed  by  the 
decoupling  feedback  unit. 

Furtheroore  a  feedback  on  the  speed  regulator  s  output 
of  a  well  chosen  tern  proportional  to  the  speed  K$ 
inproves  strongly  the  insensivity  of  the  systea  against 
load  variations. 

The  Park  and  inverse  Park  transform  involved  in  the 
control  algoritha  are  performed  without  lost  of  tine  by 
using  tabulated  values  of  y-x,  cos(x2)  [1,6). 

Another  way  to  control  the  actuator  currents  consists  in 
controlling  each  phase  current  by  a  local  feedback  loop 
acting  on  the  corresponding  leg  of  the  inverter  feeding 
the  cotor.  The  probleas  related  to  these  local  feedback 
loops  have  been  investigated  in  reference  [7). 

The  two  proposed  control  eethods  are  core  intricate  than 
the  control  with  id-0.  Indeed  the  control  with  id«0  can 
be  achieved  by  using  only  one  PI  regulator  for  the  speed 
and  one  proportional  asplifier  for  the  control  of  iq, 
since  the  condition  id-0  eay  be  ensured  by  an  appropriate 
decoupling  feedback  (2).  Furthermore  if  the  electrical 
tice  constant  oay  be  neglected,  the  proportional 
asplifier  is  no  oore  necessary  and  it  is  also  possible 
to  avoid  current  oeasureoents  [1). 

5.  PERFORMANCES  OF  THE  SYSTEM 

Simulations  allow  to  cospare  the  performances  of  the 
three  control  aethods  discussed  in  this  paper.  The 
parameters  of  the  sisulated  actuator  are  the  following 
ones. 

R  -  1  ft  K,  »  2,06  V. S/rad 

£  -  0,038  K  J  -  0,08735  kgaJ 

Lq  -  0,197  H  K|  •  0,00342  Nas 

Its  saliency  coefficient  p* 5  is  high.  Figure  7  shows  the 
response  of  the  actuator  to  a  step  in  the  speed 
reference  and  to  the  application  of  a  positive  step  of 
torque  (nearly  equal  to  70%  of  the  rated  torque)  at  t«4s 
and  a  negative  one  at  t*6s.  If  the  speed's  performances 
are  similar  for  the  three  control  algorithas,  figures 
7.c  and  7.d  show  the  inprovecent  of  the  power  factor 
achieved  by  the  proposed  control  algorithas. 
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c.  Control  with  cos$*»l  d.  Control  with  RMS  value 
of  the  stator  voltage 
proportional  to  the  speed 

Fig.  7.  Sioulation  of  the  three  control  methods. 

6.  CONCLUSION 

In  this  paper,  two  control  algorithas  which  allow  to 
keep  the  power  factor  as  close  as  possible  to  unity  have 
been  proposed.  These  two  control  algorithas,  quasi 
equivalent,  are  a  little  core  intricate  than  the  one 
which  achieve  the  cocconly  used  control  with  i^O.  But 
their  hardware  icplecentation  can  be  done  without 
ieportant  additional  difficulties. 


The  use  of  control  algorithas  which  caintain  the  power 
factor  close  cf  unity  is  justified  only  for  actuators 
with  a  salfency  coefficient  significatively  greater  than 
unity,  as  buried  Hi  synchronous  actuators.  The  1^0 
control  sethod  reaains  the  cost  interesting  one  for 
actuators  without  saliency. 
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Abstract  -  A  simple  way  for  synthesizing  a  robust  position 
controller  for  a  DC  drive  is  presented.  The  applied  method  uses  the 
input  output  linearization  method  to  achieve  a  more  adequate 
representation  of  the  system.  The  robustness  is  assured  by  the  use 
of  the  sliding  mode  operation. 

L  INTRODUCTION 

In  this  paper  is  presented  a  principle  of  synthesizing  a  robust 
position  controller  for  a  DC  drive  using  sliding  mode.  This  method 
enables  to  define  in  an  easy  way  the  commutation  law  that  assures 
required 'performances  and  the  robustness  of  the”  system.  This 
commutation  law  supposes  that  a  correct  acquisition  of  the  position, 
speed  and  acceleration  is  possible.  Concerning  implementation,  the 
acceleration  value  may  put  some  practical  problems,  because  a  large 
frequency  bandwidth  transducer  is  not  available  and  noise  level  do 
not  enable  us  to  compute  the  speed  derivative.  To  overcome  this 
drawback,  an  acceleration  observer  is  used. 

Experimental  results  show  the  good  performances  achieved 
with  the  proposed  method  for  position  regulation  and  for  position 
tracking. 


"dT“c“ 
i'u 
dt  'r» 

^r-A,fc„er.)-Bro 


u-Vsgn(S)  ;  S-K,e,+  Kaea+Krer  (J) 

The  dynamic  of  the  system  is  now  represented  by  equation 
(6),  if  the  sliding  mode  control  is  effective. 

«,+ 2  ,6, 


HI.  IMPLEMENTATION 


II.  THEORETICAL  STUDY 


The  system  is  based  on  a  separated  excited  DC  motor  and  is 
represented  by  the  ecpiat  ions  (I),  where  T(t)  is  the  load  torque. 


dto 
dt  " 


K 

1 


}t(0 


dO 

dT 


CO 


(1) 


This  model  results  from  the  application  of  circuit  theory  laws 
mid  is  deliberately  simplified.  In  practice,  the  magnetic  circuit  is  not 
linear  and  there  exists  parameter  variations. 

The  model  described  by  equations  ( 1)  is  simple.  However,  it 
is  not  the  more  adequate  to  synthesize  a  robust  position  controller. 
For  this  purpose,  it  is  useful  to  chose  another  state  variables:  the 
position  0,  speed  to  and  acceleration  y.  It  is  important  to  refer  that 
the  acceleration  reports  a  Targe  quantity  of  information",  because 
this  variable  contains  information  about  the  current  and  also  about 
the  load  torque.  This  new  model  is  obtained  by  the  dipheomorfism 
represented  by  equations  (2),  which  could  be  obtained  in  an 
systematic  way  like  refered  in  ( 1  ]  and  f  2). 


0-0 


to- to 

V-}(K,i.T(0) 


Using  the  error  values  (3),  the  system  will  be  represented  by 
equations  (4). 

e,-0rf-0  c„-0rf.«  c,-0rf.y  {3) 

With  the  result  stated  by  (4)  and  using  as  applied  voltage  the 
value  given  by  (5)  where  the  commutation  law  S  is  a  linear  function 
of  the  errors,  a  robust  tracking  position  control  is  achieved.  This 
result  is  in  according  with  another  one  theoretically  presented  in 
reference  (3). 


As  indicated  by  equations  (3)  and  (4)  and  concerning 
implementation,  it  is  necessary  to  measure  the  acceleration  value. 
This  may  put  some  practical  problems,  because  a  large  frequency 
bandwidth  transducer  is  not  available  and  noise  level  do  not  enable 
us  to  compute  the  speed  derivative. To  overcome  this  drawback,  an 
observer  for  the  external  input  (torque  T  and  also  parameter 
variations)  is  used.  This  signal  is  combined  with  the  Hue  measure  of 
the  current  to  achieve  an  approximate  value  of  the  acceleration. 

The  model  of  this  observer,  also  used  in  reference  15),  is  as 
follow: 


5--g5  +  gKtoi  +  gaJ  a 
T„,-5-gJo 

»-j(  **>*-*««) 

where  g  is  a  theoretically  free  chosen  parameter. 

IV.  EXPERIMENTAL  RESULTS 


Experimental  results  were  obtained  using  a  1.5  kW  separated 
excited  DC  motor  supplied  by  a  four  quadrants  GTO  DC/DC 
converter  The  output  variable  0  is  measure  by  a  shaft  incremental 
encoder.  To  measure  the  speed  it  was  also  used  this  encoder  and  a 
dedicate  electronic  circuit  16).  The  perturbation  torque  T  is  generated 
by  a  gearbox  connected  pendulum,  as  the  one  presented  on  figure  I. 


Fig.  1  *  Pendulum  used  as  motor  load. 

The  error  signal  S  of  equation  (5)  with  the  current  limitation  is 
formed  using  a  simple  electronic  circuit  like  the  one  presented  on 
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figure  2  by  a  block  diagram.  However,  one  must  note  that  the 
solution  presented  in  this  figure  is  only  enough  for  regulator 
applications-  Indeed; when’ the  reference  changes,  Lc.  for  tracking 
applications/ It  is" necessary  to  consider  the  first  and  second 
derivatives  of  the  reference,  as  it  is  shown  by  the  experimental 
results  in  figure'1 

tf*) 


Fig.  2  -  Frror  calcuUhig  block  diagram  fora  regulator  application. 


a)  first  and  second  derivatives  of  the  reference  are  not  considered. 
S  S0f — 


1 

M 

o-S0 


0  1.25  2.S  3. 75  S 

Time  C*) 

b)  first  and  second  derivatives  of  the  reference  arc  considered 


Fig.'5  -  Position  and  current  evolution  when  a  step  reference  is 
applied.  '  ' 

In  figure  6  the  error  is  presented  when  a  50%  variation  on  field 
current  is  made.  The  result  shows  the  insensitivity  to  K$  parameter, 
wich  variation  is  compensed  by  a  large  current  value. 


Fig.  3  -  Experimental  recording  of  the  position  error  when  a 
sinusoidal  reference  is  applied  [0^-  0.3  sin(2nV2.5)l. 

There  are  minor  motives  that  cause  the  error  presented  in 
figure  3-b),  for  instance  the  frequency  limitation  in  the  DC/DC 
converter,  and  the  approximate  rnesurc  of  the  speed  However,  this 
error  is  mainly  due  to  low  pass  filter  characteristics  of  the 
acceleration  observer  In  figure  4  this  fact  is  underlined  by  the  error 
evolution  when  a  sudden  action  at  the  pendulum  is  made. 


0  1.25  2.5  3.7S  5 

Time  Cs) 


Fig.  4  -  Position  error  and  motor  current  evolution  when  a  sudden 
load  is  applied  to  the  motor  shaft. 

There  is  another  important  source  of  error  for  the  system.  This 
happens  when  the  compatibility  of  the  reference  variation  with  the 
physics  of  the  motor  is  not  assured.  In  figure  5  is  presented  the 
error  evolution  when  the  described  situation  is  created  by  a  step 
input  signal.  Also  in  this  figure  one  can  see  the  current  limiting 
action. 


Fig.  6  -  Error  and  current  evolution  when  a  field  current  variation  is 
generated. 

V.  CONCLUSIONS 

Theoretical  study  and  experimental  results  show  that  the 
proposed  principle  for  synthesizing  a  position  controller  is  easily 
applied  and  gives  good  performance  for  regulation  or  tracking 
applications.  Insensitivity  concerning  parameters  and  supply 
voltages  changes  and  also  perturbation  due  to  load  variations 
achieved  by  the  application  of  the  sliding  mode  control  with  a 
suitable  choice  of  the  commutation  law. 

Restrictions  in  the  performances  are  only  imposed  by  the 
physics  of  the  system  and  the  availability  of  direct  mesures  of  the 
state  variables.  At  a  secondary  level  restrictions  arc  also  imposed  by 
the  execution  of  the  control  electronics. 
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Abstract  -  This  paper  deals  with  the  stability  analysis 
of  anelectrical  drive  system  based  on  a  PM  synchronous 
actuator  with  field  oriented  control  when  field 
orientation  is. performed  through  a  decoupling,  state 
feedback.  It  is  shown  that  the; use  in  the  decouplfng 
state  feedback  of  predicted  values  of  the  actuator 
currents  instead  of  measured  ones  improves  substantially 
the  stability  of  the  system. 

1.  INTRODUCTION 

This  paper  deals  with  the  stability  analysis  of  an 
electrical  drive  system  based  on  a  PM  synchronous 
actuator  with  field  oriented  control  when  field 
orientation  is  performed  in  open  loop  by  using  an 
appropriate  decoupling  state  feedback  [1,2). 

As  field  orientation  is  performed  in  open  loop,  its 
effectiveness  depends  on  the  uncertainties  on  the 
actuator  ^parameters  and  on  the  errors  on  the  measured 
or  computed  values  of  the  speed  and  the  currents. 

The  aim  of  this  paper  is  to  investigate  the  effect  of 
these  uncertainties  on  the  stability  of  the  system.  The 
study  is  performed  by  considering  surface  mounted  PH 
actuators  which  present  negligible  electrical  time 
constants. 

The  main  result  arising  from  the  study  is  that  the 
stability  of  ■  the- system  is  substantially  improved  by 
using  for  the  computation  of  the  decoupling  feedback 
predicted  values  of  the  currents  instead  of  measured 
ones. 

2.  ACTUATOR  MODELLING 

The  Park  equations  cf  a  PM  synchronous  actuator  without 
damper  windings  reduce  to 

R44ld  s  -P$.  lq.  id  0 

».<•„  «,  *  K,  S,  U) 

•  K,  \  ♦  di-L,)!,.  i,  (2) 

In  these  equations 

’  L,j  and  Lq  are  the  self  inductances  of  the  d  and  q 
armature  equivalent  windings 

-  R4  is- the  resistance  of  an  armature  winding 
»  Kt  the  torque-constant 

-  0m  the  angular  position  of  the  rotor  and  its 
velocity 

-  2P  the  number  of  poles 
*  s  the  Laplace  operator. 

The  corresponding  block-diagram  1$  given  in  full  line  in 
figure  1.  In  this  figure  Jr  is  the  load  torque  and  0  is 
the  inertia  of  the  actuator's  rotor  and  of  the 
mechanical  load. 

3.  CONTROL  STRATEGY 

In  tVvdq  frame,  field  orientation  is  achieved  when  id  is 
equ  ,  to  zero.  This  can  be  performed  in  open  loop  by 
in  reducing  an  appropriate  state  feedback  (shown  in 
b  .ken  line  in  figure  1).  According  to  this  state 
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Fig.  I.  Block  diagram  of  a  PM  synchronous 
actuator  without  dampers, 

feedback  the  d  axis  voltage  oust  be  equal  to  : 

•  -P  K  l,  i,  (3) 

When  the  d  axis  current  is  equal  to  zero,  the  block 
diagram  of  the  q  axis  becomes  similar  to  that  of  a  DC 
oachine  and  the  speed  can  be  controlled  by  using  a  PI 
controller  which  generates  the  q  axis  voltage. 

The  above  mentioned  strategy  leads  normally  to  a  digital 
controller.  An  example  of  implementation  organised 
around  two  8  bits  MCS  8051  microcontrollers  Is  described 
in  reference  (I). 

4.  STABILITY  ANALYSIS 

With  the  selected  control  strategy,  the  value  of  the  d 
axis  current  depends  directly  on  the  precision  in  the 
generation  of  the  voltage  Ud  according  to  equation  (3). 
In  order  to  investigate  the  effects  on  the  system 
stability  of  an  error  on  the  value  of  Ud,  it  is  assumed 
that  this  voltage  is  determined  by  using  an  estimated 
value  of  lq,  affected  by  an  error  Alq. 

Li  -  l,  ♦  M,  (4) 

In  this  case,  if  the  discretisation  inherent  to  digital 
control  and  the  small  electrical  time  constants  of  the 
dq  axes  are  neglected,  one  gets  the  block  diagram  of 
figure  2  for  the  q  axis. 

It  can  be  seen  in  this  figure  that  the  error  on 
introduces  a  positive  feedback  loop  if  Al^O.  As  this 
feedback  depends  of  the  square  of  the  rotor  speed,  the 
tendency  of  the  system  to  be  unstable  increase  rapidly 
with  the  speed. 

By  linearising  about  a  steady  state  working  point  the 
equations  corresponding  to  the  block  diagram  of  figure 
2,  one  gets  the  following  equation  : 


«d 


Al-  < 


KT  +  G 


(8-b) 


— P’  25  i 
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Fig.  2.  Block  diagraa  of  .the  q;axi$’ 
with  an  unperfect  decoupling. 
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In  equations  (8)  G  is  the  gain  of  the  PI  controllers. 
One  oust- notice  that  condition  (8. a)  is  identical  to 
condition  (7. a).  Conditions  (8.b)  and  (8.c)  are  less 
restrictive  than  (7.b)  and  (7.c). 

Figure  3  shows  the  stability  lioits  according  to 
equations  (81a),  (8.b)  and  (8.c)  for  the  system  given  in 
appendix.  It  can  be  seen  that  at  high  speed*(6000  RPM), 
an  error  of  1.76%  in  excess  on  L,  is  sufficient  to  cause 
instability. 
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In  thisequation,  ill,,  6iq,  S5„  and  «Tr  are  the  variations 
of  the  different  variables  about  their  steady  state 


The  systen  shown  in  figure  2  is  intrinsically  stable  if 
the  real  part  of  the- pole  of  the  transfer  function  (5) 
is  negative.  This  implies  the  following  conditions  on 
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Condition  (7. a)  which  is  ir.dependant  of  the  current  iq  is 
the  most  constraining  one  as  it  depends  from  the  inverse 
of  the  speed's  square. 

In  paragraph  3  it  has  been  mentioned  that  the  speed 
control  may  be  achieved  with  a  PI  controller.  The  design 
of  this  controller  has  been  described  in  reference  [lj. 
The  application  of  the  Routh  Hurwitz  criterion  to  the 
closed  loop  system  yields  the  following  conditions  for 
the  stability  of  the  system. 
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Fig.  3.  Stability  limit  in  the  (Al^/lq,  speed)  plane. 

Simulations  of  the  system  have  allowed  to  confirm  the 
above  pentionned  theoretical  results.  In  these 
simulations*  the  sampling  period  of  the  digital 
controller  has  been  taken  $»?.!!  enough  (0,5ms)  in  order 
to  reduce  its  influence  on  the  behaviour  of  the  system 
since  it  has  been  neglected  in  the  theoretical  analysis. 
Figures  4. a  and  4.b  show  the  response  to  a  step  in  the 
speed  reference  from  0  to  6000  RPM  and  to  the  sudden 
application  of  *  load  torque  equal  to  the  rated  torque 
at  t«0,3s. 

In  figure  4. a,  the  decoupling  feedback  is  computed  by 
considering  L*« 1 , 0 1 LQ i  at  6000  RPM  the  system  remains 
stable  even  if  one  may  notice  some  oscillations  during 
the  transient.  In  figure  4.b,  the  decoupling  feedback  is 
computed  by  considering  L*«1,025L4  and  according  to  the 
theoretical  analysis  the  system  becomes  unstable. 

It  is  worthy  of  note  that  the  condition  {7, a),  (or 
(8. a)),  is  corresponding  to  an  electrical  instability. 
Indeed,  if  one  assumes  that  the  speed  is  constant  one 
gets  from  figure  2  : 


a  b 

Figv  4.  Simulation  of  the  dynamic  behaviour  of  the 
system  using  a  measured  value  of  ig. 
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7he  denominator  of  (9)  is  positive  when  condition  (7. a) 
is  met. 


Equation  (11)  doesn't  introduce  any  limitation  on  the 
maxi bus  error  on  Lg.  -  V 

If  the  mechanical  part  of  the  system  is  taken  into 
account,  the  curves  corresponding'  to  the  limit  of 
stability  in  the  (AL^Lq,  speed)  plane  become  those  of 
-  figure  ’5.  It  caff  be  easil/  seen  that  -»the  system 
stability  is  strongly  increased.  This  is.  clearly 
apparent  in  figure  6. .  Indeed,  this  figure  shows  the 
response  of  the  system’to  a  step  in 'the  reference  speed 
and’ to' the  sudden' application  of  a  load  torque  equal  to 
the  rated  torque  at  t»0,3sr  by  assuming  an  error.- of  50% 
on  the  estimated  value  of  Lq  (ALyO,5Lg).  .It  can  be  seen 
that  the  system  remains  stable.^ 


5.  DECOUPLING  BASED  ON  AN  ESTIMATED  VAtUE  OF  iq 

By  replacing  in  the  computation  of  Ud  the  measured  value 
of  ig  by  an  estimed  value  iq  (1,2) 

1,-2^  do 

one  gets,  if  the  speed  is  assumed  to  be  constant  ; 
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Fig.  5.  Stability  lioit  in  the  (4l^tq,  speed)  plane. 


fig.  6.  Simulation  of  the  dynamic  behaviour  of  the 
system  uding  an  estimated  value  of  1,. 


6.  CONCLUSION 

In  this  paper,  one  has  Investigated  the  stability  of  an 
electrical  drive  system  based  on  a  PH  synchronous 
actuator  with  field  oriented  control  if  field 
orientation  is  performed  through  a  decoupling  state 
feedback.  It  is  shown  that  it  is'  necessary  to  use,  in 
this  feedback,  a  predicted  value  of  the  g  axis  current 
instead  of  a  measured  one  In  order  to  avoid  a  to  high 
sensitivity  of  the  system  to  modelling  uncertainties  as 
concern  its  limit  of  stability.  The  theoretical  results 
have  been  validated  by  digital  simulation. 
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APPENDIX 


Parameters  of  the  simulated  actuator  : 
Rated  power  :  2  kW 
Rated  speed  :  6000  RPH 
P  •  3 
R,  •  0,550 
ld  •  2,2  mil 
l,  •  2,2  oH 
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K,  •  0,297  Kst/A 
0  -  6.10‘*  kgaJ 
K  •  9,5. 10-5  Nas/rad 
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Abstract  -  An  on-line  interactive  FFT  calculation  of  the 
harmonic  spectra  of  the  voltage  arid  current  variables  in  a 
VSI-IM  rail  traction  drive  simulation  is  presented.  The 
implementation  is  on  a  parallel  computer  and  is  user-driven 
by  a  'freeze  and  display'  command.  Simulation  results  are 
give  for  the, spectra  of, the  line  and  motor  current  waveforms 
during  syhcnronous  PWM  operation  of  the  drive, 

I.  INTRODUCTION 

A  major  simulation  task  in  a  VSI-IM  rail  traction  drive  is  to 
determine  the  harmonic  perfoimance  as  the  VSI  modulation 
strategy  is  changed.  Consequently,  most  industrial 
simulators  based  on  PCs  or  workstations  incorporate  a 
harmonic  analysis  facility  to'  continuously, display  the  spectra 
of  the  main  electrical  and  mechanical  variables  throughout 
the  traction  duty  cyde.  The  problem  addressed  in  this  paper 
is  to  provide  a  flexible  selection  facility,  under  interactive  user 
control,  for  the  extraction’ of  harmonic  information  during  the 
simulation  of  a  such  a  traction  drive. 

In  the  paper,  a  parallel-computer  based  VSI-IM  rail 
traction  drive  simulator  is  outlined,  The  software  structure  to 
enable  on-line  window  definition  and  extraction  of  FFT- 
derived  spectra  in  real  rime  Is  then  descnbed.  The  procedure 
is  earned  out  under  user-interactive  control  via  a" freeze,  arid 
display  facility.  Sample  results  for  the  harmonic  spectra  of 
input  and  motor  currents  during  synchronous  PWM  operation 
in  the  acceleration  period  of  the  drive  are  then  presented. 

II.  CONVERTER-MOTOR  SYSTEM  MODELLING 

The  VSI-IM  rail  traction  drive  modelled  is  illustrated  In  Figure 
1  It  is, operated  under  practical  conditions  with  a  three-part 
duty  cycle.  From  standstill  to  motor  base  speed,  the  IM  is 
driven  at  constant  flux;  this  is  followed  by  constant  power 
morion,  with  a  final  region  of  reduced  power  up  to  maximum 
speed.  To  achieve  this  duty  cyde,  the  inverter  is  operated  by 
PWM  to  the  base  speed  (asynchronous  PWM  at  startup, 
followed  by  synchronous  PWM),  and  thence  by  quasi-square- 
wave  modulation.  In  the  PWM  region,  various  modulation 


schemes  are  available  which  have  the  effect  of  reducing  or 
-minimising  harmonic  distortion  of  thesuppty  current  and 
■motor  voltage  waveforms.  However, -  these  involve  sudden 
change'  of  ' the  modulation  ,  ratio  and/or  index  at  certain 
speeds,  which  leads  to  poor  harmonic  performance  at  the 
'changeover  point. 

OC  Unk  Voftig^-sourc*  inverter 


Fig.  1.  VSI-IM  rail  traction  drive. 

The  converter-motor  system  model,  fully  described  in 
reference  ID,  simulates  the  filter,  VSI  and  IM.  The  IM  is 
modelled  in  the  rime  domain  by  sotting  up  and  solving  the 
motor  differential  equations.  Transformation  to  a  fixed  D-Q 
axis  model  Is  then  achieved  giving  the  model  equation  in 
terms  of  the  Laplace  operator  p  as 

|V).-.|[R]+8[G)+[LW(I)  (1) 

where  [VJ,  (I)  are  the  direct  and  quadrature  variable  matrices, 
[RJ,  !L),  [Gl  are  the  elements  of  the  IM  rotor  and  stator 
equivalent  circuits,  and  O'  is  the  rotor  speed.  The  input  DC 
filter  Is  modelled  as  part  of  the  IM  impedance  matnx,  thus 
including  the  system  input  current  and  DC  link  voltage  as 
additional  state  variables.  The  solution  of  Equation  (1)  is 
obtained  by  the  method  of  eigenvalues.  The  model  is  then 
solved  for  each  of  the  six  VSI  device  conduction  modes  using 
nodal  equations  and  matching  boundary  conditions. 

Typical  results  for  startup  from  rest  are  shown  in  Figure  2. 
The  transition  from  asynchronous  to  synchronous  PWM  at 


q  5T  I  '  9  0  0  t  0.4  0  4  0  4  I  oo  0  4  9  4 

n*»w 

>*  4 


•  0  9  »  9  4  0*  0  1  I  9  9  0* 

r*»W 

Fig.  2.  Simulator  outputs  for  one-second  traction  drive  acceleration  penod. 
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about  0  4  s  is  dearly  shown,- as"  also  are  variations  in'iriput' 
current  and  torque  as  the  drive  accelerates. 

111.  ON-LINE  FFT  GENERATION 

The  simulator  has  been  implemented  on  a  68020-based 
parallel  computer  [2).  The  task  structure  is  shown  in  Figure  3. 
The  procedure  to  initiate  a  FFT  calculation  is  to  initialise  the 
system  by  keyboard  command,  and  to  define'the  time  window 
-cf,  interest  by  selecting , the. starting,  instant  ,  the  sampling 
interval  and  the  number  of  points, ,  The  master  program  then 
informs  ttie  .calculatorvtask-.that  the,  FFT  calculation -is 
underway.  The  parallel  structure  of  the  simulator  ensures  the 
FFT  and  model  calculations  proceed  simultaneously, 
although  the  FFT  itself  is  calculated  serially. 


Fig.  3.  Simulator  task  organisation. 


Computation  of  the  FFT  is  based  on  the  Cooley-Tukey  Fig.  5.  System  input  and  motor  line  currents  and  FFT  spectra 
algorithm  (3)  developed  by  Brigham, (4).  The  algorithm 

implementation  is  acheved  by  a  main  program,  which  is  a  The  system  input  and  motor  line  currents  in  Figure  2  have 
routine  of  the  simulation  model,  with  a  subroutine  to  perform  been  windowed  as  shown  and  subjected  to  FFT  analysis, 

the  Fourier  analysis.  The  main  program  passes  the  data  set  The  results, Figure.5,  show  the  motor  line  current  as  almost 

(signal  amplitude,  total  window  angle)  and  the  number  of  sinusoidal,  with  more  harmonics  in  the  input  current.  Shown 

points  (in  binary)  to  the  subroutine,  where  the  sine/cosine  are  the  amplitude  of  the  most  significant  harmonic,  and  the 

value  table  is  set  up,  and  the  calculations  and  sums  modulation  frequency  (2.9  Hz),  real  time  (0.871  s),  and 

performed,  the  results  being  returned  to  the  main  routine.  calculation  time  (45  s). 

The  procedure  requires  the  input  function  to  be 

represented  in  2N  points  (Figure  4).  An  N-pomt  transform  is  IV.  DISCUSSION  AND  CONCLUSIONS 

used  to  compute  the  real  and  imaginary  parts,  Xr(n)  and  X|(n), 

of  the  discrete  Fourier  transform.  Measurement  of  periodic  signal  parameters  Is  generally 

subject  to  errors  from  spectral  leakage  and  harmonic 
interference.  Normally,  interpolation  algorithms  are  used  to 
minimise  these  errors.  Careful  choice  of  sampling  frequency 
and  truncation  envelope  is  thus  required.  The  uso  of  flat-top 
windows,  as  described  by  Salvatore  and  Trotta  (5),  is 
currently  being  investigated  to  reduce  errors. 

The  simulator  has  proved  to  be  a  convenient,  flexible  tool 
which  has  enabled  harmonic  production  at  switching 
transitions  between  various  forms  of  PWM  to  be  investigated. 
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Abstract. 

A  new  strategy  with  varying  sequences  for  thyristors 
converters  suitable  for  digital  control  is  presented. 

Modelling  (In  sampled'  data  sense)  is  given  and  a 
synthesis  method  is  proposed. 


Introduction:  Analog  versus  Digital  Control  of 
Converter, 

Analog  control  of  thyristors  converters  (In 
rectifier  or  inverter  mode)  has  a  long  history  and  Is 
well  known.  Classical  modelling  and  synthesis  with 
linear  modelling  are  often  used,  but  rlgourus  modelling 
of  these  devices  leads  to  non-linear,  sampled  data 
equations  (/3/,  */4/,  /5/)„  The  most  remarquable 
property  is  that  the  converter  realizes  in  fact  an 
analog-digital  conversion;  the  Instant  of  sampling  is 
identical  with  the  instant  of  control,  so  the  system 
has  no  delay.  This  is  a  good  property  with  regard  with 
the  stability  This  is  particvlary  Important  for  fast 
regulations  as  current. regulations  which  are  here  taken 
into  consideration, 

With  digital  control  the  problem  becomes  different.  We  see  that  this  delay  varies  between  0> and  three 

The  designer  must  choose  a  strategy.  Conceptually,  the  periods  of  the  converter.  The  delay  does  not  exist  in 

most  easy  is  an  “hybrid  stategy"  where  the  digital  analog  control.  And  It  is  well  known  that  a  delay  has 

component  “simulate"  an  analog  control:  with  a  bad  effects  on  the  stability  of  closed  loop  systems, 

frequency  as  high  as  possible,  tho  digital  component  This  is  partlculary  dangerous  here  because  the  delay 

generates  the  same  signals  as  the  analog  component.  can  be  greathor  than  one  period  (see  fig. la).  And  when 

This  strategy  has  some  good  properties-  (no  delay,  good  the  delay  is  very  large,  the  digital  control  can  lead 

stability),  but  the  microprocessor  is  always  computing  to  regulation  having  a  very  poor  stability  margin, 
the  signals.  So  it  as  no  time  for  other  activities 

relating  to  high  level  and  this  is  good  only  If  the  That  is  the  reason  why  we  propose  a  new  direct 

control  frequency  is  effectively  high  with  regard  with  digital  control  with  varying  sequence  which  is  choosen 

the  frequency  of  tho  converter  (300  Hz  In  Europa).  This  so  that  the  delay  is  always  smaller  than  a  period  of 

strategy  is  not  optimal  but  it  has  some  advantages  and  the  controller.  Fig. lb  gives  the  principal  of  the 
is  often  used.  control;  the  synchronization  Instant  is  “adaptative" 

and  depends  on  the  value  of  the  firing  angle  (In  the 

The  strategy  proposed  here  will  employ  some  classical  sense).  Fig. 2  defines  the  varying  sequences 
advantages  of  digital  control:  at  each  period  of  the  We  observe  4  different  sequences, 

converter  the  digital  component  makes  the  measurement  (1)  0  sH  */6  (2)  k/6  s  *  <  n/2 

one  time  and  the  control  is  computed  also  one  time.  So  (3)  n/Z  *  ♦  <  5x/6  (4)  5n/6  a  ¥  <  r 

the  digital  component  has  time  for  other  nctlvitates  In  this  case,  the  delay  Is  always  smaller  as  one 

But  the  questions  are.  Choice  of  the  sampling  Instants,  period,  thus  with  slmplo  algirithms  (P. I.  for  example). 

Determination  of  the  contol  algorithm,  which  is  deduced  we  can  have  good  dynamical  behaviour  (deadbeat  respons, 
from  tho  properties  of:  The  modelling  of  the  system.  for  example). 

A  SPECIFIC  STRATEGY  FOR  THYRISTORS  CONVERTER  CONTROL ,  SAMPLED  DATA  MODELLING. 

Then  we  propose  a  mathematical  method  to  model  the 
system  in  the  sens  of  the  sampled  data  system  theory, 
and  wo  deduce  from  this  modelling  a  method  for  compute 
the  parameters  of  the  controller  (by  choosing  the 
location  of  the  poles  ).  We  consider  small  perturbation 
around  the  steady-state,  and  we  note; 

l(tn)  ■  l(n)  «  iw  ♦  6i(n) 

Thus  a  structural  diagram  of  the  system  is  given  by 
fig, 3a.  the  supply  (thyristors  converters  and  pulses 
generator)  of  a  Resistance- Inductance  (R-L)  Is 
controlled  by  a  digital  control.  An  equivalent  scheme 
Is  given  by  fig, 3b  where  we  distinguish:  the  electrical 
tite  constant  T  ■  L  /  R  and  the  delay  of  the  control; 


For  thyslstors  converters  we  must  remark  that  at 
each  Instant  It  Is  possible  to  fire  three  different 
thyristors.  With  analog  control  the  sequence  Is  imposed 
and  normally  only  one  thyristors  Is  accessible.  For 
digital  controllers  the  freedom  is  more  important  and 
the  three  thyristors  are  effectively  eligible. 

The  most  immediate  strategy  is  the  following:  we 
choose  tho  “natural  firing  Instant*  (cf./l/).  The 
figure  i  shows  that  the  delay  tf  between  the  sampling 

instant  and  the  effective  control  instant  (thv  firing 
Instant)  depends  on  the  firing  angle: 

r-  t  -  */o  (1) 
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reset  to  zero  at  the  end  of  each  period  (cf./3/).  Thus 
we'  can  control  the  mean  value  of  the  current,  which  is 
in  practice  the  .value  of  Interest. 

Fig  4  present  the  structural  diagram  and  the 
equivalent  scheme  of  the  current  control  with 
measurement  of  the  mean  value  (k^  and  kj2  are  the 

-;gains  of  the*  A/D :  converter.  Fig.S  gives  the  snape  of 
the  .current  and  of  J  (the  current  integral).  The 
modelling  of  this  system -is  the  following.  Tne  current 
equation  Is  the  same  as  previous.  The  Integral  is 
directly  defined  In  sampled  data  sense: 

J(n*l)  *  f  tft*'i(t).dt  (5) 

Jtn 

It  can  be  demonstrated  that  the  equation  relative 
to  small  perturbation  around  an  operating  point  Is 

SJ(n*l)  *  «.5I(nM)  ♦  0.5i(n)  (6) 

with  «  ■  T  r(e  /e  -  1),  0  *  T  .(1-e  ).  Then  the 

"  o  l  O  ©  l 

complete  modelling  of  the  system  Is  given  by  (4)  and 
by; 


with:  V-  -kp.(eo.(Ha.KJ)  -  (1-0.KJ)), 

H  *  1-  yty^l.k  Al  *  a.Kp 
y(0J  *  <e /e  ),(»h(0  )/U.a  (0  ). 

CO  0  1  0)  O  CO 

k  -  K  ,k  .k, ,  k;  -  K,-k  /k  . 

p  p  dj  i  1  1  <12  <11 

The  closed-loop  transfer t  function  Is. 

*K» 


51(2) 


e  «|i  -  >.* 


tp  »  **  tn;  algebraic  calculus  on  rlgourus  (non¬ 

linear)  equations  gives  the  value  of  the  converter  gain 
(In  sampled  data  sense) 

K#  -  -h(tfw)  (2) 

where  h(tfw)  ■  2. sin  m/6,sln  Is  the  discontinuity  of 
the  supply  voltage  at  the  firing  Instant;  the  term  ao 

denotes  the  conversion  coefficient  between  the  output 
signal  u^  of  the  controller  and  the  firing  delay. 

a  «  dt*  /  dv  ,  (3) 

on* 

In  practice,  we  will  consider  two  output  conversions. 

-  linear  ;  u  *  u  (1-2  V/s),  aA  ■  -x  /  2.«.u 

*  *m  0  *a 

-  cosine  ;  u  »  u  cos  ajv>  )  ■  -1  /  u  ,sln  0 

«  u  0  «  *a  <o 

The  transferl  function  relative  to  small 
perturbations  is: 

5i(ntl)  «  eft  51(n)  ♦  (  eA  /  e  ),(-h  /  U.5t*  (4) 

0  0  1  n 

with  e  «  exp(  -  T/T  ),  e  »  exp(-  t  /  T  ). 


CONTROL  OF  THE  MEAN  CURRENT. 

A  problem  Is  dua  to  the  fact  that  on  one  period  the 
current  Is  measured  only  one  time,  so  we  have  no 
sufficient  Information  on  the  value  of  the  current  If 
it  has  a  big  ripple  (this  Is  specially  important  In 
discontinuous  mode).  Thus  we  propose  to  use  the 
Integral  value  (in  analog  sense)  of  the  current  with  a 


We  can  choose  two  roots  equal  to  zero  to  obtain  the 
deadbeat  response.  Thus  the  parameters  must  verify: 
eo*  **  *  0  and  V^”***"  ^  *he  solution  Is. 

K{  -  I/T.a-ere2)S  (9) 

■  <VSM‘-VV/U-eo)  (10) 

We  observe  that  (and  K^)  depends  on  the  operating 
point  (see  fig. 6)  and  Kp  depends  strongly  of  the 

operating  point  if  the  conversion  factor  is  a  constant 
(linear  firing),  but  varies  lightly  If  the  conversion 
factor  Is  a  sine  (cosine  firing,  see  fig. 6). 

Fig,  7  gives  dynamical  experimental  behaviour  of  the 
system.  The  observed  performances  are  good. 


We  have  proposed  a  solution  to  some  problems 
related  to  digital  control  of  thyristors  converters, 
as:  choice  of  the  synchronization  instants  with 
adaptatlve  sequences,  and  how  to  regulate  currents  with 
high  ripple.  We  have  proposed  a  precise  sampled-data 
modelling  and  given  formulas  to  optimize  the  dynamic 
behaviour.  The  modelling  can  be  generalized  to 
discontinuous  mode.  Implementation  validates  the 
theoretical  previsions. 


Flg.3:  Structural  diagram  (a),  and  equivalent  scheme  (b) 
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Fig. 6  :  Variation  of  K. 
and  K  when  the  operating 
pointpvarles.  (mean  value 
control). _ 


Fig.  7.  Dynamic  behaviours  of  the  current  regulation 
(mean  value) 


1551 


Modelling  and  Vottage  Feedback  Control  of  the  Injected  Current  Predictive  Modulator 
of  a  OC-AC  Parallel  Resonant  Converter 


J.  Fernando  Silva,  B.V.  Borges  arid  J.  Santana 


I.S.T.  / D  E  E  C.  UiqirinasElsctrfcas  e  Electronic*  de  Potincla 
Av.  Rovlsco  Pals  1095  Lisboa  Codex 
PORTUGAL 


ABSTRACT  —  No  adequate  model  is  known,  to  the  author,  for 
the  dynamic  behaviour  scidy  of  the  injected  current  predictive  mod¬ 
ulator,  for  a  resonant  DC  AC  convener.  An  approach,  based  into 
predictive  control  principles,  is  used  to  obtain  a  convenient  open 
loop  linear  model  of  the  DCAC  converter,  with  a  parallel  resonant 
high  frequency  link,  suited  to  high  power  levels.  Theoretical  and 
experimental  results  concerning  the  synthesis  and  the  performance 
of  the  voltage  feedback  regulator,  using  the  proposed  model,  are 
presented. 

1.  INTRODUCTION 

The  use  of  the  resonant  principle  in  DC  AC  power  conversion 
has  been  increased  in  the  last  few  years,  leading  to  new,  more  so¬ 
phisticated  and  efficient  topologies.  However,  in  most  of  them,  the 
control  process  for  the  converter  modulator  i*  difficult  to  establish, 
due  to  the  absence  of  a  convenient  open  loop  small  signal  model, 
suitable  for  the  synthesis  of  the  dosed  loop  regulators. 

One  example  is  the  injected  current  predictive  modulator,  em¬ 
ployed  to  control  the  output  current  of  a  cycloconvciter  driven  by  a 
current  fed  parallel  resonant  bridge  invertcr.'.Thc  predictive  modu¬ 
lator  controls  the  converter  output  current,  but,  in  most  environ¬ 
ments,  output  voltage  control  is  also  needed.  In  order  to  achieve 
this,  the  modulator  still  requires  additional  output  voltage 
regulators,  which  must  be  designed  with  the  aid  of  a  suitable  small 
signal  transfer  function.  The  definition  of  this  modulator  transfer 
function  presents  the  interesting  problem  of  building  a  predictive 
control  model,  adequate  for  thc_  output  voltage  regulator 
implementation,  using  simple  linear  feedback  theory.  The  feasibility 
of  this  control  method  has  already  been  demonstrated  with 
experimental  results  presented  by  one  of  the  authors  ( 1 }. 

2.  INJECTED  CURRENT  PR  ED  HIVE  MODULATOR: 

PRINCIPLES  OF  OPERATION 

The  operation  of  this  modulator  can  be  briefly  reviewed  as  fol¬ 
lows.  The  whole  DCAC  converter,  including  the  output  cyclocon¬ 
verter  is  a  line  commutated  circuit.  Hence  semiconductor  commuta¬ 
tion  b  performed  only  at  the  oscillating  voltage  zero  crossing  points 
of  the  current  fed  parallel  resonar.;  bridge  inverter,  in  order  to 
reduce  switching  losses.  Therefore,  the  output  voltage  waveform  b 
made  with  half  sinusoids  of  both  polarities  or  null  voltage  (fig.!). 
Thus,  during  each  half  cycle  of  high  frequency  input  voltage,  the 
cycloconvener  has  only  three  possible  output  voltages  or  modes  of 
operation:  (1)  full  positive  half  cycle  voltage ;  (2)  full  negative  half 
cycle  voltage;  (3)  null  voltage  (freewheeling).  In  the  beginning  of 
each  half  cycle,  one  of  this  three  operation  modes  is  selected  in 
order  to  inject,  in  the  load  and  output  filters,  a  certain  desired 
current. 

Tbb  b  accomplished  by  the  modulator,  whose  current  predic¬ 
tor,  at  the’end  of  each  output  voltage  half  period,  calculates  the  pre¬ 
dictable  variation,  for  the  next  half  cycle,  of  the  cycloconverter  out¬ 
put  current  for  all  the  three  operation  modes.  Current  prediction  is 
accomplished  by  calculating  the  three  output  current  first  time 
derivatives,  due  to  the  three  possible  output  voltages.  The  operation 
mode  for  the  next  half  cycle  is  defined  by  the  companson  of  the 
three  predicted  output  currenis  and  the  reference  or  desired  output 
current  value  ( lnt).  The  control  circuitry  selects  the  mode  that  will 
lead  to  the  nearest  current  value,  and  a  logic  circuitry  triggers  the 
•semiconductor  devices,  with  a  certain  discrete  trigger  angle  (o), 
suitable  to  establish  the  desired  circuit  topology.  This  process  is  re¬ 
peated  each  half  cycle  and  the  decision  taken  can  not  be  changed 
until  the  end  of  the  next  half  cycle. 
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Fig.l  I)  DC-AC  conxttter  wth  output  cycloconverler  and  parallel 
high  frequency  link.  U)  a)  Cyvlocomvrter  input  oscillating 
wltage  and  b)  output  moduhted  i  vltage. 

3.  MODELLING  THE  INJECTED  CURRENT 
PREDmVE  MODULATOR 

The  control  circuit  described  b  a  good  example  of  a  discrete 
system,  but  none  of  the  well  known  models  for  discrete  systems  b 
easy  to  use,  due  mainly  to  the  difficult  calculation  of  the  steady 
state  gain.  The  difficulty  arises  from  the  system  complexity 
(inductive  current  source,  parallel  resonant  LC,  load  and  output  fil- 
te»  ,  with  a  probable  fourth  order  transfer  function  with  a  non 
predictive  non  current  mode  modulator),  coupled  to  the  current 
mode  predictive  modulator,  and  to  the  large  fluctuations  of  the 
oscillating  voltage  amplitude  of  the  parallel  resonant  circuit  These 
large  variations  also  prevent  the  use  of  a  control  method  based  on 
the  principle  of  a  programmed  waveform  P\VM  modulator  12) 

However,  being  a  discrete  system,  with  the  best  possible  tran¬ 
sient  performance  that  can  ever  be  expected,  the  output  current  (io), 
will  follow  the  reference  current  (Iref)  in  the  next  (i+l)  half  cycle 
|3]«  Therefore,  it  can  be  written: 

0o)on#l  (1) 

where  0o)on»i is  ^  oulpm  current  due  to  trigger  angle  on,j 

From  (1),  (i'>)on*rI'efa°"(*A)0||*i-  Usin8  ,hc  Taylor 
series  expansion  c round  the  ttue  solution  on*i,  it  is  obtained  (4): 

j,  ^(o„.ro.)5*.  (2) 

Taking  only  the  first  two  terms  of  the  series,  approximating 
derivative*  by  finite  differences  and  with  some  algebraic  manipula¬ 
tion,  equation  (2)  gives: 

Irrf-<io)Oi,+  %raTm.  0) 

where  (,i0>on  represents  the  actual  output  current  due  to  trigger  an¬ 
gle  o  in  half  cycle  n,  and  Tm  is  this  half  cycle  time  value.  This 
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equation  justifies  why  the  predictive  modulator,  previously  bufil, 
lakes  samples  of  (i0)oa.  zad  of  the  input  and  output  voltages. 
These  samples  allow  the  calculation,  of  the  3  possible  first  time 
derivatives  of  the  output  current  and  corresponding  time  increments 
to  forecast  the  current  (i0)o0*  \ - 

Making  Ircf  “  gm  v- ,  where  vc  is  a  reference  voltage  and  gm  is 
a  practical  constant,  calculated  by  the  ratio  w.T/vt.^v  ,  il  is 
obtained: 

<io>Gi  ■&u'r - jjj'  °  To  .  (4) 


This  equation  shows  that  in  the  steady  state  the  whole  DC-AC 
converter  is  a  transconductance  amplifier  ( i0  -  gnVe ). This  coodu- 
sion  can  be.  supported  by  simulation  and  experimental,  results, 
shown  in'  figure  2. 


a) 


Fig.2  Steady  state  open  loop  transfer  function:;  a)  Computer 
simulation  b)  Experimental  result. 


The  small  signal  model  can  be  obtained  applying  Laplace 
transform  to  the  small  signal  equation  from  (3)  which  gives: 


IM 

v£(iy 


l+sTm 


(5) 


The  previous  equation  means  that  the  converter  can  be  seen  as  a 
transconductance  operational  amplifier,  with  a  dominant  pole 
whose  frequency  is  l/(2nTm)- 

4.  SYNTHESIZING  THE  VOLTAGE  FEEDBACK 
REGULATOR 


The  simple  model  obtained  allows  the  use  of  conventional 
linear  feedback  control  theory  to  implement  the  voltage  regulator 
The  block  diagram  of  the  circuit  uses  a  P.  I.  regulator  (fig  3),  which 
allows  zero  steady  state  error  and  fast  transient  response. 


Fig.3  Block  diagram  of  the  voltage  controlled  system. 


Experimental  results  shown  in  figures  4  and  5  confirm  the  cor¬ 
rect  choice  of  the  P-L  regulator  altogether  with  its  parameters, 
showing  that  the  now  proposed  converter  model ,  although  very  sim¬ 
ple,  is  also  \ery  accurate.  However,  a  discrete  model  shorld  be 
derived  from  this  simple  linear  continuous  model,  for  determining 
the  stability  of  the  converter.  This  subject  will  be  discussed  in  detail 
in  a  future  work. 
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FigJ  Experimental  results:  Reference  voltage  and  output  voltage 
a)  Square  uatr  input  b)  Rise  time  and  Overshoot. 


5.  CONCLUSION 

A  suitable  model  was  presented  for  the  injected  current 
predictive  modulator,  employed  to  control  the  output  current  of  a 
cycloconverter  driven  by  a  current  fed  parallel  resonant  bridge 
inverter.  The  whole  DC-AC  converter  is  theoretically  treated  as  a 
transconductance  amplifier,  and  experimental  results  confirm  the 
validity  of  this  model  and  its  usefulness  to  synthesize  voltage 
feedback  regulator  with  very  good  transient  response. 
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Cancelling  the  pole  due  to  the  load  with  the  regulator  zero,  and 
assuming  a  damping  factor  i>Vz/2  it  can  be  easily  shown,  that  the 
P.I.’  regulator  parameters  are: 


Tun“CoRo  J  Tpm**  2ctvgjnR©Tm  (6) 
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ABSTRACT —  A  linear  small  signal  model  for  the  association  of  a 
half-bridge  inverter  and  a  current  mode  pulse  width  modulator,  is 
introduced.  The  model  is  suitable  to  easily  synthesize  the  feedback 
loops  needed  to  obtain  a  four  quadrant  inverter,  with  GTO  thyristors 
switched  at  1  KHz.  which  then  performs  as  an  operational  amplifier. 
However,  this  continuous  model  is  not  convenient  for  the  stability 
prediction,  so  it  is  also  presented  a  discrete  model,  with  the  same 
steady  state  gair,  and  used  in  determining  the  converter  stability. 
Theoretical  and  experimental  results  concerning  both  operational 
amplifier  performance  and  stability,  are  presorted. 

1.  INTRODUCTION 

Voltage  inverters  (DC- AC  power  converters)  arc  the  mai ^elec¬ 
tronic  sub-system  of  the  now  widely  used  uninterruptible  power 
supply,  and  arc  also  the  fundamental  system  of  most  variable  speed 
drives  with  induction  motors.  Although  in  most  voltage  inverters 
the  control  process  is  well  established  giving  acceptable  capabili¬ 
ties,  recently  there  has  been  a  demand  for  better  reliability  and  per¬ 
formance.  This  fact  led  to  the  implementation  of  the  control 
method,  named  PWM  dual  current  mode  instantaneous  voltage 
feedback  control  [1 J.  Whilst  a  substantial  amount  of  work  has  been 
invested  in  studying  the  intrinsic  stability  of  the  modulator  princi¬ 
ple,  as  far  as  the  author  is  aware,  little  effort  has  been  done  both  on 
the  modelling  of  the  modulator  when  included  in  a  feedback  con¬ 
trolled  system  and  on  the  process  stability  study.  This  work  tries  to 
fill  this  gap,  so  adequate  models  for  dynamic  behaviour  design  and 
stability  analysis  of  the  current  mode  modulator  controlled  system 
arc  proposed. 

A  dominant  pole  continuous  linear  model,  applied  to  the  current 
mode  modulator,  is  refercd,  due  to  its  quasi -universal  applicability 
to  power  converters,  and  to  its  simplicity,  which  allows  a  correct 
feedback  control  design,  even  for  the  non  specialist.  However, 
stability  analysis  theoretical  results  obtained  with  the  continuous 
model  do  rot  match  experimental  ones,  so  the  model  is  not  suitable 
for  stability  studies.  Therefore,  a  discrete  model  is  also  presented, 
and  found  to  be  well  suited  for  stability  analysis.  However,  it  is 
more  cumbersome  for  feedback  regulator  design,  although  it 
reproduces  the  same  parameter  values. 


The  open  loop  dynamic  model  can  be  obtained  assuming  the 
statistical  mean  value  for  the  modulator  discrete  delay,  Ta 
(Ta-TpwM/2  where  Tp\w  is  the  period  of  the  PWM  carrier  fre¬ 
quency).  Hus  gives  io-gm  vc  rsTm,  as  the  dynamic  model.  Ap¬ 
proximating  the  exponential  by  the  first  two  terms  of  the  cx  Taylor 
series  expansion,  which  is  valid  for  sTai<0.6,  it  is  obtained  a  domi¬ 
nant  pole  continuous  linear  model  (industrial  model)  (2],  for  the 
association  inverter-modulator. 

io(s)--J§^vc(s).  (I) 


Fig  I  Experimental  open  loop  frequency  response.  Note  that  the 
frequency  of  the  theoretical  pole  is  closed  to  the 
frequency  of  the  experimental  one  (Tp\\\i  “  I  ms). 

These  assumptions  are  validated  experimentally  by  figures  1  and  2 


2.  CONTINUOUS  LINEAR  MODULATOR  MODEL 
'  AND  INSTANTANEOUS  VOLTAGE 
FEEDBACK  REGULATION. 

T/ie  dual  currents  mode  modulator  compares  a  signal 
proportional  to  the  inverter  output  filter  and  load  current,  with  two 
voltage  levels  ve+Va  and  vc-Va ,  where  vc  is  the  error  voltage  value 
at  the  modulator  input  and  Va  is  a  constant  voltage  tfiat  must  be 
larger  than  a  voltage  proportional  to  the  half  maximum  output 
current  ripple.  A  dedicated  logic  circuitry  associates  the  companson 
results  with  a  fixed  frequency  two  phase  clock,  deciding,  in  each 
period,  which  power  semiconductors  must  be  turned  on  or  off, 
synchronously  with  the  two  clock  phases,  and  which  must  be  turned 
off  or  on  upon  the  comparison  results,  thus  varying  the  duty-cycle. 
In  this  way,  this  fixed  frequency  modulator  allows  four  quadrant 
converter  operation  and  provides  output  current  auto-dipping 
capability,  (1). 

The  association,  pow-er  inverter-dual  current  mode  PWM 
modulator  with  instantaneous  output  current  value  sampling,  can  be 
approximately  described  in  the  steady  state,  continuous  operation, 
by  the  equation,  lo^gmVt,  where  !<>  is  the  output  current  mean 
value  per  period,  vc  is  the  control  voltage,  and  gn,  is  the  association 
related  constant  (transconductance).  Thus,  the  converter  is 
equivalent  to  a  voltage  (vf)  controlled  current  source  (or  sink)  with 
transconductance  gm  or  an  operational  iransconductance  amplifier 
(OTA). 


Fig.2  Open  loop  experimental  results.  Reference  wltage  vt  and 
output  current 

To  control  the  inverter  output  voltage  V0,  the  use  of  a 
proportional  integral  (P.  I.)  regulator,  handling  the  error  between 
the  voltage  reference  vr  and  the  output  voltage  instantaneous  value 
v0,  guarantees  zero  steady  state  error  and  sufficiently  fast  transient 
response.  The  whole  inverter  block  diagram  is  shown  on  figure  3 


Fig.3  Block  diagram  of  the  wltage  controlled  system. 
Cancelling  the  pole,  dependent  on  the  load  resistance  Ro  and  Co 
of  the  LC  output  filter,  with  the  zero  of  the  P.I.  regulator: 


T*m  "  C0Ro  » 

a  2nd  order  closed  loop  transfer  function  can  be  written 


(2) 

(3) 


where  the  damping  factor  must  have  a  value  close  to  to 
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obtain  low  overshoot  and  last  response.  Thus: 

Tpm  ”  2<XygmR0Tm (4) 


The  equations  (2)  and  (4),  together  with  the  proper  tuning  of  the 
LC  output  filter  for  40  dB  ripple  carrier  rejection  {3J,  allow  the 
design  of  a  power  operational  amplifier  with  the  voltage  gain  set  by 
I/dy  (20dB),  maximum  power  bandwidth  of  120  Hz  (DC  to  120 
Hz),  small  signal  bandwidth  of  160  Hz  (fig.  4)  and  good  static  and 
dynamic  performances,  which  can  be  seen  in  the  experimental 
results  presented  (fig.  5  and  6). 


Fig.4  Experimental  dosed  loop  transfer  function. 
7MJWW  a  MV  SM—  /*■«  — 


FigJ  Closed  loop  experimental  results:  Sinusoidal 
reference  ve  and  output  v0  voltages. 


The  usefulness  of  the  simple  continuous  model  for  feedback 
synthesis,  justifies  its  presentation  in  this  work  (for  divulgation  pur¬ 
poses).  as  it  can  be  a  first  order  model  for  most  power  converters. 


Fig  6  Closed  loop  experimental  results  a)  Triangular  reference  vc 
and  output  v0  b)  Pulse  reference  ve  and  output  v0  response. 
3.  MODULATOR  DISCRETE  IMPULSE  MODEL  AND 
STABILITY  ANALYSIS 


To  determine  a  linear  incremental  model  which  accurately 
accounts  for  the  discrete  action  of  the  dual  current  modulator 
principle,  let  us  consider  only  infinitesimal  perturbations  Avt,  that 
originate  also  infinitesimal  perturbations  in  the  duty-cycle  and 
hence  on  the  inverter  output  puked  voltage  Av,.  These  pertuibations 
have  the  form  of  infinitesimal  narrow  pulses,  so  they  can  be 
replaced  by  6  functions  (Dirac  impulses)  multiplied  by  the  pulse 
area.  The  incremental  output  voltage  of  the  power  inverter  can, 
thus,  be  modelized  by  an  ideal  sampler  (fig  7),  which  samples  Avt 
at  the  commutation  instant.  The  perturbations  on  the  output  current 
mean  value,  linked  to  these  pulses,  are  delayed  by  the  output  LC 


’filter  inductor.  Then,  the  idea!  sampler  must  be  followed  by  a 
continuous  lag  with  gain  go,  and  time  constant  TpwM-  sampling 
frequency  is  the  fixed  frequency  of  the  two  phase  dock  (  fpyvM  • 
1/TpwM  )>  and  it  will  be  assumed  constant.  This  is  a  valid- 
assumption,  if  there  is  only  interest  in  determining  the  conditions 
-leading  to  the  onset  of  instability,  which  is  the  normal  situation. 


Fig.7  Discrete  impulse  mode!  of  the  voltage  controlled  system. 

A  discrete  Z  transform  based  theoretical  study  shows  that  the 
discrete'opcn  loop  transfer  function  TF*(z)  is  (fig.7): 

TF(z)  -  G1G2G  jGj'fz)-  K  ,  (5) 

(z-l)(z-a)(z-fi) 

where  K*gmavTzm(l <t)fTpn£'x>,  fi*c(TPWM/CoRo)  and  cc«e(-I). 
Equating  the  expression  (5)  to  minus  one  (using  the  modified 
Routh-Hurwitz  criterion),  the  critical  gain  Kc  can  be  calculated,  and 
by  root  locus  analysis  technics,  the  instability  frequency  (Oc,  can  be 
determined:  the  root  locus  (fig  8a)  crosses  the  unitary  drde  in  the 
point  (-1,0),  then  (OeTpWM  ■  .Therefore: 


Fig  8  a)Theoretical  root  loci  and  b)  experimental  instability 


Kc-2(<x*l)  ; 


(0c“ 


jgPWM 

2 


(6) 


Instability  at  this  frequency  (half  the  sampling  frequency)  or 
Nyquist  instability  , verified  in  practice  (fig.  8b),  is  due  to  the  sys¬ 
tem  discrete  nature,  but  is  not  predicted  by  the  continuous  linear 
model  presented  in  section  2.  This  model  predicts  oscillations  at  a 
gain  dependent  frequency  always  bellow  half  of  the  sampling 
frequency. 

4.  CONCLUSION 

This  work  presents  two  models  for  the  dual  current  mode 
instantaneous  voltage  feedback  inverter.  For  the  design  of  the  P.J. 
regulator,  the  whole  DC-AC  converter  is  theoretically  best  treated 
as  a  transconductance  amplifier  with  a  dominant  pole.  Experimental 
results  confirm  the  validity  of  this  model  and  its  usefulness  to 
synthesize  voltage  feedback  regulator  with  very  good  transient 
response.  Experimental  results  also  show  good  performance  levels 
due  to  the  instantaneous  feedback  dual  current  mode  PVVM 
modulator  and  voltage  feedback  P.I.  regulator.  Nevertheless,  this 
model  predicts  wrong  critical  gains  and  oscillating  fre<fiency. 

For  studying  the  closed  loop  system  stability,  the  DC-AC 
converter  is  bat  treated  as  an  ideal  sampler  followed  by  a  lag.  This 
model  enable  the  correct  calculation  of  the  critical  gain  and  the 
oscillation  frequency,  which  is  verified  in  practice,  but  requires  a 
great  mathematical  effort  to  evaluate  the  P.I.  regulator  parameters. 
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Abstract  --This  paper  presents  detailed  and  global  models  for  the 
simulation  of  the  PWM  voltage  converter  connected  to  the  AC  Mains. 
A  personal  computer  program  ^introduced,  allowing  detailed  and 
global  simulation  with  system  or  with  Park’s  coordinates.  The 
numerical  integration  method  used  is  an  explicit  Euler-Backward 
method.  Theoretical  results  obtained  with  the  detailed  and  global 
models  and  experimental  results  are  presented  and  compared. 

L  INTRODUCTION 


prototype  (1  KVA).  It  is  shown  that  experimental  and  theoretical 
results  are  in  good  agreement. 


n.  SYSTEM  MODELING 
A.  System  coordinates 

The  first  detailed  model,  in  system  state  variables,  is  a  classical 
model.  The  PWM  voltage  converter  is  seen  as  a  system  with  three 
transfer  functions  relating  die  ei2,e2jandej|  line  voltages  with  the 
DC  voltage  e*.  The  corresponding  equations  are: 


Conventional  thyristor  AC/DC  converters  arc  limited  in  performance 
due  to  the  line-commutation  process.  The  advent  of  high  power,  high 
frequency  semiconductor  switches,  with  gate  controlled  fast  turn-off 
capability  (GTO’s),  allows  the  use  of  forced  commutation  and 
therefore  PWM  techniques  can  be  applied  to  these  converters  (2). 
PWM  converters  can  be  classified  in  two  general  types:  current 
converters  and  voltage  converters  [4|.  Voltage  converters  have  useful 
characteristics  {3  J  that  can  be  summarized  as: 

-  near  sinusoidal  current  waveforms 

-  AC  four  quadrant  operation 

-  DC  link  unidirectional  voltage 

-  bidirectional  power  transfer  capability  by  reversing  the 

flow  direction  of  the  DC  link  current. 


el2*(Sr82)e*  •  c23'(3r8!)'4  :  *3l*(gs-gl|«<fc  ()) 

The  transfer  functions  gj,  g2and  gj  are  non-linear  functions  that  can 
assume  the  values  0,1  according  to  the  state  of  each  converter  arm, 
which  are  dependent  of  the  PWM  control  circuit.  Thus,  they  are  lime 
functions  of  the  modulation  index  Ia,  and  of  the  power  angle  6. 
Equation  (1)  can  be  written  using  phase  voltages  The  transformed 
equation  is' 

erf»eac  ;  e2-f2  :  ey-fye*  (2) 

«de“fl  ij+f2  i2+f)  i)  (3) 

where: 


This  paper  presents  some  models  for  the  simulation  of  the  PWM 
voltage  converter  connected  to  the  AC  mains,  figl. 


.  f _  f^.2g,-(g,+gj  (4) 

The  introduction  of  the  three  inductor  equations  of  the  AC  side  and 
the  capacitor  equation  of  the  DC  side  gives  the  state  model  (5).  In  this 
model  only  the  two  independent  AC  currents  are  needed.- 
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Fig  1.  Voltage  converter  connected  to  the  AC  mains 

Several  applications  are  possible,  specially  in  AC/DC  conversion, 
where  the  DC  receiver  needs  a  supply  with 'DC  voltage 
characteristics.  With  the  new  high  power,  gate  turn-off  switches, 
applications  on  High  Voltage  Direct  Current  (HVDC)  transmission 
13  j  will  be  feasible. 

The  PWM  control  circuit,  described  in  14),  is  composed  of  a'nalog 
and  digital  circuits.  The  inputs  of  this  system  are;  1)  the  modulation 
index  Im.  and  2)  the  power  angle  5  measured  between  the  mains  and 
the  AC  fundamental  converter  voltages.  Two  detailed  and  two  global 
models  are  presented.  The  first  detailed  model,  in  system  state 
variables,  is  quite  classical.  A  second  detailed  model,  in  Park’s 
coordinates,  is  obtained  from  the  first  model  by  a  Parks  variables 
transformation.  Global  models  (in  state  and  in  Park’s  coordinates)  are 
obtained  neglecting  the  harmonic  content. 

The  paper  presents  experimental  results  obtamed  from  a  laboratory 


'  did' 

r  Ci 

Vd’ 

dt 

*E  05  'L 

- 

L 

& 

dt 

- 

r  Jq 

•"  T  -f 

+ 

de* 

fJ  h  o 

Cdc 

,  *10*1 

dt 

c  c 

c 

B.  Park’s  coordinates 

A  second  detailed  model,  in  Park’s  coordinates,  is  obtained  from  (5) 
by  a  Fork's  transformation  of  variables  (6): 


(6) 


Cj  and  f,  are  computed  from  f|,  f2  and  f3  in  the  same  way  as  V4  and 
V«  are  computed  from  Vi,  V2  and  Vj 


The  two  global  models  (in  state  and  in  Park’s  coordinates)  are 
obtained  from  the  corresponding  detailed  models  by  neglecting  the 
harmonic  content.  In  the  state  variable  model,  in  steady  state,  fl  and 
f2  are  sinusoidal  functions  of  time,  whose  amplitude  is  proportional 
to  the  Im  value  and  who.,  phase  is  a  function  of  the  power  angle  6 
In  the  Park’s  model  &and  f^are  functions  of  Iaand  5 
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III.  THE  SIMULATION  PROGRAM 

The  program  implemented  allows  global  and  detailed  simulation  in 
systems  or  in  Park’s  coordinates.  To  integrate  equations  5  and  6;  two 
explicit  Euler-Backward  formulas  (7  and  8)  are  used. 
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The  system  is  simulated  with  a  fixed  modulation  index  Ira  (e.g. 
without  the  reactive  power  controller).'The  eye  controller  is  a  PI 
designed  to  allow  low  overshoot  and  fast  response.  Equations 
corresponding  to  this  controller  are  integrated  by  the  Euler  Forwaid 
method. 


IV.  RESULTS 

The.  results  of  the  Park’s  model  and  the‘,systcm  state  model  are 
equivalent.  In  loth  cases,  the  global  models  are  five  times  faster  than 
detailed  models.  The  numeric  stability  is  very  good. 

The  DC  voltage  controller  parameters  were  selr  .ed  in  order  to  have  a 
natural  frequency  ton"10  rad's  and  a  damping  ratio  £*0.7,  m  closed 
loop.  For  DC  voltage  control,  experimental  and  theoretical  results 
obtained  with  both  models  are  shown  in  fig  2  and  3.  If  the  controller 
parameters  were  modified  in  such  a  way  that  the  natural  frequency  Wn 
is  increased,  a  malfunction  occurs  causing  DC  components  In  AC  line 
currents,  fig4.  The  global  model  also  confirms  this  malfunction’ 


(b)  Detailed  Simulation  (c)  Global  Simulation 
Figure  2.  Response  to  a  DC  voltage  reference  variation  (150-*  120V) 
1  •  AC  line  current  '2  •  DC  voltage  3  •  Power  angle 


(a)  Experimental  (b)  Detailed  Simulation 

Figure  3.  Steady  state  AC  vanables 
(converter  phase  voltage,  line  current) 
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(b)  Detailed  Simulation 
Figure  4.  Response  with  improper  controller  parameters, 
(line  current,  power  angle) 


V.  CONCLUSION 

Personal  computer  algorithms  for  the  simulation  of  the  voltage 
converter  connected  to  the  AC  mains  were  presented.  Two  detailed 
and  two  global  models  have  been  presented,  whose  equations  were 
integrated  by  an  explicit  backward  method.  The  results  of  simulation 
we:.;  compart  with  experimental  results  obtained  from  a  laboratory 
circuit  test. 

The  models  and  the  program  presented  are  well  adapted  to  the 
experimental  prototype,  being  a  valuable  tool  for  system  dynamics 
and  regulation  studies. 
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Abstract  -Uncharacteristic  harmonic  currents  are  generated  by  a 
six-pulse  power  converter  due  to  imperfections  in  supply  or  control 
system.  A  quantitative  analysis  of  the  effects  of  supply  system 
imperfections  (i.e.V  voltage  unbalance  and.  asymmetrical 
commutation  reactances)  upon  the  production  of  uncharacteristic 
harmonics  is  presented.  Two  unbalance  factors  (UBFV  and  UBFX) 
are  proposed  to  characterize  the  imperfections  "so,  that  the 
mathematical  relation  between  harmonic  intensity  and  the  degree  of 
unbalances  can  be  established  and  reviewed. 

Although  uncharacteristic  harmonic  currents  produced  by  .static 
converters  are  often  neglected  because  of  their  relatively 
insignificant  quantities  compared  with  that  of  characteristic 
harmonics,  the  impedances'a:  uncharacteristic  harmonic  frequencies 
may  be  quite  high  since  filters  are  seldom  provided  for 
uncharacteristic  harmonics.  Therefore,  significant  uncharacteristic 
voltage  harmonics  may  appear, at  the  point  of  common  coupling, 
especially  when  supply  system  possesses  some  degree  of  unbalance 
in  voltage  or  impedances.  Studies  dealing, witfy  uncharacteristic 
harmonics  of  six-pulse  conveners  apply  complicated  formulations 
to  calculate  the  converter  current  during  commutation  intervals.  This 
necessitates  intensive  computational  effort  for  harmonic  analysis 
(1*3),  Direct  simulation  of  convener  operation  (4),  although 
efficient  and  accurate  as  a  tool  for  transient  analysis,  is  not  efficient 
to  analyse  current ‘Spectra  in-steady  state.  On  the  other  hand, 
available  analytical  harmonic  models  (5)  do  not  take  into  account  the 
uncharacteristic  harmonics:  In  this  paper,  a  general  analytical  model 
with  simplified  commutation  current  for  the  determination  of  both 
characteristic  and  uncharacteristic  harmonic  currents  of  a  six-pulse 
convener  under  unbalanced  voltage  and  commutation  reactances,  is 
developed.  Formulation  for  unbalance  factors  of  voltage  and 
reactance  is  also  given.  The  formulation  permits  to  establish  the 
mathematic  relationship  between  the  unbalances  (of  voltage  and 
reactances)  and  resulting  uncharacteristic  harmonics.  Finally,  the 
effects  of  the  two  kinds  of  unbalances  arc  compared, 


In  Fig,  l,  the  trapezoidal  wave  indicates  the  current  waveform  and 
the  square  wave  represents  its  derivative.  Note  that  current  during 
commutation  is  approximated  by  a  straight  line.  The  commutation 
angles  p|  and  p2  differ  from  each  other  due  to  unbalanced 
commutation  reactances.  The  current  harmonics  can  be  calculated  as 
far  as  current  waveform  can  be  defined.  This  implies  to  define  Pj, 
p2>  a°d  #4-  The  calculation  of  pjand  p2  gives  no 

difficulties  if  the  effect  of  voltage  unbalance  on  p  is  negligible  (5J. 

p»  Cos-^cosa  *  (xi+x2)ty(W6)) .  ct-  (I) 

where  a  indicates  firing  angle,  V  phase  voltage  in  rms  value,  !d  dc 
current,  and  xj  and  x2  reactances  concerning  the  corresponding 
commutation  Whereas,  to  find  Oj,  02, 03  and  04  needs  careful 
considerations  for  the  effect  of  voltage  unbalance  upon  line-voltage 
zero  crossings.  Fig.  2  shows  an  efficient  method  of  deterrmng  the 
angles  by  which  the  zero  crossings  of  line  voltage  deviate  from  their 
normal  values  due  to  voltage  unbalance.  A  tnangle  formed  by  L'ab, 
^be  ►  Uw  and  an  equilateral  tnangle  with  U^  as  one  of  its  sides 
are  shown.  Taking  Uat>  as  reference,  5ab  and  8^  represent  the 
deviation  angles  for  U^  and  Uw,  respectively.  The  law  of  cosines 
gives" 


S^CosIlfU^U^U^U^F)^  (2 1), 

8^/3-Cos-J((Ubc2+Uca2-U^2)/(2UcaUbc))  (2.2) 

Hence,  0j  and  02  can  be  written  as: 
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Fig,  1  AC  current  waveshape  and  us  dcnvative 


where  t=a,  b  or  c  according  to  the  phase  specified  p  indicates  the 
phase  shift  between  three  phases.  (pa,  pb,  pc)  =  (-rc/3,  n! 3,  Jt), 
(8,a,8Ib.5lc)» 

(^ca»  5,t>-  0);  (82,,  82b,  82c)  »  (S,v0, 6C,). 
Because  of  the  half  penod  symmetry  of  current  waveform,  03  and 
04  are  expressed  as: 

05,-0, i  +  s;  O4i  =  02i  +  (i  (4) 

Note  that  Id.  pj.  p2.®i.  02.  and  04  uniquely  determine  the 
curecnt  waveform  Consequently,  the  application  of  Fourier 
analysis  to  the  waveform  gives  the  harmonic  components  of  line 
current.  However,  direct  Founer  analysis  on  the  trapezoidal  current 
wave  is  a  tedious  work.  A  wiser  method  is  to  find  the  Founer  scries 
of  di(0)/d0  and  then  integrate  the  obtained  series  Thus,  the  nlh 
harmonic  current  in  can  be  written  in  a  compact  formula: 

*n  "  -  an2(e-in®2  -  e**4)J  (5) 

where  anls(sin(n  \Lxl2)]/(  n  p,/2);  a^sinfn  [V/IM  n  p2/2);  n  » 
1, 3, 5,....  For  even  n,  1^=0.  It  can  be  shown  that  when  the  supply 
system  is  perfectly  balanced,  anl  and  an2  are  equal,  O2-0,  =  04-03 
=»  2n/3,  and  the  harmonic  currents  of  orders  3, 9, 15, 21,  are  null 
U  c ,  the  uncharacteristic  harmonics  do  not  exist)  Note  that  good 
agreement  between  the  current  spectra  calculated  by  Eq.  (5)  and  by 
digital  simulations  has  been  obtained  to  validate  the  model 

11L FORMULATION  OF  UNBALANCES 
Voltage  unbalance  and  reactance  unbalance  arc  formulated  m  order 
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to  relate  uncharacteristic  harmonics  with  unbalance  factors  UBFV 
and  UBFX. 

A,  Voltage  Unbalance  -  Unbalance  factor  of  three-phase  voltage 
UBFv  is  defined  as  the  ratio  of  negative  sequence  component  to 
positive  sequence  component.  Or. 

UBFV  =.  V'/V-'-'tyCA,  (6) 

The  magnitude  and  phase  6,  can  be  more  easily  calculated  from 
the  magnitudes  of  line  voltages  by  solving  the  equations  (7). 

(X2*- Y2-X-Y -X  Y+l  )/(X+ Y + 1)2  -  (tv  2)/d  0) 

(sf3)(X-Y)/(X+Y-2)  =  tan0v  <8> 

X  =  UWUtc  Y  =  OeAe  <9) 

sign(sin  0,  )  »  sign(X-Y)  (*0) 

With  Eq  (7)-(10),  Tv  and  0V  have  one-to-one  cotrespondance  with 
X  and  Y  If  is  taken  as  reference  (1  0  p  u ),  UJb  and  can  be 
'  solved  from  tY  and  8V . 

B  Commutation  Reactance  Unbalance  -  The  unbalance  factor  of 
reactances  is  defined  similar  to  UBFV.  That  is,  UBFX  »-txei&x, 
where  Tx  and  0X  are  determined  by  Eqs.  <7)-(10)  by  substituing  the 
three-phase  reactances  X|,  x2  and  X3  for  Uab,  Utc  and  UM.  and  vx 
and  0X  for  tv  and  0V.  It  is  not  difficult  to  show  that  (xlt  x2.  x3)  and 
(g x,0x,x4Ve)  have  one-to-one  cotrespondance  where  xin  “  (X|  +  x2 
+  x3)/3.  That  is,  from  tx.  0X  and  xlve.  the  commutation  angles 
p2  of  line  current  can  be  found. 

C  Relation  between  Harmonics  and  Unbalance.  FaeiCIS  The 
mathematical  development  of  UBFV  and  UBFX  allows  to  calculae  In 
from  xN,  0V,  \  and  8X,  provided  that  xavt  ts  given  and  U^  is  taken 
as  reference.  In  fact,  in  Eq.  (5),  it  can  be  seen  that  UBFV  affects 
mainly  0j.  02,  O3  and  04*  and  UBFX  affects  anj  and  aftj.  The 
seperauon  of  the  effects  of  UBFV  and  ULFX  is  useful  in  the 
following  assessment.  Moreover,  as  the  uncharacteristic  harmonic 
currents  are  unbalanced,  it  is  convenient  to  define  a  harmonic 
intensity  indicator  Cn  that  reflects  the  effect  of  three-phase  harmonic 
currents.  <Jft  »$  defined  as: 

C„  - 100  •  €da„2+  w+  imm  Ia,2+  ibl2-  Id2)i  (11) 

n  =  3. 9, 15.21... 

It  is  clear  that  the  relation  between  ort  and  unbalance  factors  is 
described  implicitly  by  Eqs,  (l)-(U), 

IV.  EFFECTS  OF  UNBALANCES  ON<tq 

Both  unbalance  factors  are  composed  of  their  magnitudes  (tv,  Tx) 
and  phases  (0V,  0X),  Although  not  included  in  this  paper,  it  can  be 
shown  that  cn  is  a  Strong  function  of  tv  and  tx  but  a  weak  function 
of  0X  and  0X,  Furthermore, most  industrial  standards  set  Iimites  for 
rv  and  tx  but  not  for  0Y  and  0X.  Therefore,  it  is  reasonable  to 
concentrate  our  study  on  the  influences  of  xv  and  rx.  Fig.  3  shows 
the  variation  of  03  versus  tv,  with  tx  as  the  parameter.  Note  that 
for  n>3,  o„  is  always  bounded  by  03  The  values  given  in  Fig.  3 
are  computed  under  the  following  conditions:  V»1.0  pu;  l<pl,0  pu; 
a*30®;  xavc=0. 1  pu;  0V*6O°;  and  0X=3O°.  Note  that  tx*40%, 
0X*3O°  and  xavc*d).l  pu  correspond  toX|=0.1 1  pu,  x2=0.1  pu  and 


X3=  0  06  pu,  which  is  a  unbalance  unrealistically  large  Fig.  3 
clearly  illustrates  that,  when  X  is  very  small,  O3  is  dominated  by  xx 
As  xv  increases  to  the  value  where  the  ratio  ix/tv  is  less  than  1 0,  the 
effect  of  Xx  is  replaced  by  xv  It  shoud  be  noted  that  the  curve  of 
Xj~0  is  above  other  curves  when  Tv  is  large.  This  is  indeed  the 
attenuation  effect  of  commutation  reactances. 

V.  CONCLUSIONS 

An  analytical  model  for  the  deteimination  of  both  characteristic  and 
uncharacteristic  current  of  a  six-pulse  converter  under  voltage  and 
reactance  unbalance  has  been  developed.  The  model  resolves  the 
causes  of  the  production  of  uncharacteristic  harmonics  into 
functional  (the  deviation  of  0i>02»  63  and  04  due  to  voltage 
unbalance)  and  structural  (the  asymmetry  of  and  ^t2due  t0 
reactance  unbalance)  perturbations.  An  application  of  the  model,  the 
relation  between  two  unbalance  factors  and  a  harmonic  indicator  cn, 
i$  presented,  It  is  concluded  that  the  model  is  useful  for  clear 
explanation  of  the  genesis  of  uncharactenstic  harmonics  and  is 
applicable  to  the  harmonic  problems  concerning  Six-pulse 
converters. 
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Abstract  The  ala  of  this  paper  is  to  show  that  a 
stabilizing  signal  proportional  to  the  transmission 
line  power  flow  variations  added  In  the  SVC  voltage 
regulator,  increases  the  dynamic  stability  of  power 
systems. 

Two  kinds. of  signals  are  studied  here;  the  design 
of  the  first  one  uses  an  analogical  method  and  the 
second  signal  is  achieved  by  using  an  optimal  adaptive 
control.  This  last  gives  good  results  for  small 
disturbances.  In  case  of  high  magnitude  perturbations, 
the  analogical  method  can  be  more  efficient. 


InU.vducllaa 

The  Integration  of  a  far  away  power  plant  to  an 
existing  network  creates  always  stability  problems.  To 
solve  It,  sophisticated  generator  control  systems 
(excitation  control  with  auxiliary  signals  and  fast 
valving  in  the  generator)  arc  often  used.  However,  when 
long  lines  are  considered,  theso  control  technics  are 
not  sufficiently  effective  to  avoid  instability.  So 
reactive  power  compensation  is  rcarted  to. 

Voltage  keeping  can  be  carried  out  by  reactive 
power  control  and  it  Is  well  known  that  the  use  of 
static  VAR  compensators  involves  high  reactive  power 
control  performances.  However,  maintaining  a  constant 
voltage  is  not  sufficient  in  many  cases  to  obtain  a 
good  damping  of  the  power  system  oscillations;  a 
stabilizing  signal  is  then  required. 

In  this  paper,  wo  propose  to  introduce  an 
auxiliary  signal  in  the  SVC  voltage  regulator, 
proportional  to  the  transmission  line  power  flow 
variation.  This  auxiliary  signal  could  be  concolvcd  by 
tho  analogical  method  (1)  or  by  using  a  modern  method 
as  the  adaptive  control  (31.  These  two  approaches  will 
bo  compared  in  different  conditions. 


I  Bis  stabilizing  signal 

Tho  studied  SVC  consists  of  fixed  capacitors  and 
thyristor  controlled  reactors  and  has  been  chosen  for 
the  rapidity  of  its  responso  and  its  possibility  to 
lead  and  lag  reactive  power.  Within  the  linear  control 
range,  it  is  assimilated  to  a  variable  susceptance  B. 

The  stabilizing  signal  is  proportional  to  tho 
transmission  line  power  flow  variation  at  the  point  of 
connexion  of  the  SVC.  A  filtering  Is  necessary  at  the 
output  of  the  measurement  block. 

Among  the  various  methods,  the  self  turning 
control  method  is  the  most  feasible  In  terms  of 
implementation  and  it  can  be  used  for  no-minimum  phase 
systems.  The  process  is  identified  by  use  of  an 
algorithm  based  on  a  least  square  recursive  method  with 
a  missing  factor.  The  optimal  control  minimizes  a 
quadratic  index  performance  (21(4).  The  block  diagram 
is  given  on  figure  1 


fig  1:  Block  diagram  of  the  adaptive  control. 


I!  Studied  cower  system 

It  Is  simply  considered  a  generator  feeding  a 
medium  power  system  through  a  transformer  and  a  long 
transmission  line.  Tho  power  system  is  assimilated  to 
an  infinite  bus  bar  with  a  short  circuit  reactor.  Two 
loads  are  alsc  plugged  in.  Tho  generator  comprises  a 
synchronous  machine  with  its  excitation  and  governor 
control.  These  are  chosen  without  any  sophisticated 
part  which  can  mask  the  compensator  effect.  The  SVC  Is 
connected  at  the  middle  of  the  line. 


fig  2:  The  studied  power  system. 

HI  Test  results 

Simulations  have  been  done  with  a  muIUmachinc 
transient  stability  program  developped  at  Electricity 
de  Franco. 
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In  case  of  small  .disturbances  resulting  from 
app  Hying  a  negative  step  of  5V.  on  the  <  reference'  of  the 
generator  excitation  control,  the  power  system  presents 
an  instability  When  the  SVC  is  added,  the  oscillations 
are  slightly  damped  but  it  is  hot  sufficient  to 
stabilize  the  generator. 

When  the  .  analogical  signal  is- Incorporated-  into 
the  SVC,  the  oscl nations  are  significantly  damped.  The 
curve  b  on  figure  3  has  been  obtained  for  a  gain  of- the 
additional  loop  equal  to  S,  but  an  increase  makes  the 
signal  less  efficient. 

The  adaptative  signal  gives  better  results  as  it 
can  be  seen  on  curve  a.  The  oscillations  are  damped 
rapidly  (after  4.5  s).  Moreover,  since  the  adaptative 
signal  does  not  entail  an  immediate  response,  which 
analogical  signal  does,  there  -Is  no  overvoltage 
observed  at  the  beginning. 
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fig  5.  Rotor lc  angle  (a)  and  voltage  (b)  after  a 
thrce-phaso  short  circuit,  the  SVC  is  equipped  with  an 
adaptive  signal. 


fig  3;  Rotorlc  angle,  the  SVC  Is  equipped  with  adaptive 
(a)  or  analogical  (b)  auxiliary  signal. 

Two  simulations  of  high  magnitude  disturbances, 
have  been  experimented:  a  reporting  load  and  a 
thrce-phaso  short  circuit  at  tho  output  of  the  group. 

The  reporting  load  consists  of  tho  opening  of  a 
portion  of  tho  lino  between  the  group  and  tho  point  of 
connexion  of  the  SVC,  In  this  case,  the  results  arc 
similar  to  those  obtained  for  low  disturbances.  The 
oscillations  are  better  damped  and  the  system  is  more 
rapidly  stabilized  with  the  adaptive  signal. 


fig  4,  Rotoric  angle  In  case  of  reporting  load,  the  SVC 
Is  equipped  with  adaptive  (a)  or  analogical  <b)  signal. 

In  the  case  of  a  three-phase  short  circuit 
suppressed  after  100ms,  the  results  obtained  with  the 
adaptive  signal,  are  not  satisfactory.  A  slight  damping 
of  the  oscillations  of  the  rotoric  angle  can  be 
observed.  Control  reaches  quickly  limit  values.  This  is 
clear  on  figure  5b  which  gives  the  voltage  at  the 
output  of  the  generator.  The  error  obtained  at  the 
identification  level  grows  more  and  more. 


For  the  same  disturbance,  tho  system  is  stabilized 
with  an  analogical  signal  and  the  best  results  are  get 
for  a  gain  equal  to  8. 


CaasMlaa 

It  has  been  shown  In  this  paper  that  tho  SVC  with 
an  auxiliary  signal  proportional  to  variation  of  power 
flow  Improves  dynamic  stability.  Adaptive  control  gives 
much  better  results  than  ah  analogical  signal  for  low 
magnitude  disturbances.  Identification  still  poses  a 
problem  In  hard  conditions  and  the  adopted  algorithm 
needs  improving.  Another  solution  would  be  to  fix  tho 
parameters  during  the  fault  and  to  use  optimal  control 
after  a  small  delay. 


Appendix 

n-n.r^tor:  S^-1070  MVA,  XfX-Z.VIt*.  x;-.4p, 

T‘  »8.9s;  T*o*.05s;  H»2.Ss 

SVC:  X  *15  kVAR/kV;  rating  ±300  MVAR;  TaR»20ms; 

T*  50ms;  T»5os. 
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Abstract  -This  paper  presents  a  simulation  study  for  the  summation  cf 
random  harmonics  produced  by  a  number  of  independent  harmonic 
generators.  The1  harmonic  generators'  considered  are  six-pulse 
converters  with  random  firing  angles.  At  the  first  stage,  a  probabilistic 
converter  model  is  developed  to  derive  the  distribution  functions  of 
the  magnitude  and  phase  of  harmonic  currents  for  a  single  converter. 
This  is  followed  by  considering  the  combination  effect  of  harmonics 
of  N  converters.  The  statistical  behavior  of  the  vectorial  sum  of  N 
harmonics'  of  the  same  order  is  analysed  and  justified  by  the  results 
of  Monte  Carlo  simulation.  The. potential  applications  of  the 
simulation  results  are  also  discussed. 


I.  INTRODUCTION 

Much  attention  has  been  given  to  the  problem  of  random  harmonic 
summation  because  of  the  increasing  presence  and  random  features  of 
power  electronic  installations  in  power  systems.  In  stationary  state, 
the  stochastic  change  in  the  loads  of  static  converters  leads  to 
stochastic  variation  in  the  current  spectrum.  Most  publications  dealing 
with  random  harmonics  assume  that  the  magnitude  and  phase  of 
harmonics  are  statistically  independent,  so  a  phasor  can  be  resolved 
into  two  independent  components  on  x-axis  and  y-axis,  and  the 
resultant  harmonic  can  consequently  be  calculated  on  each  axis  [1-4], 
Hence,  complicated  vectorial  summation  of  random  variables  becomes 
simple  arithmetic  summation  for  each  random  component  The  central 
limit  theorem  is  often  cued  to  explain  the  tendency  towards  normal 
distribution  for  each  projection.  However,  neuherthe  assumption  of 
independence  between  the  magnitude  and  phase  of  harmonics  was 
assessed,  nor  the  mechanism  of  harmonic  generation  of  realistic 
nonlinear  loads  was  considered.  In  this  paper,  a  type  of  realistic 
industrial  harmonic  generator  (i.e.,  six-pulse  converter)  is  modeled 
and  simulated  by  Monte  Carlo  methods  to  evaluate  the  validity  of  the 
foregoing  assumption. 


H.  CONVERTER  MODELS 

Six-pulse  power  converter  has  been  widely  used  in  the  industry. 
Although  sophisticated  models  of  this  type  of  converter,  that  take  into 
account  the  effect  of  dc  current  ripple  [5]  and  commutation  [6],  are 
available,  the  simplest  model  existing  in  most  standard  textbooks 
(e.g.,  [7])  is  employed  m  this  study.  Negligence  of  commutation  and 
dc  cuiTcnt  npplc  may  seem  restrictive  and  unrealistic.  It  has,  however, 
been  studied  by  simulations  that  the  negligence  docs  not  really 
influence  the  'statistical  behavior  of  resultant  harmonics  and, 
moreover,  makes  possible  the  analytical  formulation  of  their 
probability  density  function  (pdf)-  Assuming  that  the  commutation 
and  ripple  effects  are  neglected  and  the  dc  load  of  the  converter  is 
resistive,  the  magnitude  and  the  phase  of  the  hth  harmonic  current  can 
be  written  respectively  as  (h«l.  5,7, 11, 13,...): 

Xh<a)  -  Khcosa;  %  -  -ha  (1) 

where  Kh-18(V2)V/(hRx2),  R  is  the  resistance  on  dc  side,  a  the 
firing  angle  and  V  the  phase  voltage  in  ims  value.  If  the  firing  angle 
a  vanes  randomly  and  uniformly  between  ax  and  a2,  then  the  pdf 
of  Ijj  and  can  be  obtained  by  a  transformation  of  random  variables 
[81: 

fb(Ib)  -  - 1  ,  KuCOSCC^I^KhCOSCtx  (2) 

<ara;i)VK2-i£ 


**  Elcctricitd  de  France 
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(<{>*)  ,  -ha2«ph<-hai,  Aa=a2-ai  (3) 

hAcr 


Eq.  (2)  and  (3)  indicate  that  the  harmonic  magnitude  and  phase  are 
statistically  independent  although  a  is  the  only, varying  parameter  that 
correlates  lh  with  <ph.  If  Ij,  is  expressed  in  complex  form,  the 
physical  interpretation  of  Ih  is  obvious:  4  is  indeed  the  sum  of  two 
random  currents: 

Ih  «  Khcoso[cos(ha)-jsin(hct)] "  (Kfo/2)[  1  /-(h+lkt  +l/;(hd)fl1  (4) 

Each  of  them  has  constant  magnitude  (Kh/2)  and  random  phase 
•(h±l)a , 

IT!  RESULTANT  HARMONICS  OF  N  CONVERTERS 

A  computer  program  based  on  Monte  Carlo  methods  [10]  has  been 
written  to  simulate  the  resultant  harmonic  current  of  N  converters 
(N*l,  2,..., 10).  Tlie  converters  are  assumed  to  have  the  same  power 
rating  and  the  same  variation  range  of  a  (10°<a<70°)  V  and  R  "re 
both  set  to  be  equal  to  1.0  as  base  values.  All  resultant  harmonic 
currents  are  normalized  by  a  factor  of  Nlh(mu). 50  ^e  va^ucs  in 
the  interval  [0,1],  The  normalized  resultant  hth  harmonic  current  is 
noted  as  4*  Note  that  4(m«)  indicates  the  maximum  hth  harmonic 
current  generated  by  one  converter. 

Fig,  I  illustrates  the  pdfs  of  the  fundamental  current  for  N*  1, 2, 
and  10.  Fig.  2  shows  the  pdfs  of  the  7th  harmonic  currents  for  the 
same  N.  It  is  obvious  that  the  pdfs  in  Fig,  1  and  Fig.  2  for  N»1  arc 
identical  because  of  the  normalization.  In  fact,  for  N**l,  normalized 
pdfs  arc  all  identical  for  all  h.  The  analytical  formula  of  4(4)  given 
in  Eq.  (2)  agrees  with  the  simulation  results.  Note  that  the  pdf  curves 
arc  not  smooth  due  to  inherent  characteristics  of  Monte  Carlo 
simulations. 

When  N  increases  to  about  10,  the  pdf  of  fundamental  resultant 
current  approaches  a  normal  distribution.  This  result  is  not  surprising 
since  the  possible  variation  range  of  (pl  (°Aa)  is  confined  within  (0°, 
90®)  and  the  simulation  reduces  again  the  range  to  (10°,  70°)  Vectorial 
summation  of  such  vectors  with  little  freedom  of  phase  variation 
approaches  evidently  their  arithmetical  summation  that  follows  a 
normal  distribution  according  to  the  central  limit  theorem. 

On  the  other  hand,  the  pdf  of  the  7lh  harmonic  resultant  becomes  a 
Rayleigh  distribution  as  N  increases  [91.  Although  not  illustrated  in 
the  paper,  it  can  be  shown  that  the  higher  the  harmonic  order  is  (from 
h£5),  the  more  rapidly  the  pdf  of  the  resultant  harmonic  current 
approaches  a  Rayleigh  distribution.  This  phenomenon  can  be 
explained  by  the  fact  that  gh  becomes  more  and  more  uniform 

in  the  interval  l-x,  xj  when  the  product  hAa  increases.  As  long  as  N 
is  large  enough  to  apply  the  central  limit  theorem  to  the  sum  of 
resolved  components,  the  resultant  approaches  a  Rayleigh  distribution 
[21. 

It  should  nevertheless  be  noted  here  that  gh  <<ph)  is  not  exactly 
uniform  in  the  interval  [-x,  xl  although  it  is  uniform  in  [-hc^,  -ha(]. 
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As  N  increases  to  a  certain  level, , the  resultant  Harmonic  deviates  from 
a  Rayleigh  distribution  to  a  normal  distribution.  In  our  simulations, 
N=10  is  not  large  enough  to  show  the  effect  of  inexact  uniform  phase 
but  is  sufficiently  large  for  the  application  of  the  central  limit  theorem. 
The  problem  of  imperfect  uniform1  phase  distribution  is  to  be 
discussed  in  a  future  papov 


The  prediction  of  harmonic  level  is  important  for  many  applications . 
such  as  sizing  of  harmonic  filters  and  step-down  transformers, 
reactive  power  compensation,  harmonic  penetration  analysis,  ,,  and 
so  on.  It  is,  however,  more  desirable  to  know  the  statistical 
distribution  of  harmonic  level  than  the  viduc  given  by  some  other 
summation  critera  (i  eM  arithmetic  sum,'  root  sum  square,’,  etc.).  For 
example,  one  may  wish  to  know  the  value  for  which  D%  of  the  time 
the  resultant  harmonic  current  is  smaller  than  it.  D%  is  often  called  the 
non-exceeding  probability  and  is  conventionally  chosen  to  be  99%, 
97.5%  or  95%  depending  on  the  requirements  of  the  application. 

Fig,  3  illustrates  the  variation  of  normalized  resultant  harmonic  current 
Jh  at  a  non-exceeding  probability  equal  to  99%  versus  the  number  of 
conveners  N.  The  physical  interpretation  of  Jh  is  indeed  the  harmonic 
diversity  factor.  It  can  be  seen  that  decreases  with  N.  When  N»10, 
the  magnitude  of  the  resultant  hannonic  current  of  order  5, 7,  and  U 
has  99%  of  probability  to  be  equal  or  less  than  about  60%  of  their 
arithmetic  sum. 


The  physical  interpretation  of  random  harmonic  currents  absorbed  by 
a  six-pulse  convener  is  given.  The  discrepancy  between  ideal  random 
phasors  and  random  harmonics  generated  by  a  realistic  nonlinear  load 
can  consequently  be  assessed.  In  general,  Rayleigh  distribution  is  a 
good  approximation  of  resultant  harmonic  distnbution.  The  quality  of 
Rayleigh  distribution  approximation  becomes  belter  when  hAo 
increases.  Good  agreement  is  obtained  between  the  results  of  Monte 
Carlo  simulation  and  analytical  formulation.  Tv-  simulation  gives 
also  information  on  hannonic  diversity  factor  at  a  designated 
non-exceeding  probability,  which  is  useful  for  many  applications 
concerning  harmonic  penetration  problems. 


Eig.  l  pdf's  of  fundamental  currents  for 
N-l,  2,  and  10 


2  pdfs  of  the  7th  harmonic  currents 
for  N-  1,  2,  and  10 


m 

12] 

13] 

H) 

15] 

16] 

(7] 

18] 

19] 

[10] 


W.  G.  Sherman,  "Summation  of  harmonics  with  random 
phase  angles",  Proc.  1EE.  Vol.  119,  No.  11,  pp.  1643-1648, 
Nov.  1972 

N.  B.  Rowe,  "Thesummation  of  randomly  varying  phasors  or 

vectors  with  particular  reference  to  harmonic  levels",  1EE 

Conf.  Publ,.  No.  110,  pp.  177-181, 1974 

A.  Kloss,  “Statistical  analysis  of  harmonic  problems  in  power 

electronic  installations"  (in  Gennan),  Bull.  Assoc.  Suis 

Electr,.  vol.  66,  pp,  427-433,  April  1975 

Y  Baghzouz  &  0,  T.  Tan,  "Probabilistic  modeling  of  power 

system  harmonics",  IEEE  Trans,  on  Indust.  Aopl..  Vol. 

IA-23.No.  !,pp.  173-180,  Jan/Feb  1987 

L.  G.  Dobinson,  "Closer  accord  on  harmonics".  Electronics  & 

Power.pp,  567-572,  May  1975 

A.  D,  Graham  &  E.  T.  Schonholzer,  "Line  harmonics  of 
converters  with  dc  motor  loads",  IEEE  Trans,  on  Indust. 
AppL.  Vol.  IA-19,  No.  1,  pp.  84-93,  Jan/Fcb  1983 
b.  K.  Bose,  :Pmr  Electronics  and  AC  Drives". 
Prentice-Hall,  1986 

A.  Papoulis,  ^Probability  Random  Variables _and_Siochasiic 

Processes".  McGraw  Hill,  1965 

L.  Pienrat, "Combination  of  vectors  with  random  phase 

differences  -  Quasi-invariance  of  the  resultant  amplitude 

distnbution"  (in  French),  to  be  presented  at  La  23S  ioumfe  de 

statistique.  Strasbourg.  France.  May  1991 

R.  W.  Hamming.  ."Numerical  methods  for  scientists  and 


engineers:,  pp.  583-393,  McGraw-Hill,  1962 


fig  — 3  Variation  of  Jh  versus  N 

(non-exceeding  probability  -99%) 


1563 


IMPLEMENTING  AN  ELECTRONIC 
TOPOLOGICAL  PICTUREBOOK 

VAROL  AKMAN,  AHMET  ARSLAN  AVM.  RANDOLPH  FRANKLIN 

Bilkent  University  Rensselaer  Polytechnic  Institute 

Bilkent,  06533  Ankara,  Turkey  Troy,  New  York  121SO-3590,  USA 


Abstract 

Ab  electronic  topological  picturebook  Is  envisaged  as  a  comput¬ 
erised  version  of  George  K.'  Frauds’  A  Topolo$tca! Picturtbook, 
Springer- Verlag  ( 1987).  Francis’  book  is  full  of  complicated  topo¬ 
logical  figures,  mostly  drawn  manually.  Our  ultimate  goal  is 
to  automate  the  production  of  such  illustrations  and  to  obtain 
publication-quality  hardcopy  using  assorted  techniques  of  graph¬ 
ics-  The  present  paper  can  be  regarded  as  a  modest  first  attempt 
in  that  direction. 

1  INTRODUCTION 

Several  methods  exist  for  generating  computer  models  of  real  or  imag¬ 
inary  objects.  The  most  popular  approach  is  to  use  polygons  as  low 
level  primitives  which  define  more  complex  objects.  Obviously,  it  is 
difficult  and  time  consuming  for  a  designer  to  define  an  object  by  such 
simple,  low  level  primitives.  Therefore,  a  higler  level  primitive  such 
as  a  B-sphne  (or  a  B6tier)  surface  is  preferable. 

In  this  paper,  the  suggested  techniques  for  Representing  3-D  shapes 
are  ail  based  on  some  kind  of  sweeping.  The  implemented  program  T> 
.(which  stands  for  ‘  Topology  >ook’)  is  a  rudimentary  graphical  work¬ 
bench  to  help  topologists  illustrate  their  ideas  more  effectively.  Central 
to  our  implementation  is  a  paradigm  of  solid  modelling,  viz.  shape  =i 
stceep  +  control  (13),  T>  is  written  m  the  C  programming  language 
and  runs  on  a  colour  Sun  workstation.  Our  other  papers,  covering 
or  treating  related  issues,  include  (2, 3, 4, 5).  An  early  >  ct  elegant  work 
which  was  inspirational  for  us  is  Baumgart’s  geometric  editor  (7). 

This  paper  owes  its  existence  to  A  Topological  Picturebook  of  George 
K.  Francis— a  book  that  was  written  to  encourage  mathematicians  to 
illustrate  their  work  and  to  help  artists  to  understand  the  abstract 
ideas  expressed  by  such  drawings.  Here  we  are  running  the  risk  of 
oversimplifying  Francis’  work  considerably  for  we  cannot  jet  match 
the  quality  and  the  complexity  of  the  figures  in  that  book,  via,  the 
two  figures  reproduced  in  (10]~  these  are  known  as  the  ‘tetrahedral 
eight  knot'  and  have  to  do  with  Thurston’s  acclaimed  work  ^cf.  (12) 
for  a  popular  account)  in  3-manifolds. 

A  topological  picturebook  may  sound  somewhat  futile  vis-k-vis  the 
fact  that  sketching  and  visual  presentation  of  topological  construe- 
lions/ proofs  axe  slowly  losing  their  stronghold  they  once  held  It  may 
appear  that  the  science  of  the  ‘deformation  of  shapes’  is  becoming 
sterile  in  terms  of  figures.  However,  we  believe  that  clrere  is  definite 
place  for  a  topological  picturebook  of  the  sort  to  be  described  here. 
The  following  excerpt  succinctly  explains  our  view  (10), 

The  pedagogy  that  underlies  the  entire  book,  and  which 
I  bring  out  specially  here,  comes  from  Bernard  Morin  of 
Strasbourg,  Pictures  without  formulas  mislead,  formu¬ 
las  without  picture*  confuse.  I  don’t  know  if  Bernard 
would  say  it  this  way,  but  St  is  how  I  have  understood 
his  work... Morin’s  vivid,  pictorial  description  of  his  bold 
constructions  has  inspired  their  realisation  in  many  a  draw, 
ing,  model,  computer  graphic  and  film.  But  he  insists  that 
ultimately,  pictorial  descriptions  should  also  be  clothed  in 
the  analytical  garb  of  traditional  mathematics.” 

2  THE  SWEEP  PARADIGM 

A  class  of  solids  called  nonprofiled  sweep  objects  is  defined  by  two 
parametric  curves,  a  2-D  contour  and  a  3-D  trajectory.  The  contour 
is  moved  along  the  trajectory  to  generate  a  parametric  suriace.  A 
classification  of  sweep  objects  is  given  by  Broasvoort  et  al.  (8). 


•  Translational  sweep:  The  contour  is  arbitrary  and  the  trajectory 
is  a  straight  line. . 

•  Rotational  sweep.  -The  contour  is  arbitrary  and  the  trajectory  is 
a  circle. 

•  .Circle. sweep.  The  contour  is  circular  and  the  trajectory  is  arbi¬ 
tral)'. 

•  General  sweep:  Both  the  contour  and  the  trajectory  are  arbi¬ 
trary. 

A  non  profiled  generalised  cylinder  is  defined  by  an  arbitrary  closed 
2-D  contour  and  an  arbitrary  3-D  trajectory  (8, 15}.'<Here,  the  2-D  con¬ 
tour  c  can  be  defined  in  the  parametric  form  as  c(v)  =  (ct(v),cv(r)), 
where  t,  <  c  <  ty  and  c(v,)  =  c(ty).  As  a  parameter,  v  varies 
from  v,  to  ty  The  parametric  functions  cr  and  Cy’ trace  out  the  con¬ 
tour.  The  3-D  trajectory  t  can  be  defined  in  the  parametric  form  as 
!(u)  c  (tj(u),f>(u),l,(u)),  where  u,  <  u  <  uj  As  a  parameter,  u 
varies  from  u,  to  u/.  The  parametric  functions  ty  and  tM  trace  out 
the  trajectory.  For  a  nonprofiled  generalised  Cylinder,  the  contour  c 
moves  along  the  trajectory  t. 

The  spatial  relation  between  the  contour  and  the  trajectory  must 
be  defined  at  every  point  of  the  trajectory.  The  contour  plane  is  per¬ 
pendicular  to  Ci,  the  unit  vector  tangent  to  the  trajectory.  As  Ca,  a 
fixed  unit  normal  to  the  plane  of  the  trajectory  is  chosen.  Note  that 
<2  is  the  vector  product  of  e3  and  ej  Consequently,  it  is  possible  to 
present  a  nonprofiled  generalised  cylinder  as  a  5  D  vector  function  Tnj> 
of  two  parameters  u,  v  as 

Tn,{u,v) »  *,(«),  e*(v) « <2(«),cv(v)  •  c3(u)), 

where  u,  £  u  <  tij  and  c,  <  v  <  ty.  Here,  there  is  a  problem  if  the 
curvature  of  the  trajectory  is  zero.  In  this  case,  some  approximations 
are  necessary.  For  example,  the  direction  of  the  tangent  vectors  for  the 
first  and  the  last  points  of  the  trajectory  can  bo  selected  as  directions 
of  the  first  and  the  last  segments.  For  other  points,  the  direction  of 
the  vectors  as  specified  by  the  previous  and  the  next  points  can  be 
selected. 

If  some  deformations  are  necessary  on  a  nonprofiled  generalised 
cylinder  by  scaling,  then  a  profiled  generalised  cylinder  can  be  defined 
as  a  mathematical  model  (8, 15).  The  contour  can  be  scaled  indepen¬ 
dently  in  x  and  y  directions,  and  is  determined  by  e *  and  e3.  Foi 
each  point  of  the  trajectory,  two  scale  factors  Sr  and  5,  are  sp.-cified 
The  scale  functions  are  expressed  in  terms  of  the  parameter  of  the 
trajectory;  S(u)  =  (5,(u),  $„(«)),  where  vg  <  t*  <  «/,  If  the  scale 
factor  is  substituted  in  defined  m  the  previous  section,  then  it  will 
have  an  effect  only  on  the  contour  functions,  and  the  resultant  profiled 
generalised  cylinder  will  be  defined  as  a  5-D  vector  function  Tf. 

r,(ti,»)  =  (t,(u),r>(tt),  t,(«), 5,(u)  cA«)  <j(«),S,(u)  c„(t>)  e3(ttl), 

where  u,“  <J  u  <  u/  and  tv  $  t>  <  Vf. 

3  NORMAL  FORMS  FOR  SURFACES 

Classification  of  2-mamfolds  is  a  completely  solved  problem  of  topol¬ 
ogy.  Here  we  omit  classical  introductions  to  the  subject  (cf.  Ringel 
(16)  for  that)  and  just  summarise  the  results 

The  plane  representation  of  a  polyhedron  is  all  the  information 
provided  by  the  polygons  in  the  plane  representation,  the  pairing  of 
the  sides  and  the  orientation  of  the  sides  (before  identification). 
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Given  a  polyhedron  Paadltt  symbcSc  wjcwtusk#  tdtM  « 
one  defines  four  elmecicrg  opt  returns.  The  property  id  tio*  opera 
tioM  ss ,tkat  although  they  <Soch2rge  P  a*  **3  as  S,  the  surface  itedS 
is  aZi  dstsgtd.  The  dementary  cpmtxszs  are  as  foGcrars: 

•  SU1  (Snbdmrion  of  di=*z*5c«  coe):  An  edge  of  the  polyhedron 
is  divided  bio  two  zxr*  edges  by  laibg  an  saner  poet  efthe 
edge  as  an  additional  vertex. 

•  <#  l  (Compoctkmcd  dimension  coe):  Tils  is  the  reverse  oper¬ 
ation  of  SUL 

•  SC2  (Subdivision  of  dimension  t«©>  Two  tcti»  of  a  poJygon 
ia  the  polyhedron  w3  be  connected  by  ar*v  edge  dividing  tie 
polygon  Is  to  two  Be*  polygon*. 

•  002  (Composition  o' dimension  tvs):  This  Is  tie  reverse  oper¬ 
ation  of  SU2. 

Two  pdyhedra  P  and  P*  are  said  to  be  clementcrifj  related  If  P 
caa  be  t  rarsformed  Into  P*  by  usiag  a  finite  number  of  tie  elementary 
operations  SU1,  C01,  SC2  asd  002-  A  polyhedron  Is  called  orientcMc 
if  ose  caa  choose  aa  orientation  for  each  polygon  such  that  each  edge 
Is  used  ia  both  possible  directions*  It  tares  oat,  by  the  fundamental 
theorem  of  the  classification  of  2-manifolds,  that  each  polyhedron  is 
denies  tardy  related  to  ose  of  the  following  Dorsal  form; 

Oh)  Cfl- 

(II,)  °»Mi  -  h  -  ~ br~ 

(Ct)  eiCiCjcj-.-CjC, 


Fig.  1  A  k-tori  and  a  knot  drawn  by  our  program 

4  REALISM 

There  are  assorted  techniques  that  can  be  used  to  gj\e  a  realistic  view 
of  an  object  as  a  2-D  image.  The  following  are  some  important  one* 
taken  from  Francis  [10]: 

1.  ‘For  complicated  objects  it  Is  often  impossible  to  find  a  view 
which  does  not  hide  some  important  structure  behind  a  surface 
sheet.  One  remedy  is  to  remove  a  regular  patch  from  the  object, 
creating  a  transparent  1 rindou  through  which  this  structure  can 
be  seen  in  the  picture/ 

2.  'To  distinguish  an  edge  whose  other  face  is  hidden  from  an  edge 
which  merely  separates  two  visible  faces,  drafting  teachers  rec¬ 
ommend  heavier  lines  for  the  former  relative  to  the  latter.' 

3.  ‘The  shading  technique  I  use  makes  no  pretense  of  accuracy  and 
realism.  It  merely  encodes  positional  information  and  helps  dis¬ 
tinguish  rounded  contours  from  sharp  borders.  It  is  based  on  a 
few  optical  principles.' 

4.  ‘Line  patterns  based  on  boxes  in  affine  projection  suffer  from  the 
Keeker  cube  illusion:  ‘Whirl  Is  front  and  which  is  back?’  Thick¬ 
ening  the  facing  borders  helps  decide  which  view  is  intended/ 


Fig.  2  A  lores  drwn  byocr  programs 
5  CONCLUSION 

Tit  cUr*Oxn rf  lb-  7>  .rti  tlx  at:  a  SUKgg3&e«£  tcrex= 
fit,  1  tea  m  be  rodent!  bj  idMtbg  si!,  £te  court*  yizzi  to 
produce  tie  cotou  nd  »  rate.  Oti- fodisou!  sis  a  »  Sit 
dtuds  of  tie  process  Ooa  lie  rstr.  U  tiler  ««d*,  tie  aer*2 
dogs  object*  2s  iS  be  or  sb. b  doerg  fcefrbraic-ftrrsSi^  eaorp 
bg  oet  i4i  tevrt  o?er«i».  US*  b  b  tie  precise  *^rh  sdPotrodV 
sietddsg  system  [!*J- 
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Insoducrion  : 

T be  modelling  of  the  variable  speed  drives  is  so  complex  that  it 
males  their  analysis  difficult.  This  complexity  is  all  the  greater 
as  the  machine  is  ah  alternative  one.  In  the  case  of  synchronous 
machines,  or  of  turbogenerators,  the  model  is  non  linear  and 
highly  on&xed. 

Most  studies  use  linear  models  and  neglect  the  fast  transients, 
which  do  not  modify  the  working  of  the  machine  as  a  whole. 
These  simplifications  are  based  on  the  knowledge  of  the  physical 
phenomenon  and  not  on  an  accurate  model  analysis.  Actually,  it 
can  be  shown  that  these  approaches  are  agregation  ones.  So  they 
present  the  same  drawbacks:  the  informations  on  the  ne ;lected 
modes  cannot  be  taken  into  account. 

AnoVer  difficulty  is  to  evaluate  the  sensitivity  of  the  errors  on 
the  measured  parameters  toward  the  simplified  model.  This  is 
quite  impossible  to  achieve  when  the  simplified  model  is  not 
obtained  by  a  rigouroas  mathematical  technique. 

We  propose  a  multitime  scale  approach  to  separate  the  model  into 
reduced  subsystems,  which  allows  to  consider  the  fast 
behaviours,  ie  transient  and  suboansient  operations,  in  the  case  of 
a  synchronous  machine. 

i-SimplificaUQn  methods: 

The  equations  of  a  synchronous  machine  using  Park’s 
transformation  can  be  represented  by  the  field  winding  and  the  nd 
and  nq  damper  windings  in  a  set  of  d*q  axis  bound  to  the  rotor 
(fig.  la).  The  model  is  then  given  by  two  d*q  equivalent  circuits, 
(fig  lb).  The  stator  (la,  ra)  and  the  field  excitation  (If,  rj)  are 
separated  by  the  damper  windings. 

The  most  commonly  used  simplification  neglects  the  stata* 
resistance  (ra  =  0),  i.e.  supposes  that  the  stator  magnetic  flux  Yd 
and  Vq  reach  instantaneously  the  steady  state. 

A  structural  simplification  consists  in  using  only  one  damper 
winding  for  the  d-axis  and  two  for  the  q-axis  (nd  n  1, 
nq«  2):  in  this  case,  the  model  is  2x2  with  the  time  constants  : 

CTd,  T“q)  and  (Td,  Tq)  :  respectively  the  subtransient  and 
transient  short-circuit  rime  constants, 

(T"do,  T’qo)  and  CTdo,  Tqo) :  respectively  the  subtransient  and 
transient  open  circuit  time  constants. 

A  plotting  of  the  frequency  character^  responses  of  1/Xd  Os)  and 
1/Xq  Os)  can  be  computed  by : 


1  i  (I  +jsTd0Ml+jsTd0) 
x<jOs)“V  (I  +  jsTdMl+js'Td) 


q-aris 


Fglh.  Equivalents  circuits. 

I  i  (1 +jsTq0H! +jsT"q0) 

SJF)S\  (l+jsTq>(I+jsT-q) 

using  model  (1].  This  plotting  exhibits  two  distinct  behaviours : 
transient  behaviour  and  established  behaviour,  (fig.2). 
Nevertheless  the  equations  of  the  model  are  coj  t-ected  by  d-axis 
damper  winding  flux.  An  additional  hypothesis  (n>i  *0),  121 
valid  for  the  solid-iron  rotor,  allows  to  reduce  the  model  again. 
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T*>  1.2*99  r<WSK6 
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r<flj0l7i  T-<-07WS  T-M006I 
rqlV2.l«6  r«/V3JII7  T~4>-CXK*6 


■odd  2*4  *  >» 

XM1* 

XWU»5  X*HM7»  r<KJ'«  X”«M  29J1 
rMaw  t-momi 

r<»7>4  r«MU956  T"<JO*Oj56?3  r"»%00023 


For  a  greater  number  of  damper  windings  (nd  b  %  nq  *  3  : 
model  2x3)  some  authors  (3)  have  measured  new  time 
constants  T"d  and  T”q  smaller  than  T*d  and  T"q.  We  have  a 
larger  number  of  observed  dynamics  with  different  time- 
constants.  The  computational  locus  curves  are  similar  to  the 
experimental  curves  of  (3J .  The  new  behaviours  are  called  sub¬ 
subtransient  with  the  time  constants  (T"d,  T"q)  and  (T^d,  T"q) 
and  so  on. 

We  have  validated  the  existence  of  these  different  dynamics  for  a 
model  2x4,  (fig.  2). 


For  a  salient  pole  rnachine.the  asynchronous-torque  and  the 
reactive -current  characteristics  obtained  from  the  sum  (1/Xd(j$) 
+  1/Xq(js)  )  are  plotted  in  fig  3  for  the  three  equivalent 
circuits  (  2x2,  2x3  and  2x4  ).  The  asynchronous  starting 
emphasies  the  importance  of  the  part  played  by  the  number  of 
damper  windings  considered  in  the  model.  The  large  number  of 
them  shows  that  the  operation  of  a  synchronous  machine  can  be 
compared  to  an  induction  machine  with  a  squirrel-cage  winding. 

All  these  simplifications  are  in  fact  singular  perturbation 
hypothesis. 

The  mechanical  and  electrical  parts  are  decoupled  using  the  stator 
synchronous  speed  w*. 

The  choice  of  a  small  parameter  multiplying  the  derivatives 
of  the  fast  electrical  parts  is :  e  =  1  /to*,  (4). 

The  electrical  equations  are : 


iy--A(a).y+v 
A  (o)  =  r.L'1  +  oij* 
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where:  co  is  the  rotor  speed 

and  v  arc  respectively  the  flax  and  the  vduge  vectors  given  by 

Y  =  (Yf.  YD1 .  VD2 . VDod.  ¥Q1.  YQ2 . VQoq.  Vd.  Y<j)T  ' 

y-=(Vf,0,0, ... _ AO,.—  0,0, - ,Vd.  Vq)T 

and  the  nsitiix  distances: 

r  =  (Vf,  IDl.  H)2. ...  n>nd.  XQ1.  *Q2.  •».  fQflq.  *a.  ra)-Iod  ♦  oq  +  3  • 
where: 

Wnq+3  «  a  (nd  +  nq + 3)  x(nd  +  nq + 3)  identity  mairix. 

A^>),L  and  J*  arc  (nd +' nq  +  3) x(nd  +  nq+ 3) matrices-. 


Lf:(nd  +  nq+l)x(nd  +  nq*l)  self  rotor  matrix , 

M :  (nd  +  nq  + 1)  x  2  mutual  self  inductance. 

For  the  case  nd  »  nq  =  1,  the  eigenvalues  of  A  ((d)  are  a  $et  of 
three  group?.  (5)  * 

,  (  .  i  r.(  1  .  M 

A,“2^‘dTL-qJ 

%  is  the  armature  time  constant 

I  _L_  i 

^  "  T  Dj  ”  "  T“d 
'  1  1 
14”‘Tq>  "T-q 

and 

,  1  '' 

s"'v  '5 

with:  L’fctJ^.Lf  and  Tf=*L‘f/rf  ^  Td. 

The  flux  advance  with  three  time  scales : 

(¥<j,  3q>  :  subs  transient  flux  with  Tj  »  (Ujt, 

transient  flux  with  and 

Yf :  established  flux  with  tj  «  L 


For  the  case  nd  ■  nq  «  2,  four  time  scales  must  be  used  And  thc 
damper  windings  flux  (Yoi.  ^Ql)  and  C^D2.  ^Q2)  arc  transient 
quantities  depending  on  the  reduced  times  ‘  t2  ®  t  /Tdi  and 
t3  *1/Td2« 

The  former  time  constants  T"*d  and  T" q  are  defined  by : 
TD2«=rr,(j,  TQ2»T"q  and  TD20s,'T"dO  .  TQ20*T"q0. 

For  the  cases  nd  and  nq  >  1,  the  different  operations  can  be 
summarized  on  the  following  scheme : 


The  stator  fluxes  arc  the  fast  modes  and  so  are  the  subtransient 
variables.  The  expression  of  the  quantities  L“d  and  L*q  depends 
on  the  nd  and  nq  number  of  damper  windings  .The  rotor 
operation  is  slower  and  describes  the  transient  dynamics . 


Conclusion : 

The  singular  perturbation  reduction  of  the  model  sets  the  different 
behaviours  of  the  synchronous  machine  in  the  right  order.  The 
sub-subtransient  modes  arc  in  fact  transient  modes.  The  damper 
windings  contribute  to'these  dynamics.  They  arc  to  be  taken  into 
account  in  the  case  of  a  synchronous  machine  supplied  for 
example  by  a  static  convener  where  the  observed  modes  arc  very 
fast. 
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Abstract  -'Using  recently  developed  software  package  for  unco¬ 
nventional  digital  filter  analysis  we  study  undesired  modes  of  ope¬ 
ration  in  second-order  digital  filters  realised  in  direct  form.  Apart 
from  many  kinds  of  parasitic  overflow  oscillations  we  confirmed 
by  computer  experiments  the  existence  of  se\eral  intersting  ty¬ 
pes  of  behavior.  These  include  chaotic  oscillations  and  fractal 
patterns  of  trajectories  in  the  filter  with  modular  arithmetic  and 
devil’s  staircase  changes  in  periods  of  oscillations  in  the  filter 
with  saturation  arithmetic.  Some  new  bifurcation  phenomena  like 
’’fan-type”  bifurcation  sequences  have  also  been  found. 


InttQductipn 

Abnormal  modes  of  operation  in  signal  pre  ssing  circuits, 
both  analog  and  digital,  for  a  long  time  attracted  attention  of 
scicntioi.  In  digital  systems  the  problems  of  quantisation  effects 
and  overflow  oscillations  have  been  analysed  by  many  researchers 
(see  eg  (1),  (6),  (8])  Chua  and  Lin  [2]  reported  on  aperiodic  (cha¬ 
otic)  oscillations  from  second  order  digital  filter  employing  2’s 
complement  (modulo)  arithmetic  and  operating  at  the  border  of 
the  stability  domain.  Successively  in  the  paper  (4)  we  studied  bi¬ 
furcation  phenomena  in  a  digital  filter  with  saturation-type  adder 
overflow  characteristic*  For  the  sake  of  in-depth  study  of  these 
phenomena  wc  developed  a  set  of  specific  computer  programs 
implementing  newly  developed  algorithms  (5)  for  simulation  of 
second  and  third  order  filters  with  possibilities  of  choosing  the 
way  of  internal  coding  of  signals  and  type  of  arithmetic  used. 
The  following  types  of  analysis  have  been  implemented  so  far : 

1.  Time  evolution  of  system’s  responses, 

2.  State-space  plots  including  possibilities  of  observation  of 
fractal  trajectory  structure, 

3.  Investigations  of  limit  cycles : 

•  computation  of  periods  of  orbits, 

t  computation  of  winding  numbers  of  orbits 

•  computation  of  basins  of  attraction  of  periodic  orbits, 

4.  Computation  of  onc-paramctcr  bifurcation  diagrams, 

5.  Construction  of  winding  number  diagrams  ("devil’s  stair¬ 
case”) 

Below  we  present  chosen  results  of  our  simulation  studies  in 
second  order  digital  filters  whose  dynamics  are  described  by  a 
state  equation  of  the  form  (1-2)  In  particular  our  interest  is  con¬ 
centrated  on  nonlinear  effects  caused  by  the  adder  overflow  non¬ 
linearity. 


xi(k+l)  =  z2(fc)  (1) 

**(*+ 1)  =  F(fe, (*)  +  «>(*))  (2) 

Where  :  a,b  G  R, 

FR  -*  R ,  F(a)  =  a  for  |o|  <  1,  F(o)  =  1  for  >  1,  F(o)  =  -1 
for  a  <  -1  (saturation  arithmetic)  or  F(a)  =  a  for  |<r|  <  1, 
F(a)  =  c  —  2  for  a  >  1,  F(a)  =  a  +  2  for  a  <  —I  (modular  or 
2’s  complement  arithmetic). 

Fractal  structure  of  trajectories  in 
■the.finer.with  modular  arithmetic 

Among  very  interesting  dynamic  behaviours  encountered  in 
the  filter  with  modular  arithmetic  the  most  interesting  is  the  cha¬ 
otic  motion  observed  for  b  =  -1  and  various  a  parameters  (2j.  In 
Figure  1  w'e  present  a  trajectory  observed  in  our  simulation  expe¬ 
riments  for  a  =  0  5  and  initial  conditions  =  -.6135, x*  =  -x*, 
30000  iterations.  It  has  self-similar  structure  -  repeating,  dimini¬ 
shing  empty  sets  of  chpsc-shapc  arc  clearly  visible. 


Fig.  1:  Chaotic  trajectory  of  a  second  order  digital  filter  with  modular 
overflow  characterise  for  o  =  05,6=  -1.  Note  the  fractal  self-similar 
structure  of  the  limit  set. 

Complex  bifurcation  sequences  in 

Using  our  simulation  package  wc  were  able  to  discover  several 
new  interesting  phenomena  in  the  case  of  a  filter  with  saturation- 
type  adder  overflow  characteristic.  Fust  of  all  we  discovered  very 
complex  bifurcation  sequences  for  the  filter  operating  for  some 
parameter  choices  outside  the  linear  stability  sector  (4).  One  of 
such  bifurcation  sequences  is  shown  in  Fig.2. 
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Fig  2.  Typical  bifurcation  sequence  observed  in  ibe  digital  filter  with 
saturation  arithmetic  for  the  parameter  choice  outside  the  stability 
sector. 


An  extremely  interesting  result  of  our  study  of  nonlinear  dy¬ 
namics  of  digital  filters  is  the  fractal  structure  of  basins  of  attrac¬ 
tion  of  some  periodic  orbits.  For  some  parameter  choices  the  set 
of  initial  conditions  for  orbits  appoaching  even  simple  periodic 
orbits  (as  we  found  eg.  for  a  period  four  orbit)  has  fractal  struc¬ 
ture  -  its  area  is  finite  but  the  length  of  its  boundary  tends  to 
infinity. 

Conclusions 

Our  simulation  study  revealed  extremely  interesting  pheno¬ 
mena  associated  with  inherent  nonlinearities  within  the  digital 
filter  -  namely  its  adder  overflow  characteristic.  The  following  we 
found  most  interesting  and  requiring  further  study  : 

•  There  exist  extremely  complex  (chaotic)  trajectories  both 
in  the  case  of  modular  and  saturation  arithmetic, 


It  is  worthwhile  pointing  out  the  existence  of  extremely  com¬ 
plex  orbits  (probably  nonperiodic)  and  unusual  bifurcation  se¬ 
quences  which  we  called  "fan-type”  shown  in  the  magnified  pic¬ 
ture  in  Fig  3. 


Fig.  3.  New  "fan-type"  bifurcation  sequence  observed  in  the  digital 
filter  with  saturation  arithmetic. 


•  Fractal,  self-similar  structures  of  limit  sets  of  trajectories 
are  often  encountered, 

•  Existence  of  very  complex  periodic  orbit  structure  with  de¬ 
vil’s  staircase  sequence  of  changes  of  winding  numbers  has 
been  confirmed, 

•  New  bifurcation  structures  eg.  "fan-type"  have  been  found, 

•  Several  cases  of  fractal  boundaries  of  basins  of  attraction 
of  periodic  orbits  were  found  in  computer  experiments. 
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For  some  parameter  ranges  we  discovered  an  abundance  of 
periodic  orbits  of  all  periods  depending  on  the  parameter  value 
chosen  •  the  filter  reproduces  so-called  devil’s  staircase  structure 
of  changes  of  orbit  winding  numbers  when  changing  the  bifurca¬ 
tion  parameter  •  sec  eg.  Fig.  4. 
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Fig.  1:  Complete  devil’s  staircase  structure  of  winding  number  changes 
observed  in  the  digital  filter  with  saturation  arithmetic. 
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AN  EFFICIENT  IMPLEMENTATION  OF- VARIABLE  -INTERCHANGE  METHOD  FOR  THE 
SOLUTION  OF  AN  ELECTROCHEMICAL  MACHINING  PROBLEM 
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,  r 

„<  d  r*  i>  r 


in  D  (1) 


with  boundary  conditions 


Abstract-; A-  two  dimensional  electrochemical 
machining  problem  which. consist  of  a  circular 
anode  placed  inside  a  circular  cathode  is 
solved  using  a  variable-interchange  method. 
The  notching  of  an  initially  circular  anode 
due  to  circular  cathode  is  studied  by  solving 
the  resulting  problem  numerically.  ; 

1.  INTRODUCTION 


U  =-v  <  0  (2. a) 

on  the  cathode  r  =  a  -and  .. 

=  0 

at  the  anode  s(r,fc)=o. 

The  second  condition  which  governs  the  chan¬ 
ge  with  respect  to  time  of  the  anode  surface 
is: 

orv  5(r,tV-0  .  (2>c) 


There  are  many  real-world  problems  in  which 
the  governing  differential  equation  is-  known 
but  thesolution  domain  is  not  completely 
specified.  The  term  free-boundary  problem  is 
commonly  used  when  the  boundary  is  stationary 
and  steady-state  problem  exists.  Moving  bound- 
ries  are  associated  with  time-dependent  prob¬ 
lems  and  the  position  of  the  boundary  ha3  to 
be  determined  a3  a  function  of  time  and 
space.  There  are, however,  some  Important 
problems  in  which  the  boundary  is  moving 
but  the  equation  is  elliptic, i.e.  they  are 
degenerate  problems.  Examples  are  provided 
by  problems  in  electrochemical  machining  and 
the  Hale-Shaw.  flow  associated  with  the  injec¬ 
tion  of  fluid  into  a  narrow  channel. 

Therefore,  the  following  discussion  .is  than 
given  in  the  context  of  degenerate  problems. 
Electrocv  -ical  machining  is  a  technological 
process  in  which  a  workpiece  is  placed  a3  the 
anode  in  an  electrolytic  cell  with  a  properly 
shaped  cathodic  tool  so  that  a  desired  shape 
of  oathbdidc  work  piece  is  obtained  by  the 
electrochemical  process. 

Hougaardfg]  gave  references  to  pioneer  papers 
and  determined  the  stationary  anode  profile 
using  complex  .^variable  methods.  Chritiansen 
and  Rasmussenf  23  developed  a  method  in  which 
the  potential  problem  was  formulated  as  an 
integral  equation  of  the  first  kind. Later, 
Hansen  and  HolmC5)used  an  integral  equation 
of  the  second  kind  for  similar  problem.Mcyer 
(7] developed  a  method  based  on  the  method  of 
lines. Crowley  (30  used  enthalpy  type  formula¬ 
tion  and  Elliott  C4)  proposed  an  elliptic 
variational  inequality  formulation  for  this 
problem. 

In  this  note,  single  variable-interchange 
method  used  for  the  annular  electrochemical 
machining  problem  by  the  aid  of  Boadway'sCO 
transformation  and  this  implementation  is  more 
economical  computationally  on  this  specific 
problem  in  comparison  to  its  predecessors, 

2.  JORMULATiON 


Thus,  we  have  a  one-phase  problem  with  an 
elliptic  type  equation  and  Stefan  type  boun¬ 
dary  condition  with- non-zero  latent  heat. 


.3.  SOLUTION  PROCEDURE 

For  geometries  of  the  .problems  which  involve 
circular  boundaries,  it  is  more  convenient  to 
work  in  polar  coordinates.  Therefore,  we 
require  the  solution  of  the  equations  1)  in  D 
bounded  byr^a,  on  which  u»-v  and  by  moving 
interface  r=s(0,t)  on  which  a^O  forO^0^W2 
The  additional  conditions  are: 


Uo*0  at  0*0  and  tt/2,  (3) 

The  single  variable-interchange(see,3oadway 
CO)  of  Q(r,e)  to  r(u,0)  renders  the  equation 
(1)  to  the  following  form: 


+  r( 


in 


'  Sr 
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Au  ' 36  AuAO 


jr 

An 


AV  .  Ar  Ar  aV 

is1'"  A  0  A  6  A  u1 


J  (4) 


However, themoving  interface  condition  on  the 
anode  must  be  transformed  into  an  expression 
for  the  speed  of  tho  anode  along  each  ray. 
Therefore,  if  r  be  the  non-dimensional  posi¬ 
tion  vector  for  the  anode  surf ace, then, 

hr  _  ^  S  ^  h  S  \  ^  U 

It  is  convenient  to  repace  ©in  terras  of 
^u/ar.Since,  the  tangential  derivative  vani¬ 
shes  on  the  anode,  we  can  write 


An  approxaate  quasi-steady  model  for  the  pro¬ 
cess  consist  of  a  boundary  value  problem  for 
the  potential  between  the  electrodes  and  an 
equation  relating  tho  change  of  the  anode  sue 
face  to  the  normal  gradient  of  potential  at 
the  anode. Hence,  the  mathematical  formulation 
is 


^u^e=-(^/^8](^u/6r) 


So  that  the  gradient  condition  on  the  surface 
lead  to 
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Discretization  of  equations  :For-- numerical  sor 
lution,  the  transformed  potential  equation 
(4)  may  be  discretized  and  rearranged  to 
yield , 

fi(r'0V(rJ,’)N  B(r,-)f;j>,+C(nj)=o  (6) 


<7.ij 7 f ij-* Q:r, 
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and  rjj  in  the  nth  iteration  atep  value  of  r;j. 
The  moving  boundary-  condition  given  by  equa¬ 
tion^)  can  also  ba  discretized  erpilicitly 
to  give 

k  Vl  Vmj, 

where  ijjj  is  the  kth  time  step  value  of  rNj. 
The  moving  boundary  equation! 7) , however  can 
easily  be  solved  by  an  explicit  method.  But, 
the  equation!  6) ,  if  the  values  of  q*  are  all 
known  in  nth  iteration  step, for  example, may 
be  solved  by  regula-falsi  method  for  next  ite¬ 
ration  stop* 

Algorithm;  To  complete  the  numerical  procedu¬ 
re,  the  test  problem  can  easily  be  solved  by 
the  algorithm  stated  below: 

a)  solve  the  clasical  mixed  boundary  value 
problem  given  by  equation! 6)  and  relavent 
boundary  conditions  as  the  anode  treated  as  a 
fixed  surface, 

b) use  equation! 7)  to  determine  a  new  anode 
burface  which  becomes  the  fixed  boundary  in 
tho  next  time  step, 


c)fgo  to. step,! a); 

.4*  NUMERICAL  EXAMPLE 

To  show  the  applicability  of  the  method, let 
^consider  the  notching  of  an  initially 
circular  anode  due  to  circular  cathode. In 
this  problem, it  is  assumed' .that'  the  anode 
surface  isrinsulated  everywhere,  except  for 
9fc(  3n/8,5n/8)  and  0£(lln/8,13n/8),  so  that 
it^can  wear  away  only  in  these  two  segments. 
It  is? also  assumed  thatr no. undercutting  occur 
in  the  notch. The- actual .initial  electrode 
surface  are>rr*10:  for  cathode  and' s! 0,0) *9.5 
for,  the  anode.Y/e^have  selected  SueO. 025, 
ft/ 40  and£fc«=0.04.  Figure  1.  shows  a  typical 
result  when  the*anode  isploted  after  every 
ten  time  steps. Because* of  symmetry,  only  the 
first  quadrant  is; given  and  in  order  to 
magnify  the  erosion  of  the  anode,  the  radius 
ris  scaled  logarithmic aly  according  to  (pa 
1  ni^AD  where  f^is  the  scaled  value  of  ,r  • 
The  actual  depth  of  the  final  notch  for  s(rt/2 
,2) is  found. to  be  7.81  where  the  correspond¬ 
ing  actual  depth  of  final  notch  obtained  by 
Meyer  Cflby  using. the  method  of  lines  is  7.88. 


•V*AXI$ 


Fig.l.  The  plot  of  the  anode  and  cathode 

surfaces  during  electrochemical  erosion 
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ABSTRACT  -  This  paper  presents  an  integrated, 
and  novel,  structure  for  Intelligent  Computer 
Assisted  Learning  (ICAL)  Systems'/Environments. 

The  environment  is  a  multi-language  software 
platform  upon  which  an  interactive  CAL  system 
is  combined  with  a  set  of  * knowledge/databased1 
systems  for  student  modelling,  responding  to 
user's  queries,  monitoring/diagnosing,  and 
assessing  the  user  and'  the  environment.  The 
environment  concentrates  on ^ the  area  of 
Quantitative  Methods;  and'  in  particular  the 
subject  of  Linear' programming. 

Testing  has  taken  place  on  a  formal  baa is  bn  a 
wide  range  of  groups  from  students "  (e.g.  at 
all  applicable  level's)  to  specialists  (e.g. 
cognitive  psychologists).  These  results 
indicated  that  90%  of  the  users  favoured  the 
construction  of  this  type  of'  environment. 
This  ir.  itself  has  been  considered  a -valuable 
point  for  the  design  of  a  structural 
methodology  for  such  ICAL  environments. 

INTRODUCTION  -  Teaching  environments 
encompass  a  range  of  disciplines  from 
Artificial  Intelligence,  Cognitive  Psychology 
and  Linguistics.  The  breadth  of  application 
and  the  complexity  has  not  always  incorporated 
all  areas  to  their  full’  potential, 
consequently  developments  have  concentrated 
upon  specific  design  Issues.  Examples  include 
SCHOLAR  and  SOPHXB'for  modelling'  issues,  WUSOR 
and  QUADRATIC  for  pedagogical  issues,  BUGGY 
and  PROUST  for  diagnostic  issues,  and  NEOMYCIN 
and  ACT  for  expert  systems/cognitive  issues. 

It  is  the  latter  category  which  has  provided 
the  most  advanced  features  and  illustrated  the 
effective  combination  of  fields.  However, 
such  systems  were  constructed'  in  modular  form 
and  reached  very  little  commercial  success; 
rather  sophistication  of  the  domain  or 
specifity  of  the  design  placed  tremendous 
restrictions  on  them.  Environments  have  been 
created  such, as  Smithtown  and  Bit-Sized  Tutor 
which  attempt  to  construct  an  enclosed  shell 
around  the  user  by  which  subject  areas  within 
a  curricula  are  studied. 

The  lack  of  acceptance  has  not  been  solely  a 
result  of  the  research  orientated  developments 
but  also  the  finance  and  manpower  available 
within  the  establishments  (educational  and 
industrial)  for  which  they  were  intended. 
Consequently,  this  project  discusses  the 
construction  of  a  shell,  which  emulates  the 
best  features  of  a  human  tutor,  whilst 
redressing  the  obstacles  outlined  above. 

THE . ENVIRONMENT.  -  The  environment  is  named 

Intelligent  Linear  Programming  Tutoring  System 
( ILPTS)  (1)  and  consists  of  4  major  sections 
which  were  considered  to  be  contained  within  a 
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human  tutor.  These  were  modelling,  core 
teaching  material,  a  query  system  and  a 
monitoring/diagnostic  system.  We  will 
consider  each' of  these  sections  in  detail,  in 
each  case  describing  .the  concepts  behind  their 
design. 

(1)  Modelling  -  The  ability  of  any  system  . to 
interact  or  intercede  in  a  constructive  manner 
which  is  both  useful  and  helpful  to  the  user 
is  a  prerequisite  of  any  system,  and  was,  as 
stated  earlier,  a  major  focus  of  attention  in 
ILPTS.  There  are  three  sections  which  model  a 
student/user's  behaviour.  * These  are; 

(a)  Bandwidth 

(b)  Target  knowledge 

(c)  Difference'  between  expert  and  student/ 
user. 

Of  the  many  possible  ways  of  implementing  the 
above  20  have  been  attempted.  These  have 
consisted  of  the  simplest  combinations  which 
are  the  easiest  to  implement,  but  do  not 
always  exert  constructive  control  over  the 
student/user  leading  to  results  which  are 
difficult  to  assess.  ILPTS  involves  the  use 
of  the  mental  states  of  the  student/user  using 
a  declarative  '  target  knowledge  searching 
strategy  working  on  a  bug-part  library  with  an 
expert  system  as  a  diagnostic  program  (see 
Figure  1). 


4  »  HI  li 

e  i  a  c  d  a  b 


Figure  1 


The  modelling  system,  together  with  the 
pedagogical  interaction,  controls  the  overall 
operation  of  the  environment  and  operates  via 
a  natural  language  interface  with  the  user, 
sitting  like  a  driver  on  top  of  the 
environment.  The  declarative  nature  of  the 
modelling  system  allows  a  psychological 
picture  of  the  student/user  to  be  developed. 
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the  incorporation,  of  probability  and 
uncertainty  permitting  the  construction  of 
individualised  study  program's  -to '  be  devised. 
The  profile  of  the  student/user  is  retained 
and  incorporated  into  the  decision  process  of 
^further  study' programssWhen>; the  environment  is= 
next  used.  The  nature  of  such  a  modelling 
system  permits  the  detection  .of, 
misunderstandings  and  misconceptions  which  the 
student/user  is  perceived  to  include  in,  their 
interpretation  of  the  teaching  material. 

(2)  Query  -  The  ability  to  query  the  subject 
area  in  response  to^material  taught  is  innate 
in  any  lecture/tutorial/seminar  session..  This 
not  only  improves  understanding  but  can  also 
be  used  to  highlight Careas  of  weakness  in  the 
light  of  current  material  being  studied.  The 
system  comprises  two  sections;  terminology 
definition  and  a  problem-solver.  These  are 
discussed  below. 

(a)  Terminology  Definition  -  The  terminology 
used  within  Linear  Programming  (or,  indeed, 
the  majority  of  areas  chosen  for  ICAL 
systems/environments)  is  wide  ranging,  and 
diverse.  The  field  relates  to  other  areas 
such  as  business  and  computing  and  is  included 
in  light  of  current  curricula.  Consequently, 
the  teaching  aspects  of  the  environment  are 
enhanced  by  a  system  that  allows  the 
student/user  to  obtain  the  required 
explanation. 

The  system  focuses  on  the  student/user's  input 
via  a  natural  language  interface  and  is  parsed 
In  a  manner  which  allows,  the  system  to  handle 
the  use  of  keywords,  grammar,  context  and  a 
combination  of  keywords. 

(b)  Problem-Solver  -  The  teaching  aspects  of 
many  subject  areas  within  ICAL 
systems/environments  Involves  the  use  of 
computers.  Linear  programming  is  no  exception, 
and  a  number  of  packages  are  incorporated  into 
a  wide  variety  of  curricula.  These  packages 
(from  mainframe  to  microcomputer)  were  studied 
and  the  best  features  of  each  chosen.  One 
major  point  which  was  noted  throughout  the 
study  was  the  lack  of  an  appreciation  for  the 
dual  representation  in  linear  programming. 

Consequently,  a  linear  programming  problem¬ 
solving  package  was  designed  and  incorporated 
into  the  environment.  The  package  includes 
the  following  features 

(1)  Development  and  manipulation  of  a  linear 
program 

(2)  Extensive  editing  facilities 

(3)  Report  generation 

All  of  the  above  are  equally  applicable  to  the 
primal  and  dual  representations. 

This  query  system  compliments  the 
corresponding  core  CAL  material  by  permitting 
communication  to  be  a  complete  two-way 
process. 

(3)  Co fo.  CAL  Material  -  The  pedagogical 
aspect  of  the  environment  is  in  three 
sections;  text,  tutorial  and  test  material 
which  at  present  is  set  at  two  abstraction 
levels  (provisional  and  professional).  The 
topics  covered  range  from  formulation  to 
duality  and  are  in  line  with  the  current 


curricula  requirements  for  which  the 
environment. is  'intended. 

(a)  Text  -  The  structure  of  text  is  vital  to 
the  manner-  in  which  the  student/user  will 
digest  the  major  points  to  be  conveyed. 

The  structure  and  consistency  of  the  layout, 
plus  the  control  of  the  system  ensure  that  the 
user  absorbs  the  required,  material  before 
proceeding. 

(b)  Tutorial- -  The  same  design  considerations 
as  with  respect  to  the  text  material  are 
incorporated  but,  in  this  instance  the  text 
provided  is  done  so  as  a  sequence  of  steps 
which  gradually,  build  upon  one  another  giving 
the  student/user  a. greater  appreciation  of  the 
corresponding'  text  for  that  topic  and 
abstraction  level., 

(c)  Test  -  This  subsection  interactively 
interrogates  the  'student/user's  understanding 
of  the  corresponding '  topic.  The  results  of 
the  tests,  which  comprise  of  questions  asked 
at  random,  are  used  to’  judge  the  overall 
effectiveness  in  the  change  of  the 
student/user's  profile. 

(4)  Monitoring  -  This  system  diagnoses  the 
student/user's  understanding.  This  process 
occurs  by  constructing  records  which  indicate 
the  studeht/user’s  .current  profile.  Report 
generation  facilities  are  also  available  by 
which  individuals/groups  (or  the  environments 
teaching  techniques  can  be  assessed). 

IMPLEMENTATION  -  The  environment  encompasses  a 
number  of  features  which  each  have  specific 
requirements.  This  demanded  an  effective 
combination  of  languages  which  would  allow  for 
the  production  of  an  efficient  shell  by  which 
the  concept  could  be  illustrated. 

RESULTS  -  Formal  testing  was  carried  out  on  a 
wide  variety  of  students/users  from  a  range  of 
backgrounds  which  encompassed  the  skills 
necessary  to  assess  such  an  environment. 


CONCLUSIONS  -  The  overall  aim  of  the  project 
was  to  create  an  environment  which  best 
emulated  an  ideal  human  tutor,  and  as  such 
placed  emphasis  on  modelling,  pedagogical,  and 
diagnostic  processes  as  and  when  necessary. 
The  hope  is  that  such  a  development  will 
assist  in  lending  to  the  construction  of  a 
design  methodology  for  such  systems,  while 
allowing  ideas  within  the  environment  to  be 
advanced  still  under  the  belief  that  a 
combination  of  relevant  fields  are  necessary 
for  such  ideas  to  illustrate  practical 
progress. 

RBE@EN.gES 

1.  An  Integrated  Structure  For  ICAL  Systems, 
M  J  McAlister  and  Dr  P  Smith,  submitted 
to  Journal  Of  Computer  Assisted  Learning, 
1991. 


1574 


A  LINEAR  SYSTOLIC  ARRAY  FCft 
THE' QUOTIENT-DIFFERENCE  ALGORITHM 

Octav  Brudaru 

Universitatea  "Al.  I.  Cuza*  Iasi 
Serainarul  Matematic  "A.  Myller" 
6600-  Iasi,  Roman  i  a 


Abstract.  It  is  presented  a  linear  systolic  array 
Tor ute  colucn-by-coluiftn  version  of  the  quotient- 
difference  algoritha  for  the  polyinooial  roots 
finding.  It  is  proved  the  correctness  oi  the 
design  and  it" is  computed" the  bus  bandwidth  and 
processor-utilization.  The  asyaptotic  performances 
of  the  array  arealso  estinated  and  optiuized. 


1.  Introduction 

In  this  paper  it  is  presented  a  linear 
systolic  array  implementing  the  column-by- 
cblumn  version  of  the  quotient-difference 
(GOOD)  algorithm  for  the  solution  of  the 
equation 


p(x)«a  x  +a  x  ♦.  ,.+a  *K>,  <1) 

0  1  n 

where  p  is  a  polynomial  with  real  coeffi¬ 
cients.  If  (x  )  is  the  solution  of  the 

k  k>-n+l 

associated  difference  equation  a  x  +a  x  ♦ 
M  0  k  1  k-1 

...+a  x  ,  for  which  x  «0,  j»net,...,-l 

n  k-n  j 

and  V1'  the  CCQD  algorithm  involves 

the  computing  of  the  following  recursions 
115, pp. 3213)  : 


q(k,.J+l)*d<kel, j)/d<k,j>q<k*l,J>,  <2> 

d(k,j)*q(k+l, j)-q<k,J)+d(k+l,J-l),  <3) 

J«l,2, . . . ,n-l,  where  q(k,l)*x  /x  and 

dCk,0).0,  for  k-0, 1 . K.  k+1  k 

If  no  two  consecutive  roots  of  p  have  the 
same  absolute  value,  then  lim  q(k,j)»r  ,  for 
k  tending  to  infinity  and  pCr^*©,^  J*l, 

r  <r  ,  J»1 , . . . ,n-l .  ‘ 

J  J+l 

In  C43  it  is  presented  a.  systolic  design 
implementing  the  row-by-row  version  of  the 
quotient-difference  algorithm. 

In  section  2  we  present  a  linear  systolic 
array  implementing  (2)  and  (3)  and  give  a 
correctness  proof  of  its  working.  In  section 
3  we  analyse  the  performances  of  the  array, 
while  some  variants  of  the  basic  design  are 
discussed  in  the  last  section. 


A  linear  systolic  array 
We  define  the  clock  tick  (CT)  as  the  time 
to  execute  a  division  and  suppose  that  each 


pi ocess i rig  element  IRE)  is  active  during 
every  CT.  f'h e  lime  is  denoted  by  I  and 
represents  Ih*  nunbei  of  Cfs.  Also,  1.10 
denotes  the  value  circulating  thtcugh  th* 
pin  having  the  label  t.  during  the  t-lh  Cl. 

X  X  x  / 

■  —¥  d(M,j-l) - > - diU)  — > 

/  X  X  X 

- ) - q(w;j)  - — J - 

X  X  X  X 

•  dttfyj-D - ) - dftti,  j)  —)■ 

/'  X  X  X 


Figure  I:  I he  precedence  const  laints 
The  digraph  representing  the  precedence 
cons tr lints  in  the  compute t ion  defined  b/ 
(2>  and  (3^  is  lepiesented  in  Figm*  1. 
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The  structure  of  the  basic  systolic  cell 
(BSC)  that  one  uses  in  the  sequel  is  given 
in -Figure  2(a).  The  PEs  in  BSC  are  depic 
ted  in  Figure  2(b)-(c>.  The  PE  in  Figure 
2(a),  performs  y(t*l)«x(t)  while,  the  PE  An 
Figure  2(b)  executes  z(t+l)*x(t)oy(t) ,  where 
o6{+,— /).  The  working  of  BSC  is  stated,  by 
Proposition  1 . 

D  (t+2)»B  (t-l)-Q  (t)+QB( t+1)  (4) 

out  in  in 

and 

Q  (t+3)*Q  (t)/DT(t+l)D  (t+2)  <5> 

out  'In  out' 


OR! 

dr2 

OR 

dr3 

QR0 


Figu.o  3:  The  linear  systolic  array 
The  linear  systolic  array  (LSA)  imple- 


aenting,  the  CCPU  m*  UrxJ  is  obtained  by 

serially  connecting  K  BSCs,  BSC  ,  k- 

as  it.  is  shown  (for  K~6>  in  Figure  3. 

Tlie  inpu  t  data  is  mti^duc^d  tin  ough 

Ok-  , ...,0k  (tlie  initial  q- values)  and 

0  ‘K 

(thrfr  initial  d--va  lues).  Thi 
computed  q-values  are  extracted  fros. 

OR  ,...,QR  .  These  values  are  sent  back  to 

0  K-l 

the  left  entiles  in  orrfei  to  advance  in  the 
computation  with  a  .new- column  both  for  d  and 
q.  'Let  u*  remai  k  Hut  CCoii  *v>u»od  requires- 
to  compute  <J(k,i),  k«0,  \ , . . . ,K-2l,  and 

q(k,i+!>,  k*-0, 1,.  ..,K-2i-l,  where 
1.  Since  we  ne«rf  at  least  one  term  of'  thf 
sequence  (q(k,n>)  ,  it  obtain  that  K>?n**l« 

Further,  on<  denotes  hr  tl*e  value 

circulating  thiough  the  pin  I.  in  BSC^  -at  W*c 
time  l.  The  correct nesi  of  the  working  of 
t.SA  is  stated  by 
Theorem  £  If 


Oh  (Uk)M,(k,l),  k‘ 0, . . .  ,IC 
k 

and 

<£> 

U.  (t»K*l)»d(k*l,0),  k»-0,.. 

.  ksl 
then 

.,/v-l 

(7) 

CR  CHk*4l)*q(k,iH>,k*-0, 1, 

k*l 
and 

. . . i 

(3) 

DR  ( t H(-2*4i )»d(k, 1 ) ,  k-0, 1 

k«-l 

for  l-l, . .. ,n-1 . 

,...,K-5ti 

(?) 

Proof .  Tlie  pi  oof  is  done  by  i 

iiC'UOt  ion. 

f-oi 

the  beginning  we  piove  (3)  and 

(?)  lot 

SM. 

bet  us  examine  the  woiking  of 

BSC  .  from 

<«> 

and  (7)  we  liave  0  (I ;  t>*  Qt  q( l)»q(0, 1), 

n  (l;t-1)-»0t  (t«t)«d(1,0),  QB(  1  j  t  S  1  >  * 

In  1 

CA.  U+l)*q(l.  I).  Fiom  (4)  it  results 
1 

DR  (t*?)*Cl  (t;U2)-li  (Ijl-l)-Q  (IjO* 

1  out  in  in 

<TO(!j  tsi)^Kl,0)-q(0, l>,  and  with  (3) 

we  obtain  DR  ( H2)f*d(0,  1 ).  Now,  suppose  tt»*t 

DR  <tsK*2)*d(k,  I),  foi  k>0  and  examine  the 
kM 

activity  of  CSC  .  From  (6)  and  (/)  w: 
k»? 

obtain  Cl,  (h»2i  l»k*l>«Qt  (t<k«l)‘*q(kM,  U. 

D  (K*2;  t*k)-»OL  ( t*k>*d(ki2,0>, 
in  k<2 

QB(k+2;  Uk*2)*CA.  ( t *k*2>«q(k*2, I )  and  from 

k*2 

tk  above  assumption  0f(k*2j  Hk*2)* 

Q  (k+t;  tsk*2)-UR  ( t  »k+2)‘'d(k,  1).  By 

out  k+1 

taking  tskM  instead  of  t  in  (4),  it  lesults 

D  (ks2j  tik*3)-li  (k*2;  Uk)-G,  (k*2?HkM>* 

+QB(k*2i tsk*2)-d(ks2,0)q(k*t,t)«q(k»2, 1) 

and  from  (3)  we  obtain  DR  <l*k+3>* 

k*2 

d(k+l,i).  Consequently  (?)  l»oltf*  lor  i*»l. 

On  the  other  hand,  OR  (t»k*4)  = 

k*J 
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Q  (k+2yt+k+4>  and  from  (5)  i.t  results 
out  -  ■ 

OR,  (t+k+4)*Q  (k+2y  t+k+l)/DHk+2;  t+k+2)“ 

k+1  in 

D  (k+2y t+k+3)=(q(k+l, l)/d(k, l))d(k+l, 1)* 
out 

q(k,2)  lie.  (8)j. holds* for  .i»l. 

Now,  assume  that>  (8)  and  (9)  are  true  for 
some  i.  From  the-  constructionof  -  USA,  we 

have  OR  *QL  ,  and  DR  =DL  ,  k=0, . . „ ,K-1. 

k+1  k  k+1  k  , 

Consequently,  OL  (t+k+4i)*Qft  ;(t+k+4i>* 

q(k,i+l)#  k*?0, .  ,K-2i-l,  and  DLHt+k+4i- 

2J-DR  (t+k+4i-2)“d(k, i) ,  k«0, . . . , K-2i. 

’’  k+i  ' 

BSC  -  receives  Qin<l».t+4i)«GLo(t+4i)- 

q(0,  i+1),  D  (ly i+41-l)»DL  (t+4i-l)«d(l, i), 

QB(l;t+4i+l)*QL  (t+4i+l)«q(i, i+1)  and 

1 

DT(ly t+4i+l)*0.  So,  following  (4>,  wevobtain 

DR  (t+4i+2>«  D  (iy,t+4i+2)- 

1  out 

D^Clj  t+4i-l)rG  (ly  t+4i)+QB(l>  t+4i+t)» 

d(l, i)-q(0,i+l)+q(l,i+l)*d(0,5+l>.  Now,  let 

us  suppose  that  DR  .(t+k+4i+l)*d(k-l, i+l),, 

k>l.  BSC  receives  Q  (k+1; t*k+4i)« 
k+1  in 

C*k+  (t+k+4i)*q(k, i+1),  Din(k+lyt+k+4i-l>* 

DRk*ltt+k+4i-l)-d(k+l,i),  GB(k+ly t+k+4i+l>- 
k+2 

*QLk^i(t+k+4i+l)»q(k+l,i+l),  and  DT(k+lyt+k+ 
41+D-DR  (t+k+4i+l)*d(k-l,  i+1).  So,  follo¬ 
wing  (4),  D  (k+ly t+k+4i+2)-D,  (k+l»t+k+ 
out  in 

4i-l)-Q  (k+ly  t+k+4i)+QB(k+l; t+k+4i+t)- 

d(k+l,i)-q(k,i+l)+q(k+l,i+l).  From  (3)  it 

result*  DR  (t+k+4i+2)»d(k, i+1).  On  the 
k+1 

other  hand,  using  (3)  we  obtain 

GRk(t+k+4i+3)»Q^(k+ly  t+k+4i+3)«Q^(k+iy  • 

t+k+4i)/DT(k+ly t+k+4i+l)D  . (k+ly  t+k+4l+2)« 
out 

q(k, i+l)/d(k-l, l+l)d(k,i+l)  and  from  (2)  it 

results  OR  (t+k+4i+3)»q(k-l , i+2) ,  and  the 
k 

proof  is  terminated.  • 

i ia.  aL  1M  izilaUa  \rr.vt. 

Let  us  suppose  that  a  single  polynomial 
p(x)  is  to  be  processed  on  LSA.  The  computa¬ 
tion  starts  at  tmin»t-l  with  the  introducing 
of  q(0, 1)*0  through  QL^  and  ends  at 
t*ax»t+K+2n**3  when  q(K-2n+l ,n-l)  is  extrac¬ 
ted  from  OR  .  Thus  the  computation 

K-2n+2  , 

takes  ttot»t»ax-toin+t«K+2n-l  CTs.  From  (8) 
it  results  that  a  new  q-value  emerges  from 
each  GR-output  at  every  four  CTs.  This  weans 
that  each  BSC  in  LSA  has  a  poor  utilization. 
Now,  assume  that  we  have  to  f ind  the  roots 
of  the  polynomials  p  ,  s»l,...,S,  where 

S*4H.  One  denotes  by  B  the  i-th  batch, 
where  B  *fp./J*4i-3, . . . ,  4i),  i»l,...,tt.  For 
the  sake  of  the  regularity  we  assume  that 


all  ^polynomials  are' of  degree-- ni  where  n« 
max(n'/nX(K+l)/2T,  this  choice  of  n  being- 
given  by  the  condi  t lorn K>?n-1*H the  length  K 
of 'LSA' is  supposed^ to' be  fixed). 

The> processing -of i*l;;..;Hi*  done* in 
a  serial/  fashion.  For^each  fixed*  i.-,  the 

processing-  of.  B‘  begins- with  the  introduc- 

' 

ing  :  of  the  - q -values  of  ‘  p.  in  the  ’manner- 
indicated  by r  Theorem- '1 ,  j  Mi-3, ..  -,4i,  so 
that'  t he Jq (0,1) -value  of  p-  is  introduced 
one  CT*  after-  the  introducing  of  q  (6,  Devalue 

of  ‘p-.  'One. passes -to  B  as  soon  as  the 
J  HI 

q(o,n)rvalue  of  p  emetges  from-  QR  . 

4i  l 

Further  we  shall  estimate  the  effectiveness 

with  which  the  resources  of  LSA  are  used 
when  it  processes  a  single  batch  of  four 
polynomials,  let  0  denote  lire  time  spent, 
for  arithmetic  operations  by  a  single 
processor  algoritlw  (SPA)  which  acts  on  a 
single  problem  Instance.  Frcm  (8)  and  (9)  it 

results  that  SPA  computes  (n-D(K-n)  values 

* 

of  q  and  (n-l)(K+l-n)  values  of  d.  there¬ 
fore,  the  total  numbet  of  arillMetic  opera¬ 
tions  is  0a2(n-l>(?k-*2»tl),  and  consequently 
SPA  needs  T(SPA)»4*0  time  foi  each  batch.  On 
the  other  hand,  LSA  lakes  T (1  SA)«llot*3*»k« 
2n+2  CTs  per  batch,  and  lias  Njv>4K  proces¬ 
sors.  The  rallo  Fc*TU.SA)xNp/U$PA>  measures 
the  effectiveness  or  processor  utilization 
(C23)  and  we  obtain 

Fc(K,n)*K(K*2n*2)/(2(n-l)(2K-2n»l)l.  (10) 

Let  us  suppose  that  K»an*b,  where  a  and  b 

are  integers.  Further,  we  are  interested  It* 

obtain  a  and  b  so  that 

0  0 

lim  Ec(a  n+h  ,n)<)im  Ft (*n>l«, n), 
n->oo  0  0  n->oo 

where  a  n*b  ,  an)b<2n-l,  fo»  each  n>3.  From 
0  0 

(10),  we  obtain  lim  €  (an+b,n>* 
n->oo  c 

f (a)-a(a+2)/CH(a-l))  while  a  and  b  satisfy 

either  (l>  a-*2  ami  b£.-t  oi  (ID  a>2  and 

(bH)/(2-a)<3.  Finally,  it  results 

f(3Xf(a),  for  *>2  and  ©X3,  and  we  obtain 

a  *3,  b  >-4  and  f (3) *1.875. 
k  x"~ 

Now,  let  uv  estimate  lh*  bus  bandwidth 
utilization,  Kd  -UxNd/Ad  ((21).  The  maximum 
amount  of  data  to  be  transmitted  between  the 
host  and  ISA  in  one  CT  is  WjS.  The  data 
movement  begins  with  q(0, 1)  of  the  first 
polynomial  at  time  t  and  ends  with  q(K- 
2n+l,n)  of  the  last  polynomial  of  U«-  batch 
at  the  title  t<K+2n.  Thus,  the  number  of  data 
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movement.'steps-.  £*  Nd=K+2n+l.  The  -processing; 

of,  a-  polynomial.-heeds  K+2  input-  data-  ~and> 

offers  n-1  final,  q-valucs.  Thus,  the.  total 

amount  of  data."  to  be  .transmitted  is  Ad* 

4<K+h+l>.  Consequently,  ?Ed<K,nj*5(K+2n+l)/ 

C4<K+n+l)T  and  lim.  Ed(3n+b  ,n)*25/I6. 

-  n->oo  •  0  'v 

The  data,  locality  .ratio*  (£33).  is’  RDL“Nmen/ 

Nop,  where  Nmen  is.' the  number  of  memory; 

acceses:  and;  Nop  :*  the  number*  of  floating 

point  operations.  It  isrto  see  that  Nmen«' 

Nd,  Nop*T(SPA)-and  for -any,  K>2n-i  we  have 

lim  RDL(K,n)*0,  while  for- each;*fixed  n 
n->oo 

it  results  lim  ,  RDL(K,n)»l/ri6(nrl)3- 

K->oo  " 

4.  Some-  variants 

The  CCGD  algorithm  requires;  to  supply  the 
array  with  the  initial  q-values.  These 
values  can  be  obtained  from  '  the  systolic 
array,  implementing  the  Bernoulli'’ s  method 
for  polynomial  roots  finding  (£13). 

It  is  clear  that  the  array  could  be 
reconfigurated  in  order  to  obtain  a  trape¬ 
zoidal  systolic  array.  In  this  case,  LSA  is 
modified  so  that  the  connections  for  recir¬ 
culating  the  outputs  are  removed.  The  2D 
systolic  array  is  obtained. by  using  I  copies 
of  LSA,  LSAj ,  • .  • , LSA  ^ ,  to  that  LSA^  (wich 
has  K-i+1  BSCs)  sends  the  results  to  LSA  , 
1*1,..., 1-1,  while  LSA^  emits  back  to  LSa|. 
We  remark  that  for  sufficiently  large  I  the 
data  recirculating  can  he  removed. 

By  connecting  BSC  to  BSC  it  Is  obtained  a 
K  1 

ring  of  processors  wich  ..is  able  to  handle 
with  larger  values  of  K.  A  torus  can  be 
obtained  by  applying  this  Ldea  to  2D  array 
above  suggested. 

A  new  variant  of  LSA  could  be  obtained  by 
supposing  that  the  initial  q-  and  d-values 
enter  LSA  at  the  same  time.  The  network  must 
be  retimed  and  many  delay  processors  can  be 
saved. 

A  more  flexible  variant  of  LSA  (supporting 
any  manner  to  Introduce  {he  initial  data) 
can  be  obtained  by  using  the  so-called 
diastolic  control  elements  (£63).  It  suf¬ 
fices  to  replace  each  delay  register 
in  the  circuit  of  BSC  by  a  diastolic  control 
element.  In  this  manner  w*  ensure  a  local 
control  of  LSA  wich  becomes  a  systaltic 
array  (C63). 
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Abstract:  In  this  paper  tie  modelling  of  a  closed 
loop  po«er  amplifier  is  described  c*icg  tie  Bond- 
graph  techniques.  Tbe  power  aaplifier  is  a  50  watt 
class  AB  pcstpcll  linear  aaplifier,  which  scpplies 
power  at  230r  ac  ras.  It  draws  power" froa  *  12r 
battery.  The  bondgraph  modelling  and  simulation  is 
perforaed  oo  a  personal  coapoter  using  tbe  “TUTSIM” 
software  package.  Tbe  power  aaplifier  is  also 
realised  in  tbe  laboratory  and  tbe  resalts  are 
compared  with  tbe  si  aula  ted  aodel. 


h.  IntryfocUpn 

Bondgraph  is  a  graphical  tool  which  is  used  in  tbe 
representation  of  dynamic  systems.  It  displays  both 
the  energy  and  signal  exchanges  between  components 
or  elements  of  tbe  systea  (1].  For  a  system  of 
order  greater  than  three,  the  transfer  function 
approach  to  analysis  becoses  cusbcrsose  and  also 
one  tends  to  loose  the  correspondence  of  the  var¬ 
ious  tiae  constants  associated  with  the  various 
elements  in  the  systea,  especially  for  high  order 
systeas.  But  in  the  case  of  bondgraphs,  one  to  one 
correspondence  is  maintained  with  the  actual  cir¬ 
cuit,  and  further,  it  gives  a  graphical  picture  of 
the  dynamic  equations  of  the  systea.  The  bondgraphs 
are  initially  set  up  acausally  and  then  subsequent¬ 
ly  tranforaed  into  a  causal  diagraa  by  a  systematic 
choice  of  the  causalities  [1,2]. 


System  and  Model 

The  circuit  shown  in  fig.l  is  the  systea  which  has 
been  aodclled  and  siaulated.  It  is  a  linear  push- 
pull  aaplifier  operated  in  the  class  AB  node.  The 
drive  for  the  transistors  is  fron  a  push  pull  stage 
driven  froa  an  integrated  aaplifier.  A  sinevave 
reference  of  frequency  50Hz,  drives  the  bases  of 
the  pushpull  transistors  through  a  transforser  TX1 
(fig.l).  The  power  stage  consists  of, two  power 
transistors  and  a  step  up  transforser  TX2.  The 
output  voltage  fed  to  the  load  is  230  v  ras  at  50 
Hz.  The  output  voltage  is  maintained  at  230  v  by 
seans  of  feedback  loop  as  shown  in  fig.l. 

The  transforser  TX2  is  modelled  taking  into  account 
the  leakage  and  Magnetising  inductances,  and  wind¬ 
ing  resistances. The  transistor  is  represented  by  a 
large  signal  equivalent  circuit,  as  it  handles 
power  signals.  It  is  modelled  as  a  voltage  con¬ 
trolled  current  source  through  the  Ic-Vbe  non-lincar 
characteristics.  The  diode  {•  represented  by  its 
nonlinear  i-v  characteristics. 

The  TUTSIM  package  [3]  is  used  in  this  emulation 
on  a  personal  computer.  As  there  are  no  representa¬ 
tions  for  transistor  and  diode  in  the  bondgraph 
language, ,  the  nodel  for  transistor  and  the  diode 
(as  aentioned  above)  were  built  using  the  various 
Function  blocks"  provided  in  TUTSIM.  The  transis- 
tor  Is  represented  as  a  dependent  current  source 
(Qf)  as  shown  in  fig.2(a)  and  the  diode  is  repre¬ 
sented  as  a  piecewise  linear  resistor  (D)  as  shown 
in  fig. 2(b).  The  bondgraph  aodel  of  the  entire 
systea  is  shown  in  fig. 3. 


3.  Simulation  Bcsults 

One  of  tbe  difficulties  faced  dcriog  design  of  a 
closed  loop  control  systea  is  tbe  tuning  of  tbe 
controller  parameters  for  optissa  control  perform¬ 
ance.  Bondgraph  simulation  is  a  very  useful  tool 
which  helps  one  to  tune  tbe  controller  and  tbe 
sensor  tiae  constants  to  achieve  optima*  control 
performance,  as  it  gives  directly  tbe  required 
signals  in  tiae  doaain  and  tbe  paraseters  can  be 
changed  quickly  during  the  sisulation. 

The  simulation  results  are  shown  in  figures  4(a) 
and  4(b).  Tbe  fig.4(a)  shows  the  output  voltage 
wavefora  (Vo)  and  tbe  sense  filter  output  waveform 
(Vc)  during  tbe  transient  period  and  fig. 4(b)  these 
signals  during  the  steady  state  period.  Sisulation 
results  indicate  that  for  stable  operation,  the 
sense  tiae  constant  must  be  aucb  fatter  than  the 
controller  tiae  constant.  As  the  sense  tine  con¬ 
stant  approaches  the  controller  time  constant,  the 
systea  becomes  increasingly  unstable.  It  was  found 
froa  this  simulation  that  a  sense  tiae  constant 
which  is  twice  as  fast  as  the  controller  tiee 
constant  gives  a  stable  operation. 

The  power  aaplifier  circuit  was  mpleaented  in  the 
laboratory  to  verify  the  above  siaulation  results. 
The  siaulation  proved  to  be  very  accurate.  Through 
siaulation,  a  sense  tiae  constant  of  273as  and  a 
controller  tiee  constant  of  381 as  provided  a  very 
stable  systea  fioa  no  load  to  full  load  and  was 
used  in  the  hardware  lapleaentation  successfully. 
To  verify  the  siaulated  results  further,  the  value 
of  the  sense  tiae  constant  in  the  hardware  unit  was 
varied  around  the  optiaua  value.  The  systea  proved 
to  be  unstable  as  was  predicted  during  siaulation. 
Further,  the  observed  output  waveforas  and  the 
siaulated  output  wavefora  agreed  in  shape  and  aapli- 
tude. 


is.  Conclusion 

In  the  above  discussion,  we  looked  at  aodel ling  and 
siaulating  a  pushpull  aaplifier  systea,  with  the 
objective  of  analysing  the  systea  and  to  tune  the 
controllei  tiae  constants  to  achieve  optiaua  con¬ 
trol  pcrforaance.  This  kind  of  siaulation  reduces 
the  hardware  iapleaentation  tiae  considerably  as 
one  can  siaulate  the  non-linear  devices  like  tran¬ 
sistors  and  diodes,  one  can  predict  the  nature  of 
the  outputs,  design  the  control  circuit  to  aeet  the 
required  perforaance,  etc.  in  a  short  tiae. 
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Abstract  This  work  presents  a  comparative  survey  of  h- 
refineznent  adaptive  schemes  for  the  solution,  by  means  of  a 
first  order  finite  element  method,  of  the  differential  equations  of 
interest  in  computational  electromagnetics.  All  schemes  anal¬ 
ysed  are  based  on  “a  posteriori”,  element  by  dement,  single 
solution  error  estimation  techniques.  A  number  of  possible  al¬ 
gorithms  in  the  above  class  is  reviewed,  and  obtained  results 
are  compared  and  discussed. 

INTRODUCTION 

The  quality  of  solution  that  can  be  obtained  by  finite  de¬ 
ment  methods  is  strongly  affected  by  the  mesh  used.  For  this 
reason,  the  definition  of  a  mesh  suitable  for  a  particular  prob¬ 
lem  has  always  been,  and  in  most  practical  cases  still  is,  one  of 
the  most  crutial  phases  of  a  finite  dement  analysis.  As  a  con¬ 
sequence,  a  very  significant  research  effort  is  devoted  at  present 
to  develop  suitable  adaptive  meshing  algorithms,  that  are  rec¬ 
ognized  of  critical  importance  to  relieve  the  analyst  from  this 
difficult  and  time  comsuming  task  and  to  make  quality  of  solu¬ 
tion  independent  of  user  shillfullness  in  meshing. 

The  first  attempts  in  this  direction  have  been  performed 
in  the  area  of  structural  analysis,  historically  the  first  to  de- 
vdop  finite  dement  techniques,  where  the  most  commonly  used 
dements  are  quadrilateral  ones,  for  this  reason,  many  of  the 
most  significant  papers  in  the  literature  about  adaptive  mesh 
ing  deal  with  quadrilateral  dements  (1)  However,  the  aim 
of  the  authors  is  to  develop  adaptive  meshing  algorithms  and 
procedures  suitable  for  application  in  electromagnetic  analy¬ 
sis,  or  computational  electromagnetics,  that  has  a  number  of 
different  requirements.  In  fact,  on  the  contrary  of  structural 
analysis,  computational  electromagnetics  must  frequently  solve 
open  boundary  problems,  has  a  significant  variety  of  differen 
tial  equations  to  face,  frequently  nonlinear  and  time- dependent, 
and  in  application  of  practical  interest  must  cope  in  general 
with  intricate  geometries  and  a  number  of  different  materials 
For  this  class  of  applications  triangular  meshes  are  generally 
used,  mainly  because  of  the  greater  simplicity  in  dealing  with 
intricate  boundaries  and  interfaces.  However,  this  kind  of  ele¬ 
ments  has  received  so  fax  much  less  attention  in  adaptive  mesh 
ing  literature  [2]. 

Within  computational  electromagnetics,  the  attention  of 
the  authors  has  been  focused  on  adaptive  meshing  algorithms 
suitable  for  “general-purpose”  analysis  codes,  that  is,  codes  ca 
pable  to  solve  a  wide  class  of  geometries  of  practical  interest, 
generally  unknown  “a  priori”to  the  code  developer,  and  then  to 
the  meshing  algorithm  Bcvausc  of  this  further  specific  require 
ment,  a  number  of  stringent  features  are  necessary  or  desirable 
In  particular,  the  adaptive  meshing  algorithm  should  be  gen 
eral  enough  to  be  suitable,  or  readily  adapted,  to  a  number  of 
differential  equations,  also  nonlinear  and  time-dependent,  and 
should  be  “robust”  enough  to  perform  correctly  even  with  se 
vere  geometrical  constraints.  Finally,  since  this  class  of  applica¬ 


tions  is  significantly  compute-bound,  computationally  intensive 
algorithms  should  be  as  far  as  possible  avoided.  As  a  conse¬ 
quence,  no  activity  has  been  performed  on  adaptive  methods 
based  on  dual  finite  dement  solutions  [6J,  judged  not  general 
enough  and  too  computationally  expensive,  nor.  on  methods 
requiring  the  solution  of  a  local  differential  problem  on  “finite 
patches”  made  up  with  several  elements  J7J,  since  they  involve 
an  “ad  hoc”  treatment  of  boundaries  and  interfaces,  that  can 
become  cumbersome  or  even  critical  in  intricate  geometries. 
In  order  to  maximize  flexibility  and  robustness,  the  activity 
of  the  authors  on  error  estimation  and  adaptive  meshing  has 
then  been  concentrated  on  first  order  triangular  elements,  us¬ 
ing  element- by-element  error  estimation  and  h- refinement  tech¬ 
niques  (3-5). 

In  the  present  paper  a  review  of  the  error  estimation  and 
adaptive  meshing  algorithms  and  techniques  selected  and  im¬ 
plemented  is  performed,  and  results  of  some  comparative  anal¬ 
ysis  of  their  features  are  presented  and  discussed. 

MESH  REFINEMENT  ALGORITHM 

An  h- refinement  adaptive  meshing  algorithm  is  organized 
as  follows,  first  a  finite  clement  solution  is  computed  on  the  cur¬ 
rent  mesh,  then  an  estimation  of  *hc  local  error  of  this  solution 
is  computed  element  by  element,  finally  the  elements  showing 
greatest  errors  are  subdivided  to  obtain  a  new  mesh.  Since 
the  various  methods  in  this  paper  differ  only  for  the  error  esti¬ 
mation  procedure  used,  in  the  present  section  mesh  refinement 
algorithm  common  to  all  of  them  is  described. 

To  start  the  adaption  procedure,  an  initial  mesh  is  built  up 
using  the  automatic  meshing  features  of  the  CEDEF  package 
[8],  a  first  problem  solution  is  performed,  the  local  error  on  each 
element  is  estimated  find  used  to  decide  which  elements  are  to 
be  subdivided.  Once  an  element  has  been  marked  for  refine¬ 
ment,  two  different  clement  subdivision  rules  have  been  used, 
for  elements  lying  in  the  bulk  or  having  a  side  on  a  boundary 
or  on  an  interface,  respectively. 

In  the  first  case,  a  node  in  the  centroid  of  the  element 
is  added,  splitting  the  original  clement  in  three  smaller  ones. 
For  boundary  and  interface  elements,  a  node  is  added  at  the 
midpoint  of  the  boundary  or  interface  side.  In  this  way  it  is 
possible  to  thicke.1  the  mesh  also  on  boundaries  and  interfaces, 
whith  would  not  be  affected  otherwise  using  the  first  technique 
only.  Once  this  procedure  has  been  applied  to  the  whole  mesh, 
a  Delaunay  tnangulation  is  performed,  to  provide  the  optimal 
mesh  for  the  given  set  of  nodes,  and  a  new  solution  is  computed. 
The  procedure  is  then  iterated  until  the  convergence  criterion 
is  satisfied. 

The  final  step  of  mesh  improvement  is  an  iterative  nodal 
displacement  procedure,  operated  on  patches  of  elements  with 
an  underrclaxed  "rubber  banding”  technique,  that  can  be  ac¬ 
tivated  by  the  user  to  improve  the  final  quality  of  the  mesh 
before  performing  the  final  solution. 
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ERROR  ESTIMATION  PROCEDURES 

The  three  error  estimator  procedures,  whiai  in  the  pre¬ 
vious  work  of  the  authors  have  emerged  as  the  best  ones  to 
be  used  in  the  adaptive  mesh  algorithm,  are  described  in  the 
present  section.  Two  of  them  are  based  on  the  residual  evalua¬ 
tion  and  are  referred  to  as  “local  error  problem”  and  “extended 
complete  residual”  methods  [3,5],  while  the  other  one  is  based 
on  polynomial  interpolation  theory  and  is  referred  to  as  “held 
difference"  method  [3,4].  The  “local  error  problem"  is  as  fol¬ 
lows.  Considering  a  Poisson  problem  defined  by  the  differential 
equation: 

— V2!*  =  /  (1) 

in  a  region  11  with  boundary  conditions  on  dSl  =  T  =  T*  U  Fj: 


*  =  2T'(/a;M‘is)+ 

(11) 


where  the  weight  7  (0  <  7  <  1)  allows  to  weight  differently  the 
error  in  the  dement  and  on  its  boundary.  The  estimator  s,  has 
the  meaning  of  an  average  error  over  ft,,  but  to  use  the  same 
refinement  indicator  as  in  the  “local  error  problem”  method  we 
need  rather  an  estimate  of  the  square  of  the  quadratic  norm  of 
the  error,  that  is  evaluated  as: 


=  (12) 


u  —  uj  on  r. 


du 

a^=q 


on  r„ 


(2) 


the  local  error  e;  on  the  5-th  dement  ft,'  is  estimated  by  solving 
the  following  differential  problem  on  ft; 


~V7ei  = 

t{  ~  Ti  on  t^.nPi 


on  ft; 


(3) 
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and  u  is  the  finite  dement  solution  of  the  problem  (1)  -  (2). 
Once  the  error  e;  on  the  solution  has  been  estimated,  the  error 
on  its  gradient  (i  c.Vu  -  Vu)  is  readily  obtained  as  Vc,.  Then 
the  refinement  indicator  t),  on  the  element  ft,'  is  computed  as 

Cl- 

’*  '  Ej'li IIV5,-|iJ  +  r^.iifi.ii2' 


e;.,w 


where  N  is  the  total  number  of  dements  in  the  whole  domain 
ft  apd  k  is  a  weighting  factor  to  be  selected  in  the  range  0  to  1, 
allowing  to  define  it  as  the  error  estimator  on  the  solution,  on 

its  gradient,  or  on  both.  The  values  t]l} . rjs  are  then  used  to 

decide  which  dements  will  be  subdivided.  The  quadratic  norm 
in  (8)  is  the  standard  one,  that  is  to  say: 


where  t>,  is  a  generic  scalar  or  vector  quantity  in  ft,.  The  re¬ 
finement  criterion  used  is  based  on  the  following  procedure, 
the  maximum  value  of  the  refinement  indicator  over  the  whole 
doma'n,  is  computed  after  each  problem  solution,  and 
an  dement  is  marked  for  subdivision  at  the  next  refinement 
iteration  when; 

7»  ^  Qflmat  (10) 

where  a  is  a  user  defined  refinement  parameter  such  that 
0  <  o  <  1. 

The  “extended  complete  residual”  method  assumes  that 
an  estimate  of  the  error  on  the  solution,  j„  can  be  obtar  <*d  by 
combining  the  driving  functions  of  the  “local  error  problem” 
as; 


where 'd;  is  the  area  of  the  dement  ft, 

To  provide  also  an  estimate  of  an  average  error  on  the 
gradient  of  the  solution  oh  the  i—th~  element,  the  quantity  g, 
has  been  defined  as: 


9t  —  axgts  +  Stg,t  (13) 

where  glz  and  gIJt  for  an  dement  without  any  side  on  the 
boudaxy,  are  obtained  by  a  least  square  solution  of  the  following 
overdetermined  linear  system; 


(s-  •  + ^ =KL  ■  filJ 

(;  =  1,2,3)  (14) 

If  any'  side  of  ft,'  is  on  a  Neumann  boundary  the  corre¬ 
sponding  equation  in  the  system  (14)  is: 


(o*  *  nt})gtt  +  (5r  n,})g„  =  rn  (15) 

If  one  side  of  ft,  is  on  a  Dirichclct  boundary,  the  system 
(14)  has  two  equations  only,  so  that  it  is  no  longer  overdeter- 
mmed  and  an  ordinary  solution  exists.  If  two  sides  of  ft,  are  on 
a  Dirichclct  boundary,  gt  is  directly  determined  by  assuming 
that  it  is  directed  as  the  n,,  normal  to  the  only  internal  side, 
that  is  to  say: 


£  = 


1  du 

2  &n*j  cut 


(16) 


From  9,  the  square  of  the  quadratic  norm  of  the  error  on 
the  gradient  of  the  solution,  needed  in  (8),  is  readily  estimated 


as: 

!|Ve,H!  =|  (17) 


For  the  “field  difference"  method  approach,  let  us  first  de¬ 
fine  as  E[e^  the  gradient  of  the  solution  m  a  generic  element  *, 
constant  over  the  clement,  and  the  gradient  of  the  solution 
assigned  to  the  generic  node  y,  computed  as  the  average  of  the 
values  of  the  elements  surrounding  the  node  and  having 
the  same  material  properties.  The  error  on  the  gradient  of  the 
solution  in  the  node  y  of  the  element  1  has  then  be  defined  as. 

Vcj,  =  £,w  -  £(">  (18) 

This  quantity  can  be  defined  in  any  node  of  each  element,  and 
can  assume  different  values  for  the  same  node  when  computed 
for  elements  lying  across  an  interface  between  different  maten- 


1582 


Is.  By  assuming  that  this  error,  computed  in  the  codes  only,  is 
distributed  over  the  element  with  the  dement  shape  function 
JVj,  it  is  possible  to  define  the  function  Ve,-  AjVej,-, 

giving  a  distribution  of  the  error  over  the  whole  dement.  The 
square  of  the  quadratic  norm  of  the  estimate  of  the  error  on 
tie  gradient  of  the  solution  over  the  generic  dement  x  is  given 


by: 


7=f  Yftfis 

Jiti 


(19) 


Since  the  “fidd  difference”  algorithm  defined  in  this  way 
provides  only  an  estimate  of  the  error  on  the  gradient  of  the 
solution,  the  weighting  factor  k  of  eq.  (S)  has  been  set  to  1. 

At  the  end  of  the  adaption  process,  the  relath  e  estimated 
error  on  the  solution  and  on  its  gradient,  relevant  to  the  dement 
Sl{,  according  to  the  quantities  made  available  by  the  basic  error 


estimator  routines,  are  computed  as: 

,  M 

""p.i 

(20) 

IIV..II 

'■"IIVo.D 

(21) 

error  (22)  or  (23)  is  used  depending  on  whether  the  quantity 
of  interest  is  the  potential  or  the  fidd.  The  same  criterion  has 
been  used  in  comparing  the  best  version  of  each  method,  for  the 
sdeetion  of  the  best  method  as  refinement  indicator.  Finally, 
the  relative  error  (20)  or  (21). on  the  mesh  obtained  by  the 
best  refinement  indicator  has  been  evaluated  by  each  method 
and  compared  with  the  “true”  relative  error  (22)  or  (23).  Sudi 
a  comparison  provides  information  about  the  relative  merit  of 
each  method  when  used  as  a  tool  to  assess  the  accuracy  of  the 
final  results.  The  whole  procedure  has  been  carried  out  using 
both  the  errors  (20),  and  (22)  on  the, potential  and  the  errors 
(21)  and  (23)  on  the  fidd,  since  the  best  refinement  indicator 
*  to  compute  the  potential  is  not  necessarily  the  best  one  to  com¬ 
pute  the  fidd.  A  similar  argument  bolds  when  the  methods  arc 
regarded  as  estimators  of  the  final  error. 

The  solution  of  the  Laplace  equation  in  an  L-shaped  region 
has  been  used  as  test  case.  This  region  is  shown  in  Fig.  1 
with  the  boundary  conditions  used  and  the  analytical  solution 
superimposed.  The  A  —  B  line,  along  which  the  errors  used 
to  compare  the  various  methods  have  been  computed,  is  also 
shown  in  Fig.  1.  All  tests  have  been  carried  out  starting  from 
the  initial  mesh  shown  in  Fig.  2. 


TEST  STRATEGY  AND  RESULTS 

Preliminary  tests  have  shown  that  it  is  difficult  to  perform 
significant  comparisons  among  adaptive  meshing  algorithms 
even  if  they  differ  only  in  the  method  of  error  estimation.  In 
fact,  the  error  estimation  method  usually  determines  both  the 
choice  of  the  dements  to  be  refined  and  the  iteration  at  which 
the  algorithm  stops.  Since  the  best  method  for  the  selection  of 
the  dements  to  be  refined  can  be  in  principle  different  from  the 
best  one  to  be  used  as  convergence  criterion  of  the  refinement, 
the  two  effects  must  be  separated  so  as  to  work  out  significant 
comparisons.  In  order  to  do  so,  a  test  case  provided  with  an¬ 
alytical  solution  has  been  used,  stopping  the  mesh  refinement 
when  the  greatest  difference  between  the  analytical  and  numer¬ 
ical  solutions  (at  the  nodes  for  the  potential  and  at  the  centroid 
of  the  clement  for  the  fidd)  falls  below  a  fixed  limit.  Thus  the 
error  estimation  methods  are  used  only  to  choose  the  elements 
to  be  refined.  In  this  way,  the  comparison  of  “true  errors”  and 
CPU  time  directly  shows  the  relative  merit  of  each  error  estima¬ 
tion  method  used  as  a  refinement  indicator.  Of  course,  different 
methods  give  solutions  on  different  final  meshes,  so  that  com¬ 
paring  the  deviations  of  the  estimated  error  with  respect  to  the 
“true”  one  is  not  fully  significant.  Thus,  evaluating  the  quality 
of  the  method  as  error  estimator,  such  a  comparison  has  been 
carried  out  on  the  same  mesh  for  all  methods. 

As  a  first  step  to  perform  significant  comparisons  among 
the  various  methods,  each  of  them  has  been  optimized  with 

respect  to  the  parameters  involved,  a  and  k.  To  this  aim,  the 
results  obtained  for  different  values  of  the  same  parameters  have 
been  compared  and  a  compromise  taking  into  account  both  the 
execution  time  and  the  “true”  relative  error  on  the  potential  or 
on  the  field  has  been  done  to  choose  the  “optimal”  parameter 
value.  The  “true”  relative  errors  have  been  defined  as: 


,  ,_IM 
*•"- W 

(22) 

,  _  liv.,,11 

(23) 

where  u,  is  the  analytical  solution  over  the  generic  element 
Vu,  its  gradient,  e«  =  -  u„  and  Ve»,  =  Vu,  -  Vu,.  The 


SI 

an 


Fig.  1  -  Equipotcntial  lines  of  the  analytical  solu 
tion  of  the  test  case. 
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'A*  a  first  step,  the  a  parameter  ra  equation  (10)  has  been 
optimized  for  each  method  according  to  the  methodology,  pro 
vioudy  d*ieribe<L  Once  the  best  a  has  been  found,  also  the  k 
parameter  in  equation  (8)  has  been  similarly  optimized,  except 
for  the  Afield  difference”  method  which  only  allows  k  ==  }';  la 
Fig.  3  the  “true”  relative  errors  (22)  on  the  potential,  along 
the  A  —  B  line,  on  the  solution  obtained  by  using  the  various 
methods  as  refinement  indicators,  each  with  its  best  value  of  a 
and  k,  are  shown.  , 

In  oixfcx  to  compare  the  methods  as  estimators  of  the  final 
errors,  the  relative  error  estimation  of  eqy  (20)  on  the  solution 
provided  by.  the  final  mesh  of  the  “extended  complete  resid¬ 
ual*  method,  which  has  provided. the  best  results,  has  been 
performed  for  each  method  along  the  previously  defined  line 
A  —  B  of  Fig.  1.  The  difference  between  the  “true^  and  the 
estimated  errors  (22)  and  (20),  along  the  A  —  B  line,  is  shown 
for  each  method  m  Fig.  4. 

In  Figs.  5  and  6,  the  same  comparisons  are  displayed  for 
mesh  refinement  and  error  estimation  on  the  field*  using  the 
errors  (23)  and  (21),  respectively.  ' 


DISCUSSION  AND  CONCLUSIONS  . 

The  extensive  work  on  the  “optimization”  erf  the  a  and  k 
refinement  parameters  of  ei*.  (10)  and  (8),  even  if  by  necessity 
•semi-qualitative,  his- brought  to  the  conclusion  that  a  value 
around  3  of  the  <7  parameter  generates  the  most  “balanced” 
meshes  for  all  refinement  methods,  both  for  potential  and  field 
errors.  As  far  as  k  is  concerned,  it  can  be  stated  that  for  “field 
oriented”  meshes  again  a  value  of  ,5  has  proven  to  be  the  most 
adequate  solution,' for  all  methods  that  allow  its  variations. 
The  situation  is  more  dependent  on  the  specific  error  estimate 
used  for  “potential  orwnted”  meshes.  As  for  the  convergence  of 
adaptive  meshing  to’  the  true  solution,  all  methods  tested  (with 
the  exception  of  the  “field  difference”  one,  estimating  the  field 
error  only)  provide  a  good  convergence,  as  can  be  seen  in  Fig.  3, 
„  giving  error  indicatorson  the  potential,  as  defined  by  equation 
(22),  lower  than  two  per  cent  in  the  high  field  region  The 
relative  merits  of  the  various  proccdur’es  for  the. solution  of  po¬ 
tential  problems  should  then  be  evaluated  on  the  basis  of  their 
computational  effort.  This  comparison  favours  significantly  the 


"extended  complete  residual”  formulations  with  the  “optimal" 
value  of  k  being  egual  to  zero,  meaning  an  evaluation  of  the  re¬ 
finement  estimator  (8)  only  on  the  potential.  The  convergence 
to  the  true  solution  for  the  fields  also  shows  a  similar  behaviour 
for  all  methods  tested,  with  computational  time  slightly  favour¬ 
ing  the  "local  error  problem”  approach.  Finally,  regarding  the 
final  error  estimation  of  the  potential  for  the  various  methods, 
it  can  be  observed  in  Fig,  4  that  the  best  results  axe  given  so 
far  by  the  "local  error  problem”  algorithm.' As  far  as  the  final 
error  estimation  of  the  field  is  concerned,  it  can  be  seen  from 
Fig.  C  that  all  the  methods  seem  to  provide  reasonably  ade¬ 
quate  estimates,  even  if  none  of  the  methods  implemented  has 
provided  a  uniform  upper  bound  to  the  error.  The  activity  is 
now  continuing  to  verify  the  consistency  of  the  above  results  in 
different  cases,  to  improve  mesh  refinement  convergence  criteria 
and  to  try  to  define  accelerated  convergence  strategies. 
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'RELATIONSHIPS  BETWEEN  DUAL  ENERGY 
FORMULATIONS  IN  ELECTROMAGNETISM. 
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Abstract; 

It  is  the  intention  of  this  contribution  to  show  how  different 
ways  of  constructing  dual  energy  forms  for  problems  in 
electromagnetism  can  be  related  to  each  other,  and  then  used 
to  provide  error  bounded  solutions  for  this  class  of  problem. 
The  first  approach  is  based  uoon  the  principles  of  Hamilton 
and  Toupin  and  the  use  of  Legendre  transformations,  which 
can  be  related  to  the  method  of  Lagrange  multipliers  in 
constrained  optimization.  The  second  technique  stems  from  a 
geometrical  interpretation  in  Hilbert  space  called  the 
hypercircle  method,  first  introduced  by  Prager  and  Synge  (1J. 
Both  can  be  shown  to  provide  the  same  formulations  as  those 
achieved  using  direct  integration  and  the  methods  of 
functional  analysis.  This  allows  a  general  structure  for 
electromagnetic  field  problems  to  be  developed  which  leads 
to  the  easy  selection  of  appropriate  energy  functionals. 


2.  The  hvpercircle  method. 

In  [1]  it  is  shown  how  the  solution  to  the  appropriate 
boundary  value  problem  can  be  constrained  to  Ue  on  the 
hypera'rcle  representing  the  intersection  of  a  vector  space 
hypersphere  and  a  byperplane.  In  the  electrostatic  example 
referred  to  earlier  D  is  a  function  in  the  space  V,  in  which 
distance  is  measured  by  the.’ energy  norm  (E|D),  where 
E=VCD.  If  two  subspaces  of  V,  V*  and  V,  exist  where  V1 
contains  functions  D’  such  that  D  =  cE  and  V.D’=  p,  and 
V"  contains  functions  D"  such  that  D*=  cE“  and  VxE"=  0,  then 
the  intersection  of  V*  and  V*  represents  the  exact  function  D 
=  eE,  which  also  satisfies  V.D  -  p  and  VxE  =  0.  The  energy 
norms  in  these  spaces  are  also  the  pnmal  complementary  pair 
given  above. 


I«  Hamilton's  and  loupm’s  principle. 

These  principles  has  been  used  by  several  authors  to  obtain 
complementary  variational  forms  for  engineering  field 
problems,  in  particular  Hammond  &  Penman  (2),  used  the 
analogy  betwen  analytical  mechanics  and  electromagnetism  to 
find  complementary  forms  for  electromagnetic  field  problems. 
To  illustrate,  the  canonical  set  of  equations  for  electrostatics, 
in  a  domain  fi  is: 

•70  a  E,  with  0  »  g  on  dfy;  eE  =  D;  V.D  **  p,  with  nU« 
h  on 

Application  of  Hamilton’s  principle  leads  to  the  energy 
statement, 

m  -  VWWft  •  (pMo  +  <h|0)dfl 

whilst  Toupm’s  principle,  by  letting  D  =  Ds  +  VxC  gives  the 
complementary  energy  form, 

S(C)  -  +  V‘C)|(DS  ♦  VxC)^  •  (gn|(Ds  +  V*))*, 

These  functionals  are  the  pnmal  complementary  pair  and  may 
be  extremized  to  provide  bounded  solutions  to  the  the  pnmal 
canonical  equation.  It  is  also  possible  to  develop  a  dual  system 
and  a  corresponding  pair  of  complementary  functionals,  which 
also  provide  bounds.  A  closed  chain  of  Legendre 
transformations  can  be  used  to  move  between  the  various 
forms, (3}. 


3.  The  direct  method. 

More  generally,  the  partial  differential  equations  of 
electromagnetism  often  have  the  form,  T^MfTu+Vj)  =  p,  in 
0,  with  Bu  *  g  on  3flj  and  B  w  *  h  on  302,  w^crc  w  = 
M(Tu+vs).  Here,  T3  is  the  adjoint  of  T,  M  is  self-adjoint  and 
invertible,  u  and  p  c  U,  and  w  and  v  c  V.  Using  the  methods 
described  by  Vamberg,  [4]  one  can  generate  the  functional, 

*(u,v,w)  =  (V/I^pMu  +  C/jBVhjuJan  +  V,(Mv*w|v)v 
+  (‘/)Tu-,/,v+vs|w)v  +  (V,Bu-g|w)afl. 

Using  a  generalised  form  of  Greens  theorem  this  functional 
can  be  reduced  to  two  different  forms,  namely, 

6(u)  »  ’/:(Tu + vs | M(Tu + vs)) Q  -  (p|u)u  -  (h|u)an,  and 

-(w)  =  (•,',M',w+vs|w)v .  (g|w)aQ. 

Reduced  to  the  same  nomenclature  these  are  exactly  the 
fuctionals  given  above  for  the  electrostatic  system.  They,  have 
much  wider  applicability  however,  and  can  be  used  to  provide 
error  bounded  pairs  of  functionals  for  magnetic  and 
electromagnetic  field  problems  too.  The  finite  element 
method  can  be  used  to  extremize  them  to  a  prescribed  level  of 
accuracy. 
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4,  An  craropfc  from  tht  pi  agn.et  infold 

In  section  3,  above,  the  solution  to  T^MCIu+Vj)  »  p  was 
sought  by  extremizing  the  functionals,  9(u)  and  E(w).  This 
equation  can  be  expressed  in  canonical  form  as, 

Tu+vs=v,  with  Bu  =  g  on  30  j 

Mv=w  infl 

T®w*p  with  B  w  =  h  on  302- 

For  the  magnetic  field  the  equivalent  set  is, 

V»A=B  withA=gon30j 

H«l//x(B)  infl 

V*H-J  withn*H=hon3Q2, 

and  the  corresponding  functionals  are. 


Xl«*» 


Figure  2:  Error  bounded  convergence. 


6(A)  =  V.CVxAl  lM7xA))fl  -  (J|A)o  +  (h|A)ao 
and, 


To  illustrate  the  bounded  nature  of  this  complementary  pair 
the  simple  example  of  figure  l  is  solved,  for  a  sev  eral  different 
numbers  of  degrees  of  freedom,  using  the  finite  element 
technique.  The  convergence  in  energy,  from  above  and  below 
the  true  value  is  shown  in  figure  2,  and  the  magnetic  flux 
density  distributions  calculated  from  A  and  H,  respectively  is 
given  in  figures  3  and  4.  The  expected  similarity  of  the  two 
fields  is  apparent. 


K 


h 


Figure  3:  B  from  0(A).  Figure  4:  B  from  S(H) 
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SOLVING  MAXWELL’S  EQUATIONS  USING  VECTOR  POTENTIALS  FOR  THE 
FULL  FREQUENCY  SPECTRUM 
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Abstract  -  The  aim  of  the  present  paper  is  to  present  a  gen¬ 
eral  method  using  vector  potentials,  and  show  its  applicability, 
to  electromagnetic  field  problems  over.the  entire  frequency  spec¬ 
trum.  Results  mil  be  given  for  a  range  of  problems  using  a  gen¬ 
eral  purpose  computer  code  based  on  the  finite  element  method. 


Infre&Klion 


A  great  deal  of  work  has  been  carried  out  recently  in  defining  ro¬ 
bust  algorithms  for  3D  eddy  current  analysis,  producing  unique 
and  accurate  solutions.  The  problem  of  gauging  vector  solutions 
has  therefore  been  carefully  studied,  resulting  in  a  number  of 
viable  formulations  for  this  ‘low’  frequency  limit  of  Maxwell’s 
equations  (1,2). 

The  formulation  used  in  this  paper  is  based  on  the  Lorentz  gauge, 
which  is  extended  to  solve  the  full  set  of  Maxwell’s  equations. 
This  can  then  lead  to  either  an  eigenvalue  problem  (in  which 
conductors  are  assumed  ideal,  and  are  represented  as  bound¬ 
ary  conditions),  or  to  a  full  deterministic  problem  (when  lossy 
materials  are  included). 


Low  Frequency  Limit  Of  Maxwell’s  Equations 


The  governing  equation  in  terms  of  the  magnetic  vector  and 
electric  scalar  potential  is  well  known  for  the  low  frequency  case, 

1  dX 

V  x  -V  x  A  =  -  oVV  +  o(u  x  V  x  A)  (1) 

p  dt 

along  with  the  required  condition  V  ■  J  =  0 

Vxa)  =  °  (2) 

The  use  of  the  Lorentz  gauge  V-  A  =  -furV  to  separate  A  and 
V  equations  is  classical,  and  it  has  recently  been  shown  that  the 
uniqueness  proofs  of  equations  (1)  and  (2)  require  a  further  con¬ 
dition,  for  example  A  •  n  =  fiV  on  conductor  boundaries  (where 
P  is'  an  arbitary  constant).  This  results  in  a  form  of  eqn  (1)  in¬ 
dependent  of  V,  with  the  condition  A  •  n  =  0  included  in  a  weak 
form.  The  current  can  later  be  found  by  solving  eqn  (2),  where 
A  is  the  value  obtained  from  above. 

This  formulation  is  also  applicable  for  DC  problems  (since  the 
gauge  is  independent  of  frequency),  and  allows  vector  potentials 
to  be  used  in  non-conducting  regions  (with  the  gauge  effectively 


becoming  the  Coulomb  gauge  in  those  volumes).  Transient  so¬ 
lutions  to  the  above  equation  can  also  be  obtained  using  a  time 
integration  scheme.  Results  for  a  transient  problem  are  shown 
in  Figures  1  and  2,  showing  half  the  geometry  (see  Problem  14 
of  the  TEAM  Workshop  (3])  and  the  time  variation  of  power  loss 
in  the  casing. 


Figure  1:  Geometry  of  a  transient  eddy  current  field  problem 


Figure  2:  Solution  to  transient  eddy  current  field  problem 


The  effects  of  velocity  can  be  included  also,  leading  to  the  extra 
terms  in  eqn  (2)  involving  the  velocity  u.  The  resulting  equation 
is  complex  for  time  harmonic  problems,  but  is  no  longer  symmet¬ 
ric.  A  conjugate  gradient  squared  algorithm  is  used  therefore  to 
solve  the  linear  equations. 
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Formulation  For  The  Full  Frequency  Spectrum 


When  the  frequency  is  high,  it  is  necessary  to  include  the  effects 
of  displacement  currents.  The  full  system  shown  in  equation  (3) 
must  now  be  solved  (neglecting  velocity  terms). 


_  1„  _  dx 

V  X  -V  x  A  =  -O" 
/x  at 


d*A 
e  W 


oVV-t 


ovv 

dt 


(3). 


The  full  form  of  the  Lorentz  gauge  is  used  V •' A  =  —  ficr V  -pt-gf 
and  when  substituted  into  equation  (3)  leads  to  the  new  govern¬ 
ing  equation  (after, having  applied  the  GaJerkin  procedure) 


V  x  W-V  x  Adfl  +  fv-W-V-AdCl- 

/*  / 

iuJaW-AdSl  -<J‘jcYf-A<Ul  + 

f  4 A ■  n  W ■  ndr 
Jr  0 


(4) 


0 


This  is  then  solved  using  a  standard  complex.ICCG  solver.  To 
be  able  to  recover  the  electric  field  (including  the  losses),  the 
solution  for  V'  must  be  obtained.  This  is  achieved  by  solving  the 
secondary  equation  V  •  J  -f  V  •  ^  =  0. 

If  the  frequency  is  sufficiently  high,  the  conductors  can  be  re¬ 
garded  as  ideal  In  this  case,  the  terms  involving  c  can  be  ne¬ 
glected,  and  conductors  are  modelled  using  boundary  conditions 
alone.  This  leads  to  a  real  set  of  equations.  Alternatively,  the 
equations  can  be  structured  as  an  eigenvalue  problem,  of  the 
form  Ku  =  uiJMu,  which  can  be  solved  to  find  the  resonant 
frequencies  of  the  system. 

A  typical  example  is  shown  below  shows  the  standing  waves  gen¬ 
erated  by  a  short  circuit  termination  on  a  rectangular  waveguide 
at  two  different  frequencies,  one  above  cutoff  and  the  other  below 
(showing  the  evanescent  nature  of  the  solution). 


Figure  4:  Field  Ev  in  short  circuit  waveguide  at  1  GHz  (below 
cutoff) 


foiKluswn? 


A  formulation,  similar  to  the  low  frequency  eddy  current  formu¬ 
lation  presented  elsewhere,  has  been  shown  to  be  extendable  to 
solve  high  frequency  applications.  An  eigenvalue  solver  has  been 
developed,  and  results  have  shown  that  the  approach  is  viable, 
and  does  not  produce  any  spurious  modes. 

The  full  set  of  Maxwell's  equations  have  also  been  implemented. 
The  resulting  equations  are  no  longer  of  an  eigenvalue  nature, 
and  a  deterministic  solution  method  is  required.  This  will  then 
enable  solutions  to  problems  in  which  lossy  materials  are  also 
present,  m  addition  to  lossless  dielectric  and  permeable  material 
types. 
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Figure  3:  Field  Ev  m  short  circuit  waveguide  at  10  Ghz  (above 
cutoff) 
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Magnetostatic  conservation  principles  and 
mixed  FEM. 
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Abstract 

la  ihh  paper  consider  the  mixed  FEM  in  connection  with 
conservation  principles  for  magnetostatic  problems.  The  usual 
F  E  M  can  be  viewed  «  giving  *  minimal  energy  m  the  bans  func¬ 
tion  space  under  consideration  It  does  not  satisfy  a  discrete  flux 
conservation  principle.  Mixed  FEM’s  have  the  property  that 
they  satisfy  such  principles.  The  mixed  FEM  is  explained  with 
the  help  of  a  simple  Poisson  problem.  Then  we  show  the  lack  of 
conservation  for  the  FEM.  Several  formulations  of  the  magneto- 
static  problem  are  discretised  with  the  mixed  FEM  and  conser¬ 
vation  principles  found.  Also  normal  and  tangential  component 
continuity  properties  of  vector  quantities  involved,  a  strong  point 
of  the  mixed  FEM,  are  discussed. 

1  Introduction. 

In  this  paper  we  investigate  the  conservation  principles  and  element 
interface  conditions  that  are  of  interest  for  magnetostatic  problems 
in  connection  with  mixed  FEM.  In  section  2  we  describe  the  mixed 
finite  element  method  for  the  simplest  Poisson  problem.  In  section  3 
we  show  with  a  simple  example  that  the  finite  element  method  does 
not  have  the  conservation  property  under  consideration.  In  section 
4  conservation  principles  for  magnetostatic  problems  are  discussed  in 
connection  with  mixed  FEM  discretisations  The  term  "exact"  will 
be  used  to  indicate  that  apart  from  rounding  errors  two  numbers  are 
equal,  so  the  fact  that  equality  is  not  influenced  by  discretisation. 

2  Mixed  Finite  Elements  for  the  Poisson 
problem. 

The  mixed  finite  element  method  it  discussed  here  only  with  respect 
to  applications  For  more  basic  mathematical  considerations  JlJ  may 
be  consulted  In  this  section  we  explain  the  mixed  FEM  for  the  Pois¬ 
son  problem  At*  =  p,  on  a  region  (1  cSl  with  boundary  T,  and  u 
given  on  T.  It  enables  us  to  explain  the  essence  without  unnecessary 
complications.  The  basic  idea  is  to  split  the  equation  in  two  first  order 
equations 

a  =  Vu  (1) 

V-a  =  p  (2) 

The  mixed  FEM  formulation  for  this  problem  then  is 

(f,r)  +  («,VT)  =  r,  (3) 

(V.cr.ri)  =  r,  (i) 

Vr  e  IV.  Mid  V*  €  V.,  where  { ,  )  it  defined  by  //„  and  rj  itemi 
from  the  boundary  londitroni,  IV.  C  H(dir,U)  and  V.  r.  1,(0)  In 
practice  the  simplest  basis  functions  for  a  triangular  mesh  are  given 
by  the  u,^  constant  on  each  triangle  and  c,t  piece  wise  linear  vector 
functions  with  continuous  normal  components  on  the  edges.  These 
are  e  g.  the  lowest  order  Raviart  Thomas  spaces,  (2],  A  basis  for  the 
vector  space  is  given  by  q,(z  -  x„y-  yt)T  where  a,  is  the  normalizing 
factor.  The  set  of  equations  stemming  from  this  discretisation  can  be 
represented  by 

(b*o)(a)  =  (*)  (5> 

with  A  =  =  ((dli  V  -rj)).  The  rt  are  a  basis  for  V*  and 

&  4  b**1*  f°r  Wh>  The  unknown  are  the  normal  components  at  the 
element  edges  and  the  constants  per  element.  Here  we  have  not  per¬ 
formed  a  partial  integration  on  the  V-  applied  to  the  vector  unknown 


as  is  done  m  ordinary  FEM  This  is  the  basis  of  the  conservation  prin¬ 
ciple  that  we, will  come  back  to  later.  Contrary  to  what. we  would 
expect  from  the  differentiability  requirements  displayed  by  the  oper¬ 
ational  formulation,  the  potentials  are  represented  as  piece  wise  con¬ 
stants,  the  vector  unknowns  as  linear  functions.  To  Solve  equation  5 
for  the  potential  values  after  elimination  of  the  vectors  one  encounters 
the  matrix  BTA_1B  .  This  is  not  an  M  matrix  and  it  is  also  dense. 
One  may  wonder  whether  there  is  any  advantage  in  not  eliminating 
the  matrix  A  and  solving  with  e  g,  a  preconditioned  iterative  method. 
I  have  no  knowledge  of  any  special  properties  that  would  make  this 
advantageous.  The  usual  procedure  is  (3]  the  Introduction  of  disconti¬ 
nuities  m  the  edge  normal  components  of  the  vector  unknown  in  the 
space  of  vector  functions  The  continuity  is  enforced  by  adding  extra 
equations.  The  extra  equation  number  is  compensated  by  the  fact  that 
now  there  are  two  unknown  per  edge.  There  still  are  the  same  number 
of  unknown  and  equations.  The  equation  5  now  is  replaced  by 

!  La-T*^i  LuV-T-^SnXTin  = 0  (6) 

/ =  J JnpMo  (7) 
= 0  <8> 

Vr  €  IVk.Vtf.  €  and  V£  G  A*,o  with  iV  the  space  of  linear  functions 
on  each  triangle,  Ah£  the  space  of  edge  valued  functions  that  are  equal 
to  (  on  T.  This  set  of  equations  can  be  represented  by 


where  <r%u\A*  and  r*  are  the  logical  replacements  of  the  variables 
<r,t*  and  r  as  used  in  equation  5.  It  is  shown  in  (1]  that  the  matrix 
in  equation  2  is  a  row  diagonally  dominant  M-matruc  for  non  obtuse 
triangles.  The  solution  obtained  with  both  formulations  is  exactly  the 
same.  It  is  clear  that,  for  the  same  number  of  triangles  we  have  more 
unknown  in  the  mixed  FEM  than  in  an  ordinary  FEM.  For  each  tri¬ 
angle  there  are  three  unknown  edge  variables,  one  unknown  potential 
value  and  three  unknown  vector  normal  components.  Per  two  trian¬ 
gles  there  is  one  edge  variable  A,  So  for  N  triangles  approximately 
(boundary  conditions)  5  5N  unknown.  However  we  may  eliminate  the 
constant  potential  values  and  the  normal  components  of  the  vector 
unknown  The  system  for  the  edge  variables  still  is  sparse,  with  m  two 
dimensions  only  a  five  point  coupling.  This  implies  that  the  matrix 
has  less  nonzero  values  on  each  row  then  usual  with  FEM.  The  number 
of  unknown  now  is  approximately  1  5N.  For  FEM  we  can  not  exactly 
make  a  similar  calculation  because  the  number  of  unknown  depends 
on  the  number  of  triangles  coming  together  in  one  vertex,  However 
if  we  assume  an  average  of  six  triangles  per  vertex  we  have  approxi¬ 
mately  3N/6  unknown.  The  conclusion  is  that  the  number  of  unknown 
is  more  but  the  number  >>f  nonzeroes  is  less  in  mixed  FEM  than  in  or¬ 
dinary  FEM.  There  is  no  evidence  known  to  me  clearly  showing  the 
properties  of  the  two  types  of  matrices  with  respect  to  preconditioned 
iterative  methods.  It  also  is  unknown  to  me  whether  the  same  number 
of  triangles  is  needed  for  equal  accuracy. 

3  Conservation  principles  and  FEM. 

The  FEM  often  is  derived  by  requiring  an  energy  functional  to  have 
a  minimum  value  over  a  certain  finite  dimensional  space  7  he  con¬ 
tinuous  solution  often  satisfies  a  conservation  principle  at  the  same 
time  So  for  2  which  we  may  interpret  as  an  electrostatic  equation 
relating  the  displacement  vector  D  s  Vu  to  the  charge  density  p  the 
conservation  principle  states  that  the  charge  in  a  volume  equals  the 
tot  al  displacement  over  the  boundary  of  the  volume  A  simple  example 
shows  that  within  the  representation  and  spaces  used  m  the  FEM  this 
is  not  exactly  the  case  for  the  FEM  solution  Consider  the  example 
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«ir  =  ltu(0)  =  u(3)  5=  0  on  the  segment  (0,3j.  For  the  conservation 
principle  we  should  find  u,(3)  ~  ut(0)  =  ldz  =  3.  However  the 
FEM  solution  with  one  unknown  at  x=l-and  linear’  basis  functions- 
will  give  2  25  instead  of  3,  For, the  mixed  FEM  discussed  in  section 
2  the  conservation  is  guaranteed  by  the  fact  that  one  of  the  two  (or 
three)  basic  equations  is 

J  J(V  •  a  -  p)$do  =  0  (10) 

with  4>  constant  per  triangle.  We  therefore  may  talced  =  Ions 
triangle  Using  the  Gauss  theorem  immediately  gives  the  conservation 
principle  It  should  be  stressed  that  this  is  mainly  due  to  the  fact  that 
we  did  not  use  a  partial  integration  to  denye  in  the  first  place. 

4  Conservation  principles  for  magnetostatic 
problems. 

hirst  we  consider  the  usual  magnetostatic  vector  potential  formulation. 
So 

Vxi/V*A~J  (11) 

In  two  dimensions  this  is  exactly  equation  2.  However  the  interpreta- 
tion  of  the  conservation  principle  is  different.  We  have  V  X  A‘VAt  =  0 
and  f|V  X  A[J  =  flVAiH  where  A  a  (0,0,  At).  Therefore  the  conserva¬ 
tion  principle  is  simply  /H  •  ds  a  / /  Jdo.  Two  facts  are  interesting, 
the  conservation  is  per  arbitrary  group  of  triangles  exact  and  for  any 
element  edge  there  is  point  wise  tangential  component  continuity  of 
II.  It  should  also  be  pointed  out  that  the  classical  "spurious  sources" 
are  impossible  with  the  mixed  FEM.  In  three  dimensions  the  situation 
is  more  complicated  because  as  is  well  known  (4j  a  gauge  condition  is 
needed  to  ensure  uniqueness  for  the  vector  potential,  A  possibility  is 
the  inclusion  of  the  gauge  by  using  a  V-V  formulation  ts  e  g.  discussed 
in  (5).  I  suspect  that  also  here  a  conservation  principle  can  be  found, 
but  have  to  investigate  this  further.  The  othe.  formulation  that  is  of 
interest  is  (6]: 

V./x(Vf-Hc)aO  (12) 

where  Hc  is  as  defined  tn  (6).  The  first  ordtr  equations  are 

i/B  =  (V^-Hc)  (13) 

«  0  (14) 

where  the  way  p  is  treated  is  important.  Then  the  mixed  FEM  with 
normal  component  continuity  is 

=  0  (15) 

(V-B,u)  =  C  (16) 

Vr  €  JV\  and  Vu  €  Vj,  and  the  equations  with  normal  component 
discontinuity  for  the  basis  functions  (not  for  the  solution!). 

///n‘'B-r  +  ET///IWV-H,.)r 


=  0 

(17) 

/  1  Uv-bw°  =  ° 

(18) 

= 0 

(19) 

Vr  €  €  V*  and  V£  6  A^o-  Therefore  the  conservation  principle 

here  is 

yjf  Bdn  =  0  (20) 

for  T  the  boundary  of  any  set  of  elements.  As  usual  in  the  mixed  FEM 
we  also  find  normal  component  continuity,  point  wise  on  any  element 
edge.  Also  when  the  combination  of  a  reduced  and  a  total  potential 
(6)  are  used  the  formulation  automatically  gives  a  pointwise  B  normal 
component  continuity  at  the  interface.  For  Eddy  current  problems 
the  same  discretisation  principle  can  be  applied  but  the  simplest  ba«’S 


functions  no  longer  give  an  M-matrix  Solutions  for  this  problem  are 
discussed  in  |7)  and  (8) 

5,  -Concluding  remarks. 

from  the  considerations  given  in  this  paper  it  is  clear  that  conservation 
.principles  are  found  when  using; the  mixed' FEM  for  magnetostatic 
problems,  similarto  those  found  for  other,  pi  oblems.  The  efficiency 
•of  the  method  wr.t,  the  FEM'can  not  be  judged  from  theoretical 
considerations  as  there  are  on  the  one  hand  more  unknown  while  on 
the  other  hand  the  matrix  is  more  sparse.  On  top  of  this  it  is  not  clear 
how  the  number  of  necessary  triangles  differs  from  the  number  needed 
for  the  FEM  Practical  experience  with  mixed  FEM  for  magnetostatic 
problems  would  be  very  interesting. 
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ABSTRACT 

This  paper  presents  a  finite  element  technique  for 
the  computation  of  open  boundary  problems  using 
transformations.The  principle  of  the  method  is  brifiy 
developed.The  efficiency  and  the  easiness  of  the 
implementation  renders  this  method  very  interesting. 
An  example  of  an  'magnetodynamic  application  is 
presented. 

INTRODUCTION 

Unbounded  problems  in  electromagnetism  science 
are  encountered  in  a  great  number  of  applications.  The 
resolution  of  such  problems  is, not  obvious  using  the 
finite  element  method  (FEM).  Indeed  the  magnetic  or 
electric  fields  far  away  from  the  structure  can  not  be 
neglected. 

In  order  to  overcome  this  limitation  of  the  FEM, 
several  techniques  have  been  proposed  by  differents 
authors1*2.  According  to  P.Bettes2,  they  might  be 
separated  into  five  different  groups: 

-  the  method  of  truncation  wich  consist  to 
approximate  the  infinite  domain  by  a  closed  domain, 
which  should  be  sufficiently  large 

-  the  method  called  "Ballooning"  developped  by 
P.P.  Silvester  and  al2 

-  the  "infinite"  or  "mapped"  elements4 

-  the  use  of  transformations  reducing  an  open 
infinite  region  to  a  closed  one-’ 

-  the  methods  coupling  FEM  with  an  analytical  or 
numerical  method  (BF.M)m' 

Among  these  methods,  we  have  used  particular^ 
those  involving  transformations.  The  theory  applied  is 
to  reduce  an  open  infinite  region  to  a  closed  finite  one. 
Up  to  now,  the  transformation  used  (inverse 
tranformation  like  1/r)  were  conformal  mappings 
logically  limited  to  applications  in  2D  cartesian 
coordinates  from  a  geometrical  point  of  view  and  to  the 
Laplace  equation  from  a  physical  point  of  view  (the 
Laplace  equation  is  invariant  to  conformal  mapping). 
In  previous  papers,  J.F.  Imhoff  ^demonstrated  that 
these  transformations  need  not  to  be  conformal  and 


therefore  may  be  applied  to  problems  other  than  the 
Laplace  equation  in  2D  cartesian  coordinates.  He  has 
particulary  extented  the  possibilities  of  inverse 
transformation  applications  to  axisymmetrical  and  3D 
problems  and  has  developed  a  transformation  which 
even  in  2D  is  not  conformal  but  whose  introduction 
into  a  3D  finite  element  software  is  easier  than  that  one 
of  an  inverse  transformation.These  works  being  very 
interesting,  we  have  been  searching  for  new 
transformations  and  introduce  them  into  the  software 
FLUX3D  with  regard  to  a  great  efficiency. 

PRINCIPLE  OF  THE  METHOD 

The  principle  consists  in  considering  that  the 
entire  2D  or  3D  space  E  is  the  sum  of  a  closed  internal 
subdomain  E;nt  (treated  by  the  FEM)  and  of  an  open 
boundary  external  subdomain  Eexk 


Open  boundary  rroblcm  Closed  tamdary  problem 


Figure  1 

In  order  to  take  into  account  the  external  domain, 
we  map  it  into  a  closed  domain  Eext_trans  by  means  of 
a  bijective  spatial  transformation  as  it  appears  in  figure 
1.  This  domain  Eext  trans  is  meshed,  and  used  for 
solving  the  problem.  The  equations  of  the  physical 
problem  are,  of  course  modified  by  the  transformation. 
For  electromagnetic  problems,  the  alterations  of  the 
equations  are  obvious.  For  example,  using  Galerkin 
method  for  the  magnetostatic  scalar  model,  we  usually 
have  to  compute : 
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where  xare  the  coordinates  in  the  real  space  E^,  X 
the  coordinates  in  the  image  space  [3x/3X] 

is  the  Jacobian  matrix  of  the  transformation,'  Wj  are  the 
usual  finite  element  functions  in  the  mapped  domain 
and  wj  are.  weighting  Galerldn  functions,  which  are 
known  thanks  to  their  images  W;. 

From  all  the  transformations  we  tried,  we  have 
kept  the  spherical  shell  transformation0  and  the 
parallelepipedic  transformation*0.  The  way  we 
implement  these  transformations  lead  to  the  creation  of 
an  efficient  tool.  The  modification  of  the  integration 
procedure  is  made  by  subroutines  wich  carry  out. the 
modification  of  the  Galerkin  functions  due  to  the 
transformation.The  post-procesing  (fields  computation 
in  the  infinite  region  for  example)  is  made  by  the  same 
way  using  the  same  subroutines. 

EXAMPLE 

The  Figure  2  presents  a  magnetodynamic  problem: 
computation  of  eddy  currents  in  a  thin  sheet  conductor 
induced  by  a  exterior  sinusoidal  field  (0.1  Hz).  The 
sheet  conductor  is  modelled  by  a  A-V  formulation 
applied  to  surface  element.  The  exterior  uniform  field 
is  stmulated  using  the  reduced  scalar  potential  and  the 
rnfir  te  domain  is  modeled  by  a  parallelipipedic 
transformation.  Figure  2  shows  eddy  currents  in  the 
sheet  and  reduced  scalar  equipotentials  in  the  air. 


Infinite  domain 


CONCLUSION 

The  method  used  gives  very  good  results  for  the 
exterior  region  and  is  very  cheap  compared  to  the 
"Balloning"  method  or  hybrid  method  (FEM-BEM). 
The  employed  transformations  seem  to  be  similar  to  the 


transformation  developed  by  Zienkiewics,  Emson  and 
Bettes  which  introduced  the  mapped  element-They 
give  the  same  relative  and  absolute  error.  Howewer 
their  transformation. can  only  simulate  the  exterior 
region  by  using  one  element.  With  the  method 
presented  in  this  paper  we  make  a  global  definition  of 
the  transformation  which  allows  to  mesh  the  exterior 
region  as  thickly  as  necessary  in  ordrer  to  improve  the 
solution.  r 
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Abstract 

The  design  of  optimal  geometric  boundaries  to.  achieve  a 
prescribed  behaviour  of  the  electromagnetic  field  has  been 
attracting  more  and  more  attention  recently.  This  paper 
summarizes  the  application  of  "evolution  strategies"  11,2,3] 
to  find  the  global  minimum  of  a  given  objective  function 
These  strategies  utilize  a  simplified  model  of  biological 
evolution.  Another  method  used  for  global  minimization 
is  known  as  “simulated  annealing"  14),  a  method  based  on 
simulation,  of  the  physical  process  of  cooling  solids  to  their 
lowest  energy  configuration.  Once  a  solution  in  the  neigh¬ 
bourhood  of  the  optimum  has  been  obtained  by  the  methods 
mentioned  above,  a  deterministic  procedure,  eg.  the  Gauss- 
Seidel  strategy  or. a  bjeher  order  deterministic  strategy  can 
be  used  to  "identify"  the  optimum  in  an  efficient  way.  The 
determination  of  the  magnetic  field  is  carried  out  by  a  finite 
element  calculation.  The  efficiency  of  the  optimization 
strategics  has  been  investigated  using  a  simple  two  dimensional 
model  with  nonlinear  material  characteristics. 

Introduction 

Optimal  design  requires  the  solution  of  a  synthesis  problem 
The  goal  Is  to  produce  a  given  field  by  a  sought  geometry. 
This  task,  for  example.  Is  encountered  If  one  has  to  design 
the  magnet  system  of  an  NMR  CS3,  a  medical  diagnostic  equip¬ 
ment.  Both  deterministic  and  nondeterministlc  optimization 
methods  need  the  successive  recalculation  of  an  electromag¬ 
netic  field  problem  using  each  time  a  more  or  less  different 
geometry.  In  order  to  keep  the  number  of  trials  necessary 
to  reach  the  optimal  shape  of  the  magnets  or  distribution 
of  coils  as  low  as  possible,  some  strategy  in  performing  the 
variations  Is  required.  Recently,  evolution  strategies  have 
been  introduced  for  the  solution  of  electromagnetic  optimi¬ 
zation  problems. 

Evolution  Strategies 

Evolution  strategies  applied  to  electromagnetic  problems  (5,6] 
utilize  a  simplified  model  of  biological  evolution  to  arrive  at 
an  optimum  of  a  given  technical  problem  They  are  more  likely 
to  arrive  at  the  global  optimum  than  deterministic  methods. 
To  begin  with,  a  set  of  variables  which  will  be  modified  In 
the  course  of  the  optimization  process  must  be  set  to  their 
Initial  values.  Common  to  all  evolution  strategies  fs  the 
generation  of  one  or  more  "descendants"  from  one  or  more 
"parents"  by  tho  addition  of  a  random  vector  with  normally 
distributed  components  and  some  mechanism  to  choose  the 
descendant  to  "survive". 

(1*1)  Evolution  Strategy 

The  simplest  scheme  simulating  natural  proceedings  is  the 
(1*1)  strategy.  In  each  generation  a  single  descendant  springs 
off  a  single  parent.  It  replaces  the  parent  only  if  it  leads  to 
a  better  quality  of  the  objective  function.  This  means,  that 
no  deterioration  occurs  from  one  generation  to  the  next. 
After  a  certain  number  of  generations  the  probability  of 
arriving  at  a  "better"  descendant  is  evaluated  and  the  standard 
deviation  of  the  components  of  the  random  vector  (mutual 
step  width)  is  adaptively  increased  (too  many  successes)  or 
decreased  (too  few  successful  attempts). 

L\M  Evolution  Strategy 

This  higher  order  strategy  makes  use  of  some  additional 
features  of  biological  evolution,  namely  population,  finite 
live  span  of  an  individual  and  genetically  adjusted  mutability 
6  descendants  are  generated  from  one  parent  using  two  diffe¬ 


rent  values  for  the  mutation  step  width.  The  new  parent  is 
selected  from,  among  the  descendants  only.  The  adaptive 
control  of  the  mutation  step  width  is  achieved  by  the  "survival" 
of  the  strategy  variables  of  the  chosen  descendant.  The  flow 
chzrt  of  this  scheme  is  shown  in  Fig.l  One, major,  advantage 
of  this  and  other  higher  order  evolution  strategies  compared 
-to  the  (Id)  strategy  is  the  fact  that  sometimes  a  new  accepted' 
solution  yields  a  worse  quality  than  the  current  solution. 
The  benefit  of  allowing  deteriorating  solutions  is  to  help 
the  search  avoid  becoming  trapped  In  a  local  minimum  of 
the  objective  function. 

(10,100)  Evolution  Strategy 

In  this  multi-membered  scheme  another  aspect  of  evolution, 
recombination,  can  be  introduced  into  the  .model  for  the 
generation"  of  descendants.  Every  set  of  object  variables  has 
its  corresponding  set  of  strategy  variables.  10  parents  produce 
100  descendants,  allowing  the  10  best  ones  to  survive  to  become 
the  new  parent  generation. 


Fig.l  Flow-chart  of  the  (1,6)  evolution  strategy 


Simulated  annealing 

The  name  of  this  method  derives  from  an  analogy  between 
simulating  the  annealing  of  solids  and  solving  optimization 
problems.  Starting  from  an  initial  state  ("initial  temperature  ) 
the  trial  variables  are  perturbed  at  random  to  a  new  state 
in  the  neighbourhood.  This  state  corresponds  to  a  new 
temperature,  specified  by  a  cooling  schedule  If  the  new 
state  represents  a  reduction  in  the  value  of  the  objective 
function,  then  the  transformation  to  the  new  state  is  accep¬ 
ted.  Similar  to  higher  order  evolution  strategies  there  is  a 
chance  that  a  state  yielding  a  worse  quality  than  the  current 
one  will  be  accepted.  The  acceptance  probability  function, 
based  on  the  Boltzmann  distribution,  takes  the  form  of 
expt-AQ/kTJ,  where  AQ  is  the  increase  in  quality,  T  is  the 
control  paramter  ("temperature”)  and  k  is  an  appropriate 
constant.  The  flow  chart  of  this  scheme  is  shown  in  Flg.2 


Flg.2  Flow-chart  of  ’Simulated  Annealing’ 

Using  this  method,  a  cooling  schedule,  the  number  of  repe¬ 
titions  having  a  constant  temperature  and  a  function  con¬ 
trolling  the  stepwldth  are  required  and  maqy  different  schemes 
to  control  the  algorithm  have  been  suggested  (7), 

Using  for  instance  the. 0,6)  evolution  strategy  an  Initial  step- 
width  o  and  a  stepwldth  factor  a  have  to  be  specified  only. 
This  fact'seems  to  make  evolution  strategics  easier  to  handle. 

Numerical  Investigations 

Three  currently  very  Important  strategies,  the  (1*1)  evolution 
strategy,  the  (1.6)  evolution  strategy  and  the  simulated 
annealing  method  have  been  applied  to  a  very  simple  2D 
nonlinear  magnetostatic  optimization  problem  (FIg.3,  Initial 
geometry:  planar  pole  x»=X23$0.  x»*100,  dimensions  In  mm). 
The  pole  shape  of  a  dipole  magnet  has  been  optimized 
(shimming  problem),  varying  the  trial  variables  xi.  xa  and  x>. 
The  goal  was  to  achlve  a  homogeneous  field  of  1  Tesla  In 
the  prescribed  region  under  the  pole. 


!y 

L 
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Flg.3  Shimming  problem 

Using  the  (1,6)  evolution  strategy  the  initial  stepwldth  o  was 
set  to  10  mm  and  the  stepwldth  factor  a  was  set  to  1,3 
Using  simulated  annealing  the  temperature  was  assumed  to 
decrease  In  a  linear  manner  from  Tsuirt  -  1000  degrees  to 
Tend  •  0  degrees. 


The  stepwldth  (ostart  -  10  mm,  0End=  1mm)  was  assumed 
to'decrease  In  a  exponential  way,  allowing  more  significant 
changes, in,  geometry  at  higher  temperatures: 

-  <*P<-T«».  <» 

where  x  =200  degrees 

Every1  single  test' case  was  stopped  after  100  finite  element 
calculations. -The  best  solutions  arrived  at  the  following  values 
for  xi,  X2  and  xa  (dimensions’  In  mm). 


XI 

XX 

X* 

(Ul) 

St. 67 

31.77 

907 

(1,6) 

St. 85 

288 

9187 

SA 

SI.  14 

2962 

9174 

Flg.4  “optimized"  paramters  of  the  shimming  problem 

The  distribution  of  the  magnetic  flux  density  In  the  region 
of  interest  obtained  using  the  Initial  geometry  and  the 
above  mentioned  best  solutions -Is  shown  in  Fig.S.  All 
methods  under  Investigation  lead  to  very  good  results. 
Using  a  finer  discretization  even  better  'results  will  be 
obtained. 


Fig.S  Distribution  of  the  flux  density  In  the  region  of  Interest 
References 

(1)  I.  Rechenberg,  Evolutionsstrategle,  Optlmlerug  techni- 
scher  Systeme  nach  Prlnzipien  derbiologischen  Evolution 
Stuttgart-Bad  Cannstatd.  Frommann-Holzboog,  1973 
(21  H.P.  S  hwefel,  Numerlsche  Optlmlerung  von  Computer- 
Modellen  mlttels  dcr  Evolutionsstrategle.  Basel  und 
Stuttgart:  Birhhfcuser  Verlag,  1977 
(3)  A.  Scheel,  Beitrag  zur  Theorie  der  Evolutionsstrategle, 
doctoral  thesis,  Berlin  198S 

(41  j  Simkln,  C.W.Trowbrldge."Optlmlzation  Problems  In 
Electromagnetics",  4’*  Biennial  IEEE  Conference  on 
Electromagnetic  Field  Calculation,  Toronto  1990 
(SI  K.Preis,  C.Magele,  O.Biro,  “Fern  and  Evolution  Strategies 
In  the  Optimal  Design  of  Electromagnetic  Devices",  IFEE 
Trans,  on.  Mag.,  MAG-26,  2181-2183,  1990 
(61  K.Preis  et  al., "Comparison  of  Different  Optimization 
Strategies  In  the  Design  of  Electromagnetic  Devices", 
4,h  Biennial  IEEE  Conference  on  Electromagnetic  Field 
Calculation,  Toronto  1990 

l7J  N.E.Collins  et  al, ‘Simulated  Annealing  -  An  Annoted 
Bibliography'  Ameucan  Journal  of  Mathematical  and 
Management  Sciences,  Vol  8,  pp  20S  ff,  1988 


1595 


ELECTROMAGNETIC  SCATTERING*  USING  MIXED  FINITE- AND  INFINITE  ELEMENTS  IN  2D 

MALCOLM  S.  TOWERS,  JOHN  A.  R.  MACNAB,  and  ANDREW^McCOWEN 

Department. of  Electrical  and  Electronic  .Engineering 
University  College  of  Swansea  "  ~~ 

Swansea  U.K. 


Abstract  -  A  formulation  of  Helmholtz's  equation  has.  been 
devolved  to  allow  analysis  ’  of  electromagnetic  backscatter  in  2D. 
The  open  regions  of 'the -problem  domain,  are  handled  through 
the  inclusion,  of  infinite  quadratic,  wave  envelope^  elements 
which  serve,  to  enforce  appropriate;  far-field  attenuation.  The 
remaining  regions  including  the  scattering,  object  are;  modelled 
using  triangular  finite-  elements  with  standard  area  shape 
functions.  ,  . 

Tests  have  been  made  to  verify  the  technique  on 
homogeneous  ,  dielectric  and  conducting  bodies  with  known 
analytic .  solutions  and, .excellent 'agreement,  has  been  achieved. 
The  method  is  expected/ to  be  particularly,,  useful  for 
inhomogeneous  and ,  geometrically  Irregular^  problems. 

I.  INTRODUCTION 

Many  applications  are  'foreseen  for  a  numerical 
computation  scheme  that  will  efficiently  and  accurately  solve 
for  electromagnetic  incidence  on  arbitrary  bodies  or  scatterers 
of  resonant  sue  or  gieater.  Such  applications  include  the 
radar  signatures  associated  with  targets,  electromagnetic 
compatibility  (EMC),  the  design  and  siting  of  antennas  and  the 
hazardous  effects  associated  with  electromagnetic  radiation. 

In  this  paper  we  discuss  the  use  of  a  mixed  finlte/Infinitc 
element  scheme  to  solve  the  time  Independent  form  of 
Maxwell’s  differential  equations.  The  resulting  linear  equation 
system  is  highly  sparce  and  this  fact  1$  used  to  advantage  in 
their  storage  and  solution.  A  De  Launcy  mesh  is  used  to 
discretise  the  scatterer  and  the  Immediate  surrounding  medium 
and  infinite  elements  (1)  are  used  to  represent  the  remaining 
unbounded  medium.  Initial  results  In  2D,  from  the  newly 
developed  cede  SEMS,  will  be  presented  for  a  range  of 
scatterers  of  resonant  dimensions.  Far-field  scattering  is 
determined  directly  from  the  nodal  values  of  the  infinite 
elements  and  shows  excellent  agreement  with  the  available 
analytic  solutions.  Meshing  requirements  of  the  scatterers  and 
the  importance  of  the  positioning  of  the  infinite  elements  will 
also  be  discussed. 

II,  THE  FINITE/INFINITE 
ELEMENT  FORMULATION 

For  a  linear  source  free  medium  and  harmonic  time 
dependance  with  radian  frequency  a  each  component  of  the 
electric  and  magnetic  field  vectors  satisfies  the  equations 


vx  <i-  vxE)  -  kV  E  -  0 

Vr  r 

<i) 

vx  (ji  vxH)  -  k*f  H  -  0 

where  E  and  H  are  electric  and  magnetic  fields  and 
where  tt  and  nt  are  relative  permittivity  and  permeability 

respectively  and  k  -  u  ,s  tho  space  wave 

number.  Applying  a  Petrov-Gilerkln  formulation  to  either 
equation  leads  to  a  sparce  linear  system  of  equations, 

Ku  a  f  (3) 

where  u  is  a  vector  of  complex  field  components  at  nodes  of 
the  mesh  and  right  hand  side  vector  f  contains  boundary 
Integral  terms  representing  the  inclusion  of  incident  radiation 
into  the  problem  (2).  Matrix  K  is  the  result  of  assembling 
element  matrices  and  applying  boundary  conditions,  for 
example  at  the  surface  of  a  conductor,  and  takes  the  form  fo- 

K,j  -  }  [  |  I-'XW,)  •  (■"‘Nj)  -  l<rk!  W,Nj  ]  <U)  (.) 


where  Wj  is  .the 'global 'weight -function  associated  with  node  i, 
Ni  is  the  global  *  shape  function*  for  node  )  and  the  integral  is 
taken  over  the  whole  mesh.  -For  -equation  (1),  simply 
interchange  <r  and  equation  (4). 

In  two-dimensional  .‘problems,  such  as  those- reported  here, 
equations  (l)  and- (2)  reduce  to  problems  ,in -a  single  complex 
field  •.  component  f with  TM  or  TE  polarizations  respectively  and 
the ’firsts  terra  of  the  integrand  in  equation. (4)  reduces  to  the 
dot-product  of  the , 2-dimensional  gradiants  of  Wj  and  N:. 

The- finite  elements  ^radiate  from  the  outer  boundary  of 
the  finite,  element  mesh  into -the  remaining  open- space  and, 
following  Zienkiewicz  et  la  (3),  are  provided”  with  shape 
functions  in  2-D  of  the"  form 

N  -  r*  M-(f,  >l>  .xp  {-Jit  (r/R  -  I)  j.  (5) 

R  is  the  distance  of- the  element’s  inner  edge  from  the  decay 
centre  chosen  for  the  mesh,  r  is  the  distance  of  point  (f,  17) 
from  the  centre  and  M.  is  a  standard  Lagrange  polynomial 
shape  function  with  quadratic  variation  in  the  radial  direction, 
f.  and  -  linear  Jn  17.  Reciprocal  mapping  >s  used  to  relate 
distance  r  to  the  element. local  variable  ?, 

t  ■  2R/(I  -  J)  (6) 

and  clearly  r  tends -to*  infinity  as  t  tends  to  unity. 

If  the  inverse  transformation  is  applied  to  M  then  a 
polynomal  In  r“»  results.  The  additional  factor  of  ri  In 
equation  (5)  guarantees  the  leading  term  in  N  decays  as  ri. 
R  is  normally  chosen  constant  In.  each  element  (4)  (5)  but  in 
the  current  work  R  Is  Interpolated  through  the  local  variable 
>7.  This  has  the  advantage  of- restoring  C0  continuity  to  the 
shape  functions  *  but  care  must  be  taken  to  ensure  that  the 
integrals  In  equation  (4)  are  convergent  In  the  infinite 
elements. 

By  choosing  weight  functions  as  the  complex  conjugates  of 
the  shape  functions  a  wave  envelope  formulation  of  the  infinite 
elements  results  (6)  {7)  due  to  cancellation  of  the  complex 
exponential  •  terms.  This  allows  Gauss-Legendre  integration  to 
be  used  In  the  standard  way.  The  finite  elements  used  are 
three  noded  triangles  with  standard  linear  area  shape  functions. 
Since  these  arc  purely  real  the  resulting  matrices  coincide  with 
a  Galerkin  formulation  for  the  finite  elements.  Employing 
triangles  has  the  advantage  of  allowing  De  Launey  meshes  to 
be  used  with  their  enormous  power  and  versatility  to  represent 
geometrically  and  materially  irregular  problems  with  highly 
controllable  local  mesh  density. 

Far  field  calculation  may  be  performed  directly  from  the 
solution  on  the  infinite  element  nodes,  without  recourse  to 
elaborate  surface  Integral  formulations  (8|.  Along  a  radial 
edge  of  an  infinite  element  the  limit  is  simply  given  by 

{  r*  u(r)  }  — .  R*  <-u(  +  2  /I  u;  )  (7) 

as  r  — *  » 

where  u,  is  the  field  on  the  inner  node,  distance  R  from  the 
decay  centre  and  u2  is  on  the  outer  node,  at  2  R  from  the 
centre.  Linear  interplation  of  these  results  is  used  for  values 
between  the  edges. 

HI.  RESULTS 

The  problem  domain  within  SEMS  is  split  into  2  regions, 
see  Fig.  1,  namely  ft,  the  region  meshed  with  finite  elements 
and  fij  the  unbounded  inifinite  element  region  i>  the 

boundary  separating  ft,  and  ft}  and  J,  is  one  or  more  closed 
boundaries  within  0,  defining  the  surface  of  perfect  conductors 
which  do  not  require  internal  meshing.  The  initial  tests  for 
the  code  and  formulation  have  been  made  against  the 
benchmarks  of  scatterers  having  an  analytic  solution 


Fig.  I,  Definitions  of  regions  and  boundaries 


The- following' set  of ‘ typical, results  shown  in  Figs,  2  (a), 
(b)  and  (c)  gire  the  scattering  width  for  TE  incidence  on  a 
circular  cylinder  of  ka ‘  =  2.09  where  a  is  the'  radius  of 'the 
cylinder.  Fig.  2  (a)  .is  -for-  a  .perfectly,  conducting  -cylinder 
when  2,  is  Vt  wavelength  from  the  surface  of  the 


O 


Fig.  2.  Computed  results  of  TE  scattering  width  for  circular 
cylinders  of  radius  a  with  ka  a  2.09  and  material  type 
(a)  perfect  conductor,  (b)  <r  «  4,  (e)  rr  a  4  .  jit. 


conductor.  61  infinite  elements  were  used  to  determine  the 
far-field  and  the  corresponding  analytic  solution  was  sampled 
to  coincide  with  the  sides  of  the  elements.  Reducing  the  fi, 
region;  to, jiKt.  one  layer  of  finite  elements  slightly  deteriorated 
the  solution"-  but  by  no  more  than  1  db  from  the  analytic 
solution  ^  Figs.  2  (b)  and  (c)  are ‘for  a  pure  dielectric-  of 
permittivity  4  and  a  lossy,  one  of  4  -j««  respectively.  In  both 
cases  the  0,  region  Mnlcuding  the  dielectric  was  meshed  with 
-approximately  10  nodes  per  dielectric  wavelength,  and  only  one 
layer  of  finite  elements  outside  the  cylinder  resulting  in  47 
infinite  elements  to  compute  the  far-field. 

-IV.  CONCLUSIONS-  . 

A  Petrov-Oalerldn  formulation  has  been -  applied  to 
Helmholtz's  equations- using-  a  mixed  -finite/infinite  'element 
mesh  to  compute  bistatic  scattering  from  -2D  objects. 
Excellent  agreement  between  computed  far-field  and  'the 
-analytic  solution  for  =  a  range  of '•circular  cylinders  has  been 
demonstrated.  .  Post -processing  has  been 'greatly  simplified- by 
computing  the  far-flelds  directly  from  the  nodal  values  of  the 
infinite  elements. 

Further  work  including 'the  positioning  and  shape  of' the 
£2  boundary  for  non-cylindrieal  scatterers  and*  a  comparison  of 
the  far-field  computed  from  surface/volume  currents  with  those 
directly  from  the  infinite  elements  will  also  be  discussed -in  the 
paper. 
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Abstract  -  The  paper  deals  with  a  nonconven- 
tional  thermodynamical  modelling  of  -  interac¬ 
tions  of  weak  and  high  energetic  neutrons  with 
a  defective,  deformable  semiconducting  body. 
Field  equations  and/  boundary  conditions,  are 
derived.  Some  examples  .of  interactions  are 
analytically  and .  numerically  investigated. 

1.  INTRODUCTION 

Contemporary  technologies  demand  in  many 
situations  materials  of  special  properties  to 
obtain  expected  results  which  .come  .from 
simultaneous  Interactions  of  physical  ■  fields 
occurlng  in.  The  neutrons  passing  through  a 
crystal  occasionally  collide  with  the  nuclei 
of  atoms  and  transfer  them  kinetic  energy.  If 
this  recoil  energy  Is  large  enough  the  atom 
concerned  will  be  knocked  out  of  Its  lattice 
site  and  will  travel  through  the  crystal, 
colliding  with  its  neighbours  and  may  be 
displacing  them  also  from  their  sites.  The  end 
product  will  be  a  numbor  of  vacant  lattice 
sites  and  an  equal  numbor  of  displaced  atoms. 
Having  produced  the  lnten  stitlals  and 
vacancies  the  thermal  vibration  of  the  crystal 
lattice  causes  these  so-called  point  defects 
to  migrate.  During  migration  the  point  defects 
may  attach  themselves  forming  dislocations.  It 
is  well-known  that  tho  dislocation  line  is  a 
reason  of  an  elastic  distortion  field  around 
It,  but  the  dislocation  core,  as  a  singularity 
of  that  field,  posesses  very  interesting 
physical  properties.  It  can  be  charged  for 
some  crystal  bonds  are  broken  there.  It  Is  a 
kind  of  a  trap  for  charge  carriers  In  a 
semiconductor,  it  forms  a  line  or  a  capillary 
tube  which  considerably  Influences  the 
transport  of  particles  through  the  crystal,  It 
Is  tho  line  forming  a  special  distribution  of 
spins,  and  the  like.  So  that,  if  the  neutron 
beam  has  the  energy  sufficiently  small  tho 
other  kinds  of  Interactions  with  the  body 
occur.  Neutrons  may  diffuse  along  the 
dislocation  network.  Because  both  have 
magnetic  moments,  there  is  an  Interaction 
between  neutrons  and  electrons. 

The  proposod  theory  is  applied  to  soveral 
particular  cases  of  interactions  between 
fields  mentioned  above. 

2.  THERMODYNAMICAL  MODEL 
The  object  of  our  consideration  Is  a  defor¬ 
mable,  homogenous,  isotropic  and  defective  by 
dislocations  semiconducting  body  of  magnetic 
properties.  It  Is  assumed  that  the  following 
fields  Interact  with  each  other: 

(1?  the  elastic  field  described  by  the  sti'ess 
tensor  a. .  and  the  small  strain  tensor  c  , 
tj 

CIO  the  thermal  field  described  by  the  tempe¬ 
rature  T  and  the  heat  flux 

CUD  the  electromagnetic  field  described  by 
the  electromotive  intensity  3  and  the 
magnetic  induction  B  , 


(iv?  the-  charge  carrier  field  described  by  the 
concentration  n  and  the  flux  j”  for 
electrons  and  the  concentration  p  and  the 
flux  for  holes, 

(v?  the  neutron  field  In  the  case  of  a  weak 
energetic  beam  when  its  magnetic  properties 
dominate,  described  by  the  magnetic  moment 
density  p.  and  the  .flux  T)v]  (the  neutron 
field  is  understood  here  as  a  distribution 
of  travelling  and  rotating  magnetic 
moments?, 

<vl>  or  the  neutron  field  in  tho  case  of  a 
hard  neutron  beam  describe'1  oy  the  particle 
number  density  t»  and  the  fiox 

<vll>  the-  dislocation  field  described  by  the 
variable  responsible  for  its  density  a  .and 

the  flux  r,  , . 

Taking  Into  account  the  relaxation  features 
of  tho  thermal  charge  carrier,  neutron  and 
dislocation  fields  and  basing  on  the  general 
philosophy  of  so-called  extended  Irreversible 
thermodynamics,  the  Independent  constitutive 
variables  are  represented  by  the  set 

G  *  \c.  ,B  ,T,T  ,n,  n  .  ,p,p  .  ,p  ,(or  tt? 

\  l|  *  l 

‘V'w-C’  v<or  w }  <21> 

This  specific  choice  shows  that  magnetic 
exchange  effects  are  discarded  and  that  we 
Ignore  the  relaxation  features  of  the  mecha¬ 
nical  field  (viscosity?  so  that  o\^  Is  not  In 

tho  set  (2.1?.  Moreover,  we  assume  that  the 
contribution  of  the  neutron  and  dislocation 
fields  into  processes  Is  of  such  a  character 
that  u  and  a  ,  can  be  understood  as  the 
v  vj 

internal  variables. 

Neglecting  mass  of  neutrons  In  comparison 
to  the  solid  we  take  the  continuity  equation 
In  its  classic  form  (p  denotes  the  density? 

p  +  p  V,  a  0.  (2.2? 

The  balances  of  momentum,  moment  of  momentum 
and  infernal  energy  for  the  considered  body 
read  assuming  that  the  electrical  polarization 
is  practically  negligible  (the  magnetic 
properties  dominate?  ^ 

pv-[a  .  CJn+  +  B  M+fJ-0 

4  1  .  )  v  jkwj  k  i  ,k  k 

e  ,  +■  c  *  0  (2.3? 

»jk  |k  » 

where  v^  is  the  velocity  of  the  body  point, 
denotes  the  magnetization,  f  Is  the  body 
force,  c  Is  the  couple,  e  is  the  internal 


energy  density  and  r'"is  tf»e  heat  source.  The 
electromagnetic  field  is  governed'  by  the 
folio  wing  "Maxwell  equations: 

<2  E  +  - —  -  0  :  *  D,  .«  p  Z  , 

^  k'i  <*, 

M> 


where  £1J^8J  . 


Z«  n+'n-n  t  P  +  ,P “  P 

B(  -  ^oCH  +  Mv)  ., 


«  denote  the  permittivity  .and  permeability 
of  vacuum,  H,  is  the  magnetic -field  intensity, 

n,  p  are  charge  densities  of  fixed"  impurities 
and  n  ,p  are  the  equilibrium  concentrations 

of  electrons  and  holes,  respectively.  The,  re¬ 
lation  (2.5)  should  be  written  as 
Bi"  ^Hl+  th°  w®ak  r»<>Vtl’on  beam 

case,  but  as  Jp.  |«|M.  |  we  have  kept  the 
classical  expression  (2.5)4.  According  .to  the 
evolution  of  the  total  charge  we  are 

interested  only  in  the  balances  of  charge 
carriers  for  the  fixed  charge  .  ‘of  impurities  is 
rather  weakly  engaged  into  interactions.  Their 
forms  road  * 

P  ft  ♦  j/,  -  tr"-r" 

<2  6> 

p  p  +  J^,  -  ap-rp  . 


with  the  evolution  equations  of  electron  and 
hole  fluxes 


«  (2.7) 


■?  - 


J;<0. 


whore  U  ,  sP,T'n  and  rp  denote  the  afeneration 
and  recombination  of  the  electrons'  and  holes, 
respectively.  Since  the  dislocations  during 
interactions  as  well  as  the  .neutrons  can  move 
through  a  crystal  the  evolution  equations  for 
the  above  fields  take  the  forms 


5  +yj,  v*  (G)«0 

**  lj,j  ' 

n+t)k  -  JKO.O 

V  *ii<0>  ■ 0 

&k-  vo>  “  0 

VrUk.k‘  "u<c>  “ 

V1  ■ 

0 

(2  8) 

The  last  law  concerns 
equation 

tho  heat  flux 

evolution 

$<  -  Qv<0)  . 

(2  9) 

The  superimposed  dot  and  asterisk  denote  the 
material  and  Zaremba-Jaumann  time  derivatives, 

roopoatlvAly. 


Ve  must  _  'comment  the  equation  (2.8).  In  a 
case  of  4  high"  energetic  neutron  'beam  9KC>  is 
practically  negligible  Li  nuclear  reactions  do 
not  occur,  '  within  cascade  of  collisions. 
Otherwise  >91(0  does  hot  vanish  and  -M.  (C>  form 

-a  source,  of  additional  spins  coming  from'  the, 
produced/  neutrons*.  Then-  the*  relaxation  time  of 
the  neutron* field  for  the -hard  beam  tends  to 
zero  because  the  beam  Is-’ very  stable  and 
inertial  on  influences  from  outside.  This 
results  -  in  5fc(C>«0  and  means  -  that’  practically 

only  transport  features  <"of  that  field  should 
be  considered.  According  to  the  dislocation 
field  the  '  term  is  responsible-  for  -a 

production,  .of  dislocations  coming  from  the 
interaction ,  j>f  the  hard  neutron4,  beam  with  the 
Lattice/  If  we,  ‘however,  deal  with'  the  weak 
energetic  neutrons  the  magnetite  interactions 
dominate  ;ln.  processes,  '  A<C)  ’  describes 

couplings  with,  the  remaining  fields  in  the 
body  and  the  relaxation  time  of  the  neutron 
field  is  relatively  long.  According  to  ^'(C> 

it-  Is  practically  negligible  for  the  weak 
energetic  neutron  beam  4  interaction  as  a  source 
of  dislocations  and  can  occur  in  the  case  of 
elastic  deformation  as  well  as  for  couplings 
with  such  ‘  physical  fields  as  '  the 
electromagnetic,  thermal,  and  the  like. 

All  the  v  fundamental  laws  (2  2)- (2.8)  should 
not  contradict  the  second  law  of  thermodyna¬ 
mics.  Thus  all  the  considered  physical  pro¬ 
cesses  have  to  be  restricted  by  the  following 
entropy  inequality 

p  S  +  t,  -  -y-  i  0  <2.10> 

where  s  denotes  the  entropy  density  and  is 
the  entropy  flux. 

The  most  difficult  step  in  constructing  the 
theory  of  interactions  is  Its  constitutive 
part.  On  introducing  the  set 


^,Mk,c^,e,r,gn,B^,rn,rp,^  (or  9l),P(^<or  8k) 

-M’V'w-’V s  •  t 


we  look  for  the  constitutive  relations  in  the 
form 

Z  -  Z  <C>  .  (2.12) 

On  analyzing  the  entropy  inequality  toge¬ 
ther  with  <2  2>-C2.7>  with  the  help  of  Hu's 
thoorem  we  use  the  free  energy  and 
flux-potential  function 


«P=o-Ts,  K(  =  p  v  -P  *  T  5  (2.13) 

as  the  most  proper  ones  for  the  considered 
processes. 

As  a  result  we  obtain  general  definitions 
of  the  constitutive  relations  (laws  of  state) 
basing  on  the  free  energy  density,  the 
fluxllke  relations  leading  to  the  explicit 
form  of  the  entropy  flux  basing  on  tho 
flux-potential  function  and  a  residual 
inequality  which  the  kinetic  constitutive 
relations  can  be  defined  fiom  Some  particular 
examples  of  Interactions  are  analytically  and 
numerically  investigated. 
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'Abstract -This  paper  describes  .the  analyses  of  the  coupled 
electromagnetic-mechanical  problems  bn  the  dynamic  behaviors  of  a 
thin  plate  arid  a  shallow,  arch  under  moving V  transient' magnetic  field. 
Numerical  analyses,  considering  the  coupling  effect  between  magnetic 
.field  and  defection,  are  performed  for  the  evaluation  of  eddy  current, 
,  magnetic  field, '  force  and  vibration. 

V  INTRODUCTION 

Thin  electrically  conducting  plate  and  shallow  archare  used  in  high 
magnetic  field  machines;  They  receive  transient  and/or  impulsive 
electromagnetic  forces,  deflect  dynamically  and  sometimes  show 
magnctoclastically  unstable  behaviors. 

The  coupling  effect  of  a  beam  has  been  investigated  by  L  R  Turner 
1 1],  T.  Morisuc  (2),  S.  Matsuda  |3)  3nd  so  on.  They  treated  the  rigid- 
body  movement  and  thic  elastic  beam  deflection  under  uniform  time- 
changing  field.  Dynamic  responses  of  shallow  arches  under 
mechanical  forces  have  been  already  studied  (4J.  Humphreys  derived 
the  nonlinear  equation  of  motion  for  an  arch  under  an  impulsive 
mechanical  load  considering  an  axial  force  [4).  We  have  already 
shown  the  preliminary  results  for  the  dynamic  behavior  of  a  shallow 
arch  under  transient  electromagnetic  force  (5). 

In  this  paper,  we  show  the  numerical  calculation  procedure  for  the 
coupling  effect  between  electromagnetic  field  and  vibration  for  both  a 
thin  plate  and  a  shallow  arch. 

II.  NUMERICAL  ANALYSES  METHOD 

A  .EMyArenl  Analysis 

A  thin  elastic  isotopic  homogeneous  shell  with  electrical  conductivity 
o  and  magnetic  permeability  is  set  under  a  coil  field. 

From  the  Ohm’s  law,  the  following  equation  is  obtained, 

J*o(E*uxB)  (1) 

where  J  is  current  density  and  u  is  deflection  velocity  vector. 

Since  divj*0,  the  current  vector  potential  T  is  defined  as,  7«rot  T. 
Using  the  Faraday's  law  and  eq  (1),  we  get  the  following  equation. 

rot(rot  T)*-o  +  orot(uxJ5)  (2) 

We  divide  the  magnetic  induction  into  external  magnetic  induction 
B0  and  that  induced  by  eddy  current  Be  We  finally  obtain  the 
governing  equation  using  the  current  vector  potential  for  a  thin  plate 
considering  the  coupling  effect  as, 

r 


»  )p  a 

(3) 

K|>  8°«  +  dySl'  Si  St 

(4) 

where  6  is  the  lateral  displacement. 


B..  Deflection  Analysis 


The  governing  equation  for  the  deflection  of  the  thin  plate 
considering  the  Lorentz  force  is  expressed  as  follows: 

O(0  *  +  a/)  +  ^  'hBA  ■  hBJi 

(5) 

where  D,  h  and  p  are  bending  rigidity,  plate  thickness  and  mass 
density  respectively.  Coupling  with  -ddy  current  calculation,  we 
calculate  the  plate  deflection  based  on  the  finite  element  method  with 
the  nonconforming  triangular  elements  using  the  modal  analysis  [6J 

B.2.Arch<kftotfoo 

The  equation  of  motion  for  a  shallow  arch  was  given  by  _ Humphreys 
when  a  mechanical  impulsive  force  acts  [4].  By  adding  the  structural 
damping  term  to  the  Humphreys'  equation  the  following  non-linear 
differential  equation  can  be  written, 

E,W '  S&  bo  - 6)  +  P'1  ff  +  *  w‘l  (6) 

where  £/,$,>•(>,  p,  A,  C,  W  and  q  arc  flexural  rigidity,  resultant  axial 
force,  initial  arch  shape,  mass  density,  cross-section,  damping 
coefficient,  arch  width,  and  Lorentz  force  respectively. 

Using  six  vibration  modes,  we  obtain  the  nonlinear  equations  of 
motion  and  solve  these  equations  iteratively  using  Newmark 
method  coupled  with  eddy  current  calculation. 

HI.  RESULTS  AND  DISCUSSION 

In  this  study  we  performed  two  numerical  calculations. 

(1)  Dynamic  behavior  analysis  of  a  shallow  arch  under  a  pulse  coil 
field  (71, 

(2)  Dynamic  behavior  analysis  of  a  thin  plate  under  moving  magnetic 
field  (8). 

A.  Shallow  Arch 

An  arch  test  piece  is  set  with  both  ends  clamped  as  shown  m  Fig.l. 
The  length,  the  width,  the  thickness,  the  arch  radius  of  an  aluminium 
test  piece  arc  100, 20, 0  2  and  500mm,  respectively.  Fig  2  shows  the 
measured  dynamic  responses  of  the  arch  when  maximum  pulse  coil 
current  was  393A  (pulse  width,  about  4msec).  The  numerical  results 
for  the  response  with  and  without  considering  the  coupling  effect  are 
shown  in  Fig.3.  Vibration  behavior  (such  as  amplitude,  frequency, 
vibration  mode  etc.)  with  the  coupling  effect  almost  agreed  with  that 
of  Fig  2. 

Fig.4  shows  the  relation  between  the  coil  current  and  the  maximum 
deflection  Both  numerical  and  experimental  results  might  show  a 
kind  of  snap-through  buckling  behavior.  We  define  the  critical  current 
when  the  maximum  deflection  becomes  a  half  of  the  arch  height  (H). 
As  shown  in  the  figure  the  nume»ically  predicted  critical  current  with 
the  coupling  effect  was  within  the  scatter  of  experimental  ones. 
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Fig.l  Tbst  apparatus 


r:  radius  of  circular  coils  p:  pitch  of  coils  velocity  of  a  scries  of  coils 

fa  current  of  coils  r=0  025,  p=4r 
Fig  5  - Analytical  model 


Fig  2  Dynamic  response 

of an  arch  } 

(Experiment,/,;^  93  A)  * . 


(a)  With  coupling  term  (b)  Without  coupling  term 
Fig.3  Dynamic  response  of  an  arch  (Analysis,  /C*393A) 


(a)  With  coupling  terms  (b)  Without  coupling  terms 

Fig  6  Displacement  and  Lorcntz  force,  /C*0.5(MA) 


Fig.7  Coupling  effect 


the  effect  and  coil  velocity.  The  ratio  became  minimum  (O' 95)  at 
Vc»8( m/s)  and  it  was  less  than  one.  This  means  that  the  magnetic 
damping  coefficient  changes  as  a  function  of  the  coil  velocity  and  a 
negative  damping  effect  exists  at  a  certain  velocity. 


IiimIhI  until 
“  till 

riiiui  1 1 ■  1 1 1 ■  t 


M*xttwa  (am) 

Fig.4  Puls*,  coil  current  vs  maximum  deflection 

B  JSua  Plate 

A  thin  elastic  isotropic  homogeneous  copper  square  plate  was  set 
under  moving  magnetic  field  as  shown  in  Fig  5.  The  width  and  the 
length  of  the  plate  are  both  0  l(m),  The  thickness  of  the  plate  is 
0  l(mm).  It  was  fixed  at  two  sides  and  free  at  other  sides.  A  series 
of  four  moving  circular  coils  on  x-axis  generates  moving  field.  The 
coils  move  from  the  positions  of  solid  circles  to  those  of  broken 
circles.  Figs  6(a)  and  6(b)  shows  the  dynamic  response  of  the  plate  in 
the  case  of  the  coil  current  of  /C*0.5(MA)  and  the  coil  velocity  of 
Vc=4m/sec.  As  shown  in  the  figures,  when  we  considered  the 
coupling  effect,  the  amplitude  of  vibration  became  smaller  because  of 
magnetic  damping  effect  Fig.7  shows  the  relation  between  the  ratio 
of  maximum  displacement  with  the  coupling  effect  over  that  without 


IV.  CONCLUSIONS 

We  showed  a  numerical  analysis  method  of  dynamic  responses  of 
thin  shell  structure  under  magnetic  field.  The  method  was  applied  to 
the  dynamic  behavior  analysis  of  a  shallow  arch  under  a  pulse  coil 
field  and  a  thin  plate  under  moving  magnetic  field.  In  both  cases  we 
showed  the  differences  between  numerical  results  with  and  without 
the  coupling  effect.  Therefore  we  can  conclude  that  the  coupling 
calculation  is  quite  important  to  evaluate  the  dynamic  response  of  thin 
shell  structures  under  magnetic  field. 

We  wish  to  acknowledge  Mr.  K.  Ohtomo,  Mr.  S.  Matsuda  and  Mr. 
Y.  Mohri  for  doing  experiment  and  numerical  calculation. 
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Abstract-  The  buckling  characteristics  of  a  cylindrical 
shell  are  experimentally  studied  to  determine  the 

effects  of  impulsive. magnetic  forces,- and  compared,  to 
calculation  results.'  Dynamic  effects  including  the 
electroaagneto-mechanical  coupling  are  observed  at 
impulsive  loads  of  high  frequency  1-lOkHz. 

X.  INTRODUCTION. 

Large  impulsive  magnetic  forces  act  on  the  vacuum 

vessel  and  other  components  of  fusion  devices  -  during 

plasma  disrupt ions. Therefore,  the  buckling  problem  is 
one  of'  the  important  considerations  during  the  design 
period  of  such  devices.  However,  the  buckling 

behaviors  of  cylindrical  or  toroidal  shells  under 
Impulsive  magnetic  forces  is  not  well  known.  As  the 
first  step  towards  addressing  this  problem,  experiments 
and  analyses  have  been  done  on  cylindrical'  shells. 

II.  EXPERIMENTAL  DEVICE 

A  capacitor-bank  and  a  working  solonoidal  coil  are  the 
main  elements  of  the  experimental  apparatus  as  shown  in 
Fig.l  U).  Ignitron  switch  controllers  are  prepared  to 
make  impulsive  sinusoidal  half  exciting  current  waves 
as  shown  in  Fig. 2.  The  working  coil  is  lOOmm  in  length 
and  60am  in  diameter.  Two  kinds  of  capacitor-bank  are 
used;  C«1400jiF  (200pFx7)  and  18pF  (6nFx3).  The  coil 
current  peak  time  was  changed  from  At"0.025aS(10kHz)  to 
0.2mS(l. 25kHz)  by  adjusting  L  and  C. 

The  tost  specimens  are  cylindrical  shells,  53mm  in 
outer  diameter  and  100mm  and,60ma  in  length  with  a 
variety  of  thickness:  h»l,  0.5  and  0.14mm.  The 
cylindrical  shell  is  located  inside  the  coil.  Aluminum 
alloys  are  seleoted  as  test  materials  because  of  their 
high  eleotrio  conductivity  and  low  density,  whioh  are 
preferable  for  testing  deformation  and  buckling 
behaviors  of  the  shell.  Two  aluminum  alloys,  A1070-0 
and  A5052-H14,  aro  used  to  investigate  the  yield 
strength  dependence  on  buckling  behaviors.  The  0.2% 
yield  strengths  are  40MPa  and  220MPa,  respootivoly. 
The  eight  iteoa  are  measured  as  shown  in  Fig.l. 
avoiding  eleotromagnetically  induced  noise  and 
responding  to  high  speed  phenomena. 


differential  pick-up  method  with  a  couple  of  small 
, coils  that  are  placed  on  both  sides  of  the  cylindrical 
shell  as  shown  in  Fig.l.  However,  this  measurement  can 
not  be  applied  to  the  experiment  with  large  deformation 
of  the  shell.  The  experimental  data  of  an  induced 
one^turn  current  are  compared  to  calculated  results  as 
shown  in  Fig. 2.  .They  agree  well  with  each  other  as 
long  as  the. deformat ion  is  small. 

Eddy  currents  induced  on  the  cylindrical  shell  have 
been  calculated  with  MATEX  [2]  and  EDDYCUFF  (3),  that 
uses  a  finite  element  circuit  method  (the  network  mesh 
method  with  finite  elements)  with  the  normal  component 
of  the  current  vector  potential  Tn.  The  maximum 
induced  one-turn  current  is  -70%  of  the  maximum  coil 
ampere-turns  in  the  case  of  f-l. 25kHz,  h»lmm,  p»2.9nQca 
as  shown  in-  Fig. 2.  The-  calculated  results  are 
summarized  for  the  maximum  induced  one-turn  current  in 
Fig. 3.  The  eleotromagneto-mechanical  coupling  is  not 
considered  in  these  results. 


Fig. 2  Comparison  of  measured  and  calculated  one- turn 
currents  Induced  on  the  cylindrical  shell. 


III.  MEASUREMENT  AND  ANALYSIS  OF  EDDY  CURRENTS 
An  induced  one-turn  current  is  measured  by  a 


0Lo*j 


^  I/— 

(D  Votaxter 
!(V) 


Fig.l  Experimental  apparatus  and  measurement  system. 


Fig. 3  Calculated  ratio  of  induced  one-turn  current  to 
coil  current  as  a  function  of  coil  current  frequnoy. 
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IV.  DEFORMATION  AND* BUCKLING. 

Transient  deformation  during  the  loading  test  of  an 
Impulsive  magnetic  force  was  observed  by  a  high  speed 
camera.  The  pictures  in  Photo  1,  taken  every  100ns, 
show  the  deformed  shapes  of  the  cylindrical  shells.  The 
buckling  wave'  number  in  the  circumferential  direction 
seems  to  increase  as  the-  load  duration  becomes  short, 
such-as^a  typical  wave  number  n»>6r8  at  the -1.25-lOkHz 
dynamic  loads. 

The  residual  deformation  of.-the  cylindrical>3hells  was 
Investigated  '  after  the  loading  s  test.  An  example  of. 
buckled  cylindrical  shells  is  shown  in  Photo  2. 
Figures  4  and  5  show  the  buckling  loads  experimentally 
obtained  in  the -tests  of  the  A1070-0  and  A5052-H14 
shells,  respectively.  The  buckling  load3  are  shown  as 
a  function,  of  the  radius  to  thickne33  ratio  (a/h)  of 
the  cylindrical  shell,  withthe  parameter  of  impulsive 
current  frequency.  They  are’  compared  to  calculated 
results  of  elastic,  plastic  and  elastic-plastic 
buckling  for  static  loads.  Elastic-plastic  buckling 
analysis  has  been  carried  out  using  BOSOR  141..  Buckling 
of  A1070-0  shells, occurs  in  the'  plastic  range  as  shown 
in  Fig. 4.  On  the  other  hand,  buckling  of  A5052-H14 
shells  occurs  in  the  elastic  range.  Experimental 
results  are  higher  than  estimated  static  buckling  loads 
at  high  frequency. 

V.  DISCUSSION 

The  difference  between  the  experimental  and  calculated 
results  is  explained  by  combined  dynanio  effects. 
Structure  response  analysis  for  dynamic  loads  has  been 


Photo  1.  Transient  deformation  of  the  cylindrical  shell 
cross-section.  Time  step:6t«100j»s  (A5052-HM, 


f-1. 23kHz,  h«0.5ma,  EO-1400V, 


A5052-H14 
f-1. 25kHz 
h-lma 


Photo  2.  Appearance  of  the  buckled  cylindrical  shell. 


performed  and  the  dynamic  response  factor  is  calculated 
to  be  1.04-1.56  at  the  1.25~10kHz  Impulsive  loads. 
Figure  6  and  7  shows  the  dynamic  effects  of  the 
buckling  load  as  a  function  of  the  peak  load  time.  The 
dynamic  buckling  load  can,  be  predicated  by  the 
elastic-plastic  buckling  load  divided  by 'the  dynamic 
response  factor  as'  shown  in  Fig; 6  and  7.  In  the 
results,  the  large* difference  between  the  experimental 
data- and  the  predicated  dynamic* buckling; loads  at  high 
frequency  is  thought  to  be  due'  to  the  dynamic  effects 
of.  electromagneto-mechanical  coupling,  buckling 
strength  and  material  properties.  It  is  reported  that 
buckling  strength  and  material  properties  such  as  yield 
strength  change  at  short-duration, impulsive  loads. 

The  effect  of ,  electromagneto-mechanical  coupling  can 
be  calculated  from -deformation  data  derived’ from  high 
speed  pictures,  such  in  Photo’ 1.  It  is  estimated  that 
the  magnetic  pressure  on  the  cylindrical  shell  can  be 
reduced  10-50%  by  the  coupling  effect  in  typical 
buckling  conditions. 

VI.  CONCLUSION 

Buckling  characteristics-  of  a  cylindrical  3hell  are 
experimentally  obtained  for  impulsive  electromagnetic 
loads  of  10kHz  and  1.25kHz.  Buckling  loads 
experimentally  observed  are  mush  higher  than  those 
expected  from  static  buckling  analysis  and  dynamic 
structure  response  analysis  at  impulsive  loads  of  high 
frequency.  This  difference  can  be  explained  by  the 
olectromagneto-mechanical  coupling  and  dynamic  effects 
of 'buckling  strength  and  material  properties. 
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Fig. 6  Dynanio  effects  of  impulsive 
electromagnetic  buckling  as  a  function 
of  peak  load  tine  (A1070-0). 
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Fig. 7  Dynamic  effects  of  impulsive 
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Fig. 4  Impulsive  electromagnetic  Fig. 5  Impulsive  electromagnetio  electromagnetic  buckling  as  a 

buckling  characteristics  (A1070-0;.  buckling  characteristics  (A5052-H14).  of  peak  load  time  (AS052-H14). 
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MAGNETO-ELASTIC  BUCKLING  OF  SUPERCONDUCTING  COILS 
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Abstract  Systems  of  superconducting  coils  can  buckle  whenever  the 
electric  current  through  the  coils  exceeds  a  certain  critical  value. 
A  variational  method  which'  uses  as 'admissible  magnetic  fields  the 
so-called  Biot-Savart  fields  is  presented.  An  evaluation  for  slender 
systems  of  curved  beams  is  given.  The  method  is  applied  to  a  con* 
.ductor  in  the  shape  of  a  flat  spiral  coil. 

INTRODUCTION 

A  superconducting  body  in  the  shape  of  a  slender  curved 
beam  is  placed  in  a  vacuum  and  carries  a  prescribed  current  /$.  The 
own  field  of  the  conductor,  generated  by  A>,  causes  large  Lorentz 
forces  between  the  seperate  parts  of  the  conductor.  Due  to  this  the 
system  can  become  mechanically  unstable  and  buckle  whenever  /o 
becomes  too  large  (l9  >  Ax*)*  In  a  series  of  earlier  papers,  (1]  ■ 
(3),  the  present  author  together  with  his  co-workers  presented  two 
methods  for  the  calculation  of  the  buckling  current  / The  first 
one  is  a  variational  method,  the  second  one  a  more  direct  one  start* 
ing  from  a  formula  for  the  Lorentz  force  on  a  conducting  slender 
coil  that  is  derived  from  the  law  of  Biot  and  Savart  in  a  way  as 
shown  In  (4),  Sec.  2.6.  Both  methods  have  been  applied  to  calculate 
Ax*  for  systems  of  parallel  rods  and  rings  ((2)  *  (3)).  Comparison 
of  the  results  shows  that  under  certain  (moderate)  restrictions  the 
agreement  between  the  two  methods  is  good.  Since  exact  fields  are 
needed  for  the  variational  method  as  employed  in  our  earlier  papers 
this  is  a  mathematically  exact  but  also  very  complicated  method. 
On  the  other  hand  the  Biot*Savatt  method  is  less  precise  but  easier 
in  use.  An  approach  that  combines  the  advantages  of  both  methods 
is  presented  here.  This  combined  method  is  a  variational  one.  How* 
ever,  in  this  formulation  an  admissible  set  of  fields  is  chosen  on  the 
basis  of  the  law  of  Blot  and  Savart.  Since  the  differences  between 
our  two  previous  methods  were  so  small  it  seems  logical  to  expect 
that  the  combined  method  will  also  yield  good  results.  Some  of  our 
results  indeed  show  that  the  correspondence  becomes  even  better 
when  using  the  combined  method.  The  combined  method  becomes 
extremely  simple  when  applied  to  alender  spiral  or  helical  coils.  An 
example  of  such  a  calculation  will  be  given,  but  first  we  shall  present 
the  main  lines  of  the  variational  method  and  show  how  this  leads  to 
the  combined  method. 


MAIN  LINES  OF  VARIATIONAL  METHOD 

The  fundamental  quantity  in  the  variational  method  as  pre¬ 
sented  in  (l)  Is  the  functional  Jt  which  is  equal  to 

(1) 

where  W  is  the  elastic  energy  of  the  deformed  (buckled)  body  and 
(/<o  i*  the  permeability  In  vacuum;  we  use  summation  convention 
and  ^  =t  djdii) 

IlK  =  “  /  (( t+BMAD.ViK  +  W,)\dS.  (2) 

according  to  (3),  Eq.  (2.22)  (here,  we  have  already  neglected  some 
terms  that  become  small  for  slender  systems).  In  (2),  dC  represents 
the  boundary  of  the  conducting  body  and  N  is  the  unit  normal  on 
St.  Furthermore,  U  is  the  displacement  in  buckling  of  the  conductor, 


B  is  the  rigid-body  magnetic  field  in  the  vacuum  Gt  and  $  is  the 
perturbed  (due  to  U)  magnetic  potential  in  G*.  The  displacement 
U  must  be  chosen  in  such- a  way -that  it' is  characteristic  for  the 
deflection  of  the  specific  slender  body  under  consideration  and  for 
the  type  of  buckling  that  is  assumed  to  take  place”.  The  field  B  and 
tho  potential’  ^  bave  to  satisfy  the  constraints  (fields  that  satisfy 
these' constraints  are  called  admissible) 

i)  curl  B  =  O  ,  x£G,  B-*0-,  |xj-*oo, 

(and  Ampbre's  law)  j(Btt)ds  =  /i0A>  »  C£G+\  (3) 
C 

ii)  ^  =  0,'xeG^  ;  ^  -*  0  ,  |  x  J-*  oo.  (4) 

Our  variational  principle  as  presented  in  e  g,  (3)  states  that 
the  first  variation  of  J  with  respect  to  admissible  U,  B  and  tf>  must  be 
zero.  On  this  basis  we  proceed  as  follows,  we  first  choose  admissible 
fields  B  and  ^  and  thereafter  we  take  the  first  variation  of  J  with 
respect  to  U  equal  to  zero.  This  leads  us  to  an  eigenvalue  problem 
from  which  Io<r  can  be  calculated. 


ADMISSIBLE  FIELDS  FOR  SETS  OF  PARALLEL  STRAIGHT 
CURRENT  CARRIERS 

Consider  one  Infinitely  long  straight  current  carrier  (with  e* 
along  tho  axis  and  x  a  zej  +  ye»  +  re*).  Let  R  bo  a  characteristic 
measure  of  length  for  the  cross-section  of  the  rod  and  suppose  that 
the  rod  is  periodically  supported  over  lengths  l.  The  rod  is  called 
slender  if  0  <  6  .»  RJl  <  1.  We  only  consider  slender  rods  and 
we  neglect  terms  that  are  0($*)  with  respect  to  unity.  Finally,  we 
assume  that  the  rod  buckles  in  the  e»  direction,  so  that  U  »  l/(z)ci 
Application  of  the  law  of  Biot  and  Savart  to  the  undeformed 
(for  B)  and  to  the  linearly  deformed  rod  (for  t&)  yields  the  following 
two  admissible  fields 

nfv.-  ... 

<6> 

It  can  be  easily  checked  that  B  and  satisfy  the  constraints  (3)  and 
(4),  however  0  only  up  to  0(^’)-term$. 

For  a  set  of  n  parallel  rods  the  admissible  fields  can  be  found 
from  (5)  and  (6)  by  simple  superposition.  Using  these  fields  in  (2) 
we  obtain  for  a  set  of  n  equidistant  rods  (distance  between  two  rods 
is  2a,  a  <  /) 

>1* = ("(0w  -  .  in 

•*'  jj*  -i 

where  U M  is  the  lateral  deflection  in  the  plane  of  the  rods  (i.e.  in 
the  ei-dircction)  of  the  i,K  rod,  and 

*  -  Mm)  =  Hpjpjr-  =  I  +  0(m>),0  <  m  :=  g  <  >  ,  (8) 

for  a  circular  cross-section ,  radius  R. 

For  IF  we  can  use  the  familiar  expression  for  the  elastic  bending 
energy  of  a  slender  beam.  For  n  s  2  this  yields  a  value  for  Ax* 
which  is  just  m  between  the  values  from  the  Diot-Savarl  method 
(for  which  ^  =  1)  and  the  exact  variational  method  (cf  (2)) 


SLENDER.  CURVED  BEAMS 

Let  us  consider  a  superconducting  cojl'In'the  shape  of  a 
slender  curved  beam  like  a  (flat)  spiral  or  a  (straight  or  toroidal) 
helix.  We  call  the  total  structure  slender  if  the  distance  2 a  between 
two  branches  is  much  smaller  then  some  global  measure  of  length  b 
of  the  structure  (e  g.  a  radius  of  curvature  of  the  central  line  or  the 
total  length  of  the  coil).  Tike  two  distinct  branches  (or  turns)  of  the 
spiral  or  helix,  let  Pi  be  a  point  on  one  branche  and'Pj  that  point 
on  the  other  branche  that  has  the  shortest  distance  to  Pi.  For  the 
structures  we  consider  here,’ the  tangent  lines  on  the  central  lines  in 
Pi  and  Pj  are  (nearly)  parallel.  For  slender  systems,  ie„  under,  the 
neglect  of  terms  of  0(a/h)  with  respect  to  unity,  this  has  the  following 
consequence*  The  contribution  to  the  P-integral  from  a  line»element 
in  Pi  and  due  to  the  interaction  with  the  other  branche  through  Pj 
is  equal  to  the  one  found  for  the  case  of  two  straight  parallel  rods 
through  Pi  and  P2  and  tangent  on  the  curve. 

This  statement  was  already  checked  in  (3)  for  systems  of  two 
rings  and  there  proven  to  be  true  for  small  values  of  (a/6),  where  6 
U  the  radius  of  the  ring.  With  this  we  now  have  the  disposal  of  an 
easy  algorithm  to  calculate  the  AMntegral  for  such  complex  systems, 
as  a  helical  or  spiral  conductor. 


SPIRAL  CONDUCTOR 

A  flat  spiral  of  n  turns  is  given  by  the  following  relation  for  its  central 
line 

b(<fi)=Rt+hY>,  0£^2*n. 

The  distance  between  two  adjacent  turns  Is 
2a=*%  +  2*)-6(¥,)32*ft  * 

The  system  Is  called  slender  if 

0  *}i  s'  i 

v*c(o.M  11  6(v>)  <  1  * 

When  relevant,  we  take  the  cross  section  circular,  radius  R(R  <  a). 

We  assume  that  the  coil  buckles  in  its  plane  and  that  the 
pertinent  displacement  is  given  by  (in  polar  coordinates) 

U  a  U(v>) 13  U(v>)er  +  ■  O2) 

The  spiral  Is  taken  inextensible,  yielding 

V(v)  +  V(v)*  0  .  (<=A).  (13) 

The  elastic  energy  due  to  in  plane  bending  is  then  (El  is  the  bending 
stiffness) 

r»p» 

W  =  \ei  j  —(«“(!>)+  V(v)\'i9 ,  <H) 

A  value  for  the  A'- integral  can  be  obtained  from  the  assumed 
analogy  with  formula  (7).  To  this  end  we  Introduce  for  v>  €  £»*»*  € 
H,n),  where  Ci  stands  for  the  »lfc  turn  of  the  spiral,  the  pole  angle  9 

by 

+  *  ,  0  <  0  <  2r  , 
and  the  displacement  of  Ct  by 

V(v)  a  V®(9)  =  U(9  +  2x(i  -  1)) , 
and,  analogously, 

%)  =  6,(*)s6(0  +  2*(i-l))  = 

=  «o{l  +  ^l2,(<-l)  +  «]}  • 


(15) 

(16) 

(II) 


C>> 

(10) 

(H) 


Replacing  in  (7).  dz  -*  bt($jd$  and  a-*  xh  we’ thus  obtain 

(IS) 

where  the  integral  must  be  taken  with  respect  to 

To  obtain. a  numerical  value  for  lot r  we  have  followed  the 

following  procedure. 

i)  We  have  discretized  the  displacement  U(<p)  in  N  -f  1  points,  the 
nodal  displacements  being  Uk  ,  k  €  (0,AT),  with  K#  =  Un  =  0 
(the  spiral  coil  is  simply  supported  in  begin  and  end  points). 

ii)  We  have  calculated  the  first  variation  6J  of  J  with  respect  to  {/* 
and  put  this  equal  to  zero,  yielding  a  linear  eigenvalue  problem 
for  Uk  of  the  form 

Ari(W  =  0  ,  (19) 

iii)  The  lowest  eigenvalue  I0  (following  from  det  A  =  0)  is  the  looked 
for  buckling  current  /q. 


We  worked  out  this  scheme  for  a  spiral  coil  for  which  A/l?o  = 
0  01  and  for  some  values  of  n.  The  results  are  shown  in  the  Ibble 
below  in  which 


*  Po/o^Po 
48  xWEI’ 


(20) 


More  numerical  results  will  bo  published  in  the  forthcoming  paper 

(5). 


TABLE  1.  Normalized  buckling  currents  for  h/R9  a  001  and  for 
some  numbers  of  coils. 


n  |  2 

3  | 

4 

5 

A.  10a  |  16  30 

8.40  | 

5.19 

3.50 
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Abstract 


The  transient  analysis  of  a  coupled  electro-mechanical  system 
Is  undertaken.  The  system  consists, partly  of  spatial  regions, 
which  may  support  magnetic  fields,  that  are  modelled  by  finite 
elements  The  regions  may  be  attached,  to  external  electrical 
sources  and  circuits,  and  may  also  be  capable  of  rigid  body  mo¬ 
tion  with  respect  to  one  another  A  method  for  coupling  the 
electric  circuit  transient  equations,  transient  magnetic  field  fi¬ 
nite  element  equations,  and  the  transient  mechanical  motion 
equations  is  described.  Only  the  external  source  variation  is 
assumed  to  be  known;  all  other  field,  circuit,  and  mechani¬ 
cal  motion  quantities  are  treated  as  unknowns  and  calculated. 
Equations  for  transient  analysis  of  a  general,  2-dimcnsional, 
planar,  non-lmear,  voltage-excited  system  arc  derived  in  detail 
The  Golerkin  formulation,  time-discretization,  and  lineariza¬ 
tion  of  these  equations  are  presented.  The  resulting  global 
system  of  coupled  clcctro-mechanical  equations  is  assembled 
and  investigated  Included  arc  examples  of  the  application  of 
the  proposed  method  to  perform  transient  analysis  of  practical 
coupled  electro-mechanical  systems. 

1  Introduction 

The  standard  procedure  of  using  finite  element  analysis  to  ap¬ 
proximate  magnetic  field  quantities  within  a  fixed  device  or 
region  is  well  known.  The  user  describes  the  problem  geome¬ 
try  and  material  characteristics,  sets  the  boundary  conditions, 
and  specifies  numerically  all  current  densities,  which  act  as  the 
source  of  the  magnetic  field.  The  region  of  interest  is  then 
discretized  in  space  into  a  mesh,  and  the  finite  element  field 
approximation  equations  are  set  up  and  solved.  The  solution 
consists  of  a  set  of  approximations  for  the  field  potential  at 
each  node  of  the  mesh. 

Such  a  procedure  is  inadequate  for  a  large  class  of  practical 
problems,  however.  Consider  the  transient  analysis  of  an  elec¬ 
tromagnetic  device  which  is  actuated  by  a  voltage  (or  current) 
source,  such  as  a  transformer,  motor,  or  actuator. 

The  voltage  (current )  source  for  such  devices  is  time-dependent, 
therefore,  one  cannot  specify  a  prion  the  numerical  value  of 


the  current  density  in- the  conductive  regions  of  the  device, 
because  skin  effect  and  eddy  currents  cause  the  current  den¬ 
sity,  to  vary  with  time  and  position  within  the  conductor.  The 
standard 'finite  element  procedure,'  however,  requires  current 
density  as  a  known  input  to  the  analysis. 

In  addition^  it  may  be  necessary  to  attach  lumped  cir¬ 
cuit  components,  such  as  resistance  or  inductance,  between 
the  voltage  (current)  source  and  the  region  to  be  modelled  by 
finite  elements.  The  lumped  components  may  represent  the  in¬ 
ternal  impedance  of  the  voltage  (current)  source,  or  they  may 
be  used  to  approximate  the  effects  of  the  parts  of  the  device 
which  are  outside  the  region  modelled  by  finite  elements.  The 
corresponding  transient  circuit  equations  must  be  coupled  with 
the  transient  finite  element  field  equations. 

Moreover,  in  the  case  of  the  motor  or  actuator,  there  are 
movable  mechanical  components.  Magnetic  forces  determine 
the  position  of  these  components,  and  the  positions,  in  turn, 
affect  the  magnetic  field  withm  the  device.  Provisions  must 
be  made  for  the  transient  modelling  of  such  a  coupled  electro¬ 
mechanical  system. 

Therefore,  a  method  for  the  proper  coupling  of  transient 
fields,  circuits,  and  motion  must  be  such  that:  (a)  only  termi¬ 
nal  voltage  (or  total  terminal  current)  applied  to  the  device  is 
required  as  a  known  input  quantity,  and  total  terminal  current 
(terminal  voltage)  is  calculated  as  an  unknown,  (b)  the  tran 
sient  external  circuit  equations  that  model  electrical  sources 
and  circuit  components  are  coupled  to  the  finite  element  field 
equations,  and  (t)  equations  for  mechanical  motion  are  coupled 
to  the  finite  element  field  equations. 

This  paper  presents  a  method  that  fulfills  these  three  ob¬ 
jectives  The  method  was  developed  by  Istfan  (2)  and  applied 
by  Palma  (3j.  The  following  sections  detail  the  method  First, 
the  supporting  electromagnetic  and  mechanical  theory  will  be 
summarized,  and  the  Galerkin  formulation  of  the  field  equa¬ 
tions  will  be  shown  Next,  the  time-discretization  of  the  field, 
circuit,  and  mechanical  equations  will  be  presented  Then  the 
field  and  mechanical  equations  will  be  linearized  Finally,  the 
global  system  of  coupled  electro-mechanical  equations  will  be 
assembled  and  investigated  The  paper  concludes  with  exam 
pies  of  the  application  of  the  proposed  method  to  perform  tran 
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*ent  analysis  of  practical  coupled  deciro-mechamcal  systems,  3  Time  Discretization 


2  Electromagnetic  and  Mechanical 
Theory 

The  electromagnetic  field  theory,  electric  circuit  theory,  arid 
basic  mechanical  motion  theory,  upon  which  the  proposed  method 
is  based,  arc  summarized  below. 

2.1  Summary  of  Equations. 

The  equations  necessary  for  transient  coupling  of  field,  circuit, 
and  motion  equations  have  been  derived  from  baric  theory. 
They  we  summarized  below: 


Field  Equation 


„  „  .  H  dA 

V  Xi/Vxd  =  d-r- o  — 
l  oi 


Total  Current  Equation 


Series  Bar-Coil  Equation 


Parallel  Coil  Equation 


In  this  section,  the  field  equation,  total  current  equation,  cir¬ 
cuit  equations,  and  the  mechanical  equations  of  motion  are 
discretized  in  the  time  domain. 

The  method  of  time^£K7eUzation  used  here  is  based  on  the 
following  equation: 

« 

The  value  of  the  constant  &  determines  whether  the  algorithm 
is  of  the  forward  difference  type  {fi  —  0),  backward  difference 
type  —  1),  or  some  intermediate  type  (0  <  fi  <  1).  Note 
that  if  &  =  the  Crank-Mcholson  Method  is  implemented. 

The  goal  is  to  solve  for  {A}HM.  The  derivatives  + 
and  are  unknown. 

4  Linearization 

The  field  equation  and  the  acceleration  equations  are  non¬ 
linear  functions  of  vector  potential,  A,  and/or  component  dis¬ 
placement,  x.  These  equations  must  be  linearized  before  they 
can  be  combined  with  the  other  equations  of  the  system  in  a 
general  global  system  matrix  equation.  The  linearization  of 
the  field  and  acceleration  equations  is  accomplished  using  the 
Kcwton-Raphson  method. 


v,  =  i?.{i}T{/}t+i,{i}T|^|  +K.  5  Global  System  of  Equations 


Mechanical  Acceleration  Equation 
dv 

mdt  +  = 

Mechanical  Velocity  Equation 


Noie  that  the  first  three  equations  are  coupicd  by  the  volt¬ 
age  applied  to  the  finite  element  region,  V*'»  the  second,  third, 
and  fourth  equations  t  te  coupicd  by  the  conductor  currents, 
and  the  first,  second,  and  fifth  equations  arc  coupicd  by  mag¬ 
netic  vector  potential  A. 

The  portion  of  the  problem  to  be  analyzed  with  finite  ele¬ 
ments  must  be  discretized  in  space,  i.e.,  meshed.  The  Galcrkin 
method  is  used  to  approximate  the  field  and  current  equations 
in  discretized  space.  The  Galerkm  method  belongs  to  a  class 
of  approximation  techniques  called  the  methods  of  weighted 
residuals. 


The  field,  circuit,  and  mechanical  equations  are  now  available 
in  a  discretized  and  linearized  form.  It  remains  to  assemble 
these  matrix  equations  into  a  global  system  of  equations  de¬ 
scribing  the  entire  problem. 

5.1  Assembly  of  the  Global  System  of  Equa¬ 
tions 

FYom  the  summary’  above,  it  is  apparent  that,  in  general,  there 
are  five  vector  unknowns: 

{AA}**f f  .  ch&np’  in  vector  potential  of  each  node 
:  change  in  voltage  across  each  bar 
{A/}^jk+|  .  change  in  current  in  each  coil 

*  change  in  parallel  terminal  voltage 
{Ax}j^f  .  change  m  position  of  each  movable  component 
The  global  system  matrix  equation  may  then  be  set  up  in 


or,  ja  expanded  form. 


Mu  Mu  Mu 

f+Ai 

{*,} 

Mfj  Mu  Mu 

{AV,} 

{*.} 

hiu  Mu  Mu 

{A/) 

=  • 

.{*>} 

ML  Mu 

(4*3 

{*.) 

.  MI*  . 

{Ai} 

1 '{«•,) 

This  global  system  of  equations  must  cox  be  solved  for  the 
unknown*,  {/}. 

6  Examples 

Tbe  formulation  above  has  bora  applied  to  numerous  cases  in 
order  to  check  its  validity.  Some  examples  are  given  below 
wfoch  illustrate  the  features  of  coupling  fields,  circuits,  and 
motion- 


FLUX  COJXPOTCMTXM.  LINES  AT  T  •  33.69  as 
POT  OlfEtagNCC  BETWEEN  LINES  2  8 «  iq/aM 


fLVJX  COUIWENTIAL  WINES  AT  T  -  J»  *7  •» 

pot  oirycfiCNCE  ecTxctw  lines  z  <e 2  «o/**l 


6.1  Induction  Motor 

An  example  is  a  three  phase  souirrel  cage  induction  motor. 
The  motor  is  rated  at  5  Hp.  In  this  example  the  entire  mo¬ 
tor  winding  is  represented.  The  inputs  are  the  instantaneous 
voltages  at  the  three  terminals.  The  current  in  the  windings 
is  unknown.  The  rotor  is  free  to  turn-  Each  rotor  bar  is  rep¬ 
resented  as  an  independent  circuit  connected  to  an  aid  ring 
which  has  .a  constant  resistance  and  inductance.  The  mesh  in, 
the  air  gap  may  or  may  not  be  remeshed,  depending  on  the- 
distortion  of  the  dements.  In  any  case  the  remeshing  is  done 
such  that  the  number  of  nodes  and  elements  remains  the  same. 
The  sequence  of  plots  in  Figure  1  shows  the  motor  operating 
at  full  load  at  various  positions  in  the  cycle. 

7  Summary  and  Conclusions 

A  new  method  for  the  transient  analysis  of  coupled  electro¬ 
mechanical  systems  has  been  presented.  The  principal  fea¬ 
tures  of  the  method  are:  (a)  only  terminal  voltage  (or  total 
terminal  current)  applied  to  the  device  is  required  as  a  known 
input  quantity,  and  total  terminal  current  (termiual  voltage) 
is  calculated  as  an  unknown;  (b)  the  transient  external  circuit 
equations  that  model  electrical  sources  and  circuit  components 
are  coupled  to  the  finite  element  field  equations;  and  (c)  equa¬ 
tions  for  mechanical  motion  are  coupled  to  the  finite  element 
field  equations. 

The  chief  attraction  of  using  the  method  proposed  here  to 
develop  models  for  elecro-mechanical  devices  is  that  the  be¬ 
havior  of  the  mathematical  models  is  determined  by  the  same 
set  of  laws  that  govern  the  actual  devices.  The  models  faith¬ 
fully  reproduce  the  fact  that  most  actual  devices  receive  in¬ 
put  solely  from  external  electrical  sources.  The  interaction  of 
fields,  circuits,  forces  and  motion  takes  place  mathematically 
in  the  model  just  as  it  docs  physically  in  the  actual  device 
No  artificial  assumptions  about  source  current  density,  device 
inductance,  or  un-couplcd  mechanical  motion  are  made. 

The  proposed  method  is  also  attractive  in  a  numerical  sense. 
Care  has  been  taken  to  arrange  system  equations  in  such  a  way 
that  the  global  system  matrix  is  sparse  and  symmetric,  thereby 
allowing  the  use  of  efficient  storage  techniques.  In  addition,  the 
global  system  matrix  has  been  made  positive  definite,  so  that 
the  system  can  be  solved  by  a  variety  of  advanced  techniques 


Figure  1:  Induction  Motor:  Equipotcntiel  Plots  at  Different 
Instants  of  Time 
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Abstract  -  Potentials  and  gradients 
of  the  electric- field  in  a  region  occupied  by 
distributed  sources  can  be  expressed  as  some 
improper  integrals.  The  paper  presents 
formulae  for  their  calculation.  The  solution 
can  be  used  In  axi-symaetrical  cases. 


1  1 


a(u,v) «J(u,v) 

0  0 


y-yo 

r2(u,v) 


dudv. 


(4) 


where: 


I.  INTRODUCTION 

Potential  and  field  due  to  distributed 
sources  can  be  calculated  by  integrating  the 
Coulomb's  formulae.  The  integrals  become 
improper  for  the  calculation  of  potential  and 
field  at  points  within  the  area  occupied. by 
the  distributed  sources.  The  region  with  the 
sources  can  be  discretized  using  quadrilateral 
elements. 

In  the  following  it  Is  assumed  that  the  area 
with  the  sources  is  formed  by  a  quadrilateral 
P0P1P2P3'  potential  and  field  being  calculated 
at  one  of  its  corners  (P0). 


p3 

p2 


p0 


Pig.  1.  A  quadrilateral  element  covered  with 
distributed  sources  of  density  <r(P). 


Introducing  transformation  of  a  unit  square 
0  £  u,v  £  1  onto  quadrilateral  PqPip2p3 : 


x  *  axu  +  bxv  +  cx  ♦  dxuv 

y  ■  OyU  +  byV  ♦  Cy  ♦  dyUV 

potential  and  field  at  point  Pq: 

1  1 


(1) 


V  ■ 


-1 

2k£a 


o(u,v) *j:u,v) *ln  r(u,v)  dudv  (2) 


and 


0  0 
1  1 


If'""' 


v) *  J ( u, v) 


x-x0 

r2(u,v) 


dudv  (3) 


r2(x,y)  ■  (x-x0)2  +  (y-y0)2 

J(u,v)  :  jacobian  of  the  transformation. 

The  integeral  corresponding  to  V  and 
components  of  the  integrals  corresponding  to  B 
may  be  represented  in  a  general  form  as: 


f (u,v) >h(u,v)  dudv  : 


0  0 
1  1 


m£(u,v) 
g(u,v) 


0)  *g(u. 


h(u,v)  -  h(0,0)  *g(u,v)  dudv  + 


♦  h(0,0) 


1  1 


fj 


g(u,v)  dudv  ■  Ij+  I2 


(5) 


where 

h(u,v)  ■  -  o(u,v)  «J(u,v) 

2*c0 

g(u,v)  :  a  function  that  may  be 
integrated  analytically 
and  such  that  the  expression 


rf<u,v) 

p{u,v)  ■  I -  h(u,v)  -  h(0,0)  «g(u,v)  (6) 

l^(u,v) 


is  bounded  and  integrable  over  the  unit 
square  so  that  the  numerical  Integration 
can  be  applied  to  calculate  the  first 
Integral  Ij  in  (6). 

For  the  calculation  of  V  and  E  the  auxiliary 
functions  £(u,v)  and  g(u,v)  will  be  discussed 
separately. 


II.  CALCULATION  OP  POTENTIAL  V 
Let  the  following  functions  be  defined: 

f(u,v)  *  -0.5* In  l(axu  ♦  bxv  ♦  dxuv)2+ 

(ayu  ♦  by v  ♦  dyuv)2J  (7) 

g(u,v)  ■  -0.5»ln  (u2  ♦  v2)  (8) 
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It  can  be  proved-^that  function  p(u,v) 
obtained  by  substituting  (7,8)  in  (6)  has 
the  following  property: 

Property  P:  Function  is  continuous  on  the  unit 
square  with  the  exception  of  point 
(0,0)  in  which  it  Is  bounded  only. 

Integral  I2  defined  in  (6)  can  be 
calculated  analytically: 

_X2=  0.25*h(0,0) • (6-x-2?ln2)  (9) 


III.  CALCULATION  OP  GRADIENT  E 
Let  the  following  functions  be  defined: 
d(u,v)  «  (axu  t  bxv  t  dxov)Jt 
f  (ayU  t  byV  ♦  dyUV 1‘ 
e(u,v)  -  (axu  t  bxv)2+  (ayti  +  byv)2 
Substituting  (1)  In  (3)  yields:. 

1  1 

n-(a_u*bxv*dxuv) 

h(u,v>  - ? -  dudv  - 

d(u,v) 


0  0 

where: 


-  ^ax  +  bx*bx  *  dx*dx  <10> 


h(u,v)  -  dudv 

d(u,v) 


X  X 

I  bx  “II  Mu,v)  - ““ 

Dx  J  J  d(u,v 


-uv  .  . 

h(u,v)  -  dudv 

d(u,v) 


For  the  calculation  of  Iax  the  following 
functions  are  used: 


Function  p(u,v)  obtained  by  substituting  (14) 
In  (6)  has  property  P  defined  In  II. 

Integral  I2  in  (6)  can  be  calculated 
analytically: 

I2  -  h( 0, 0) <S*R  (15) 

where:  ■  denotes  scalar  product; 

vectors  R,  S  are  given  below: 


In  A 

0 

In  B 

1/ ( 2A) 

In  (A+B+2C) 

S1* 

-1/(2A) 

atan  I(B+C)/D) 

-1/D 

atan  (C/D) 

(A-C)/(AD) 

atan  ((A+O/Dl 

C/UD) 

Similar  considerations  can  be  applied  to 
calculate  Ibx .  The  corresponding  Integral  I2 
in  (6)  is  calculated  analytically  from: 

I2  *  h(0,0)*T*R  (16) 

Vector  T  is  given  as:  1/(2B) 

0 

-1/ (2B) 

7  C/(BD) 

(B-C)/(BD) 
-1/D 

Function  under  Integral  I<jx  (13)  also  has 
property  P' defined  in  II. 

For  rectangular  elements  of  constant  source 
density  coefficients  dx,  dy’and  Integrals  I2 
vanish.  Potential  and  gradient  are  calculated 
according  to  formulae  (9)  and  (10,15,16). 

For  non-rectangular  elements  and/or.  elements 
of  variable  source  density  all  the  integrals 
have  to  be  calculated. 


IV.  NUMERICAL  EXAMPLE 

Let  a  quadrilateral  element  be  considered: 
P0(0.0),  Pi  (1.5.  0),  P2(l.  1).  P3(0.5.1) 

Then  the  following  values  of  potential  and 
gradient  at  (0.0)  can  be  calculated: 


a)  1.988415E-01 

b)  2.004412E-01 

c)  2.002138E-01 


Ey 

(-1.212671.  -7.244254E-01) 
(-1.212815.  -7 . 223202E-01 ) 
(-1.194719.  -6.990175E-01) 


a)  -  calculated  according  to  formulae  (5,  9) 
or  (5,  10,  15.  16)  and  Gauss  quadrature  of 
function  p(u.v)  over  the  whole  element; 

b)  -  as  a)  but  Gauss  quadrature  with  subdivi¬ 
sion  of  the  unit  square  into  64*64  smaller 
ones; 

c)  -  direct  Gauss  quadrature  over  the  whole 
element  without  taking  into  account 

the  infimtness  of  the  integrated  functions 
at  (0,0) 


V.  CONCLUSIONS 

Differences  in  calculating  potential  due  to 
distributed  sources  within  the  area  occupied 
by  them  ore  rather  small.  When  calculating 
gradients  these  differences  may  reach  3%. 


1.  Lean  M.H  and  Wexler  A. : "Accurate  Numerical 
Integration  of  Singular  Boundary  Element 
Kernels  over  Boundaries  with  Curvature". 
Int.  Journ.  for  Num.  Heth.  in  Eng.,  1985 


1611 


FEMAG  -  AN  INTERACTIVE,  MENU  DRIVEN  PACKAGE  FOR  CAD 
OF  ELECTRICAL  MACHINES  AND  DEVICES 

K.  Reichert,  J«  Skoczylas,  T.  Tarnhuvud 

Swiss  Federal  Institute  of  Technology, (ETH),  Dcpartement  of  Electrical  Machines, 
CH-8092  Zurich,  Switzerland 


Abstract  -  FEMAG  is  the  powerful,  interactive  finite  ele¬ 
ment  (FE-)  based  electromagnetic  field  analysis  system  for  work¬ 
station  computers, especially  developed  for  the  CAD  of  electrical 
machines  and  devices,  with  two-dimensional  and  axisymelrical, 
static  and  steady-slate  nonlinear  magnetic  field  problems.  The 
design  of  the  package  is  based  on  industry  requirements,  where 
the  package  is  already  installed.  The  paper  gives  details  on  the 
package  specifications,  ou  the  structure  of  data  aud  software, 
and  on  the  program  and  user  interface.  Application  examples 
are  given  at  the' end. 

I.  USER  REQUIREMENTS 

In  the  last  decade  a  number  of  2D  and  31)  finite  element  codes 
have  been  developed  at  universities  and  research  centers.  Yet 
the  penetration  of  these  methods  into  a  wide  range  of  industries 
is  still  not  as  expected  for  a  number  of  well  known  reasons  such 
as  lack  of  user  orientation  and  expertise.  Recent  developing  m 
computing  (workstation,  high;  resolution  graphic  displays,  cost 
reduction,...)  enables  the  design  engineer  in  industry,  to  solve 
sophisticated  field  problems  at  his  desk.  However,  the  designer  of 
new  codes  for  industrial  applications,  especially  for  field  analysis, 
should  be  aware  of  the  following  facts: 

•  Industrial  engineers  are  seldom  specialist  in  FE-racthod, 
numerical  analysis,  operatings  systems,  computer  language. 

•  The  programs  are  mostly  used  for  basic  developements  and 
not  in  the  daily  design  process. 

•  The  response  time  should  not  exeed  hours,  time  required 
to  understand  and  to  use  the  code  should  be  minimum 

The  transfer  of  this  situation  into  specifications  for  a  user-  and 
CAD-oriented  FE- package  can  be  Summarized  as  follows: 

a)  Requirements  on  problem  and  field  modelling  capabilites. 

•  Problem  definition  Jn  engineering  terms  as  a  local  field 
problem  or  a  device  problem  considering  the  coupling  of 
fields  and  circuits,  if  required. 

•  Problem  and  user  oriented  postprocessing,  i  c.  direct  eval¬ 
uation  of  results  by  means  of  standard  procedures. 

•  Compatibility  of  results  with  those  from  traditional  meth 
o'ds. 

b)  Requirements  on  the  program  interface,  structure  and 
handling  capabilites: 

•  Programmed  user  guidance  to  minimize  learning  aud  re¬ 
learning  times,  especially  in  case  of  input  errors. 

•  Minimum  data  input  and  manual  user  interaction. 

•  Communication  with  the  program  by  means  of  menace, 
mouse,  tablet,  etc. 


-  •  Automatic  mesh  generation,  error  estimation,  adaptive 
problem  oriented  mesh  refinement. 

•  CAD' interface  for  geometry  input  aud  data  base  manage¬ 
ment. 

•*  Multi-task  and  multi-user  enviroment. 

These  requirements  are  strongly  influencing  the  llwretical  basis, 
the  data  structure,  the  data  management,  the  postprocessing 
and  man-machine  interface  of  the  F& package. 

II.  DATA  STRUCTURE 

The  minimum  and  basic  data  set  for  FE-calcuIation  consist  of 

•  Node  data.  Coordinates,  potentials,  boundary  condi¬ 
tions. 

•  Element  data.  Type,  names  of  node,  material  constans, 
field  sources. 

As  the  definition  and  the  input  of  these  basic  data  is  rather 
.labour  intensive  and  offers  high  failure  probabilities,  a  user  ori¬ 
ented  data  structure  is  based  on: 

•  Geometrical  entities,  line,  circle.  ...  segments  for  the 
description  of  the  device  surfaces  and  material  boundaries. 

•  Auxiliary  entities.  Node  chains  along  the  geomeliy, 
superclcmcnls  build  from  closed  node  chains  for  the  de¬ 
scription  of  the  FE-prohleiu. 

The  node  chains  are  generated  manually  or  automatically  based 
on  the  geometrical  description  of  the  problem  The  mesh  gen 
erator  determines  the  basic  node  and  element  data  set  from  the 
auxiliary  entities. 

User  oriented  parameter  input  requires  further  definitions  of 
entities,  especially  if  material  constant  or  external  current  or 
voltage  conditions  are  assigned  to  a  set  of  superelements.  We 
are  using  in  our  package  the  following  additional  entities. 

•  Subregions  consisting  of  a  group  of  superelements  hav¬ 
ing  the  same  material  constants,  field  sources  or  integral 
current  conditions  (3). 

•  Winding  branches  consisting  of  a  set  of  subregions,  con¬ 
nected  in  scries  and  having  the  same  current. 

•  Winding  systems  consisting  of  the  winding  branches. 

This  scheme  gives  the  user  the  possibility,  to  determine  the  cur¬ 
rent  density  via  the  winding  current,  to  introduce  easy  the  ex¬ 
ternal  current  conditions  or  to  evaluate  self  and  mutual  induc¬ 
tances,  impedances,  losses  all  refered  to  winding  branch.  It  is 
especially  suited  for  the  analysis  of  2D  steady-state  or  transient 
magnetic  field  being  coupled  with  the  external  electric  circuits 
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III.  STRUCTURE  OF  FE-PACKAGE 


Program  structure:  The  package  FEM^G'is  a  highly  inter¬ 
active,  graphic  based  system  for  VMS  and  UNIX  environments, 
being  developed  in  the  Dept,  of  Electrical  Machines  (ETH)[1]. 
,It  consists  of  foutmam  components: 

•-  -  interactive  preprocessor  for  geometry  definition  and 
semiautomatic  mesh  generation, » 

-  database  with  database  management  system, 

-  solver  with  adaptive  mesh  refinement, 

-  interactive  postprocessing  with  problem  oriented 
mesh  refinements  (2). 


Interface:  Input/output  operations  use  screen-masks  and 
screen-meriues.  Generally,  two  levels  of  mask-menues  have  been 
.  introduced ^tuain  menu  and  submenues  (see  Fig.  1). 


Soled  triput  F?e 
Graphics  (Window) 
Geometry 

Node  Chains  (SE) 

Mesh  Generation 
Node  Manipulation 
Element  Manipulation 
Sutxegwns 
Material  Constants 
Transformations 
..Wirings 

Magnetization  Curves 
Boundary  Conditions 
Held  Calculation 
Analysis 


~  Read/write  data  files 
A-  Create/hardcopy  pictures,  set 
*  /  window/tablet 


*  /  window/tablet 
/  Create  elementary  geometry 

/  Define  and  create.'hode  diain 


SAVE/  SAVE  AS 
EXIT 


Fig.l,  Main  menu  of  FEMAG. 


This  mode  of  dialog  with  the  system  is  uniform,  easy  to  use 
and  almost  completely  self  explanatory.  It  makes  the  system 
easy  to  learn  and  to  learn  it  again,  which  make  it  especially 
suited  for  small  industrial  design  offices,  where  it  may  be  applied 
occasionally.  Input  data  are  introduced  by  means  of  mouse,  key¬ 
board  or  digital  tablet,  output  uses:  graphic  display,  laser  printer 
(postscript),  x-y  plotter,  graphic  printer 
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^  „  IVN  NUMERICAL  EXAMPLES 

A  number  of  typical  applications  of  the  package  for  CAD  of 
electrical  machines  and  devices  will  be  discussed  during  the  pre¬ 
sentation.  ■  One  example  is  shown  in  Big.  2  The  flux  plot  in 
an  ac  fed  squirrel-cagc  inotor  at  the  slip  s  =  0.02  with'  zero  ex¬ 
ternal  current  conditions  for  rotor  as  well,  as  the  calculated  and 
the  measured  frequency  response  are  shown  in  Fig.  2. 


S  s-  Define  and  createhode  chains 
S  Generate  basic  inesh 
A-  Submenues  for  node  and 
yj  element  manipulation. 
j  ~  Define  windings-elcments  and 
/  subregions 

Introduce  material  attributes 
Moove,  copy,  rotate  and 
reflect  structure 

•  - Define  windings  structure 

.  Define  magnetization  curves 

•  - Define  boundary  conditions 

•  - Solve  equations  system  with 

|  •  mesh  refinements 

!  Postprocessing  (fluxes,  impe- 
dancess,  forces,  torques,  volta- 
A  ges,  losses,  2D  and  3D  plots) 
\A-  Save  file  with  old/new  name 
End  of  session 


Phase  Impedance 


- A - A - A 

frtQJtncy  fHzl 


Fig.  2.  Flux  plot  at  i  =  Os  and  frequency  response  of  ac  fed 
induction  motor. 


Jlj  FEMAG  -  Reference  Manual,  Version  5.0,  ETH  Zurich, 
1990. 


Database  mid  database  management  system:  provides 
easy  access  to  data*  by  means  of  a  number  of  well  defined  sub¬ 
routines.  Structure  of  B'EMAG  is  modular  and  data-oriented, 
therefore  easily  expandable.  Flexibility  of  data  structure  has 
been  achieved  using  special  data  types  (RF.CORDS)  Changes 
in  software  involving  new  group  of  data,  e  g.  new  element  at¬ 
tributes,  require  the  change  of  relevant  records  only  without  af¬ 
fecting  other  parts  of  the  existing  software 


(2)  Tarnhuvud  T.,  Reichert  K.,  Skoczylas  J.:  Problem-onented 
adaptive  mesh  generation  for  accurate  finite  element  calcu¬ 
lation”,  IEEE  Trans.  MAG-90,  pp.779-782. 


(3)  Reichert  h.,  Skoczylas  J.,  Tarnhuvud  T.  .  Eddy  current 
calculations  in  electrical  machines,  Numerical  solution  tech¬ 
nique  and  accuracy  problems",  Int.  Conference  on  Electrical 
Machines  ICEM-88,  pp.59-64. 
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'MECHANICAL^ STRESSES  DISTRIBUTION  DUE  TO  RADIAL' SHORT-CIRCUIT.- FORCES' 
IN  TRANSFORMED  WINDINGS 

ZBIGNIEW  WESELUCHA 

Institute;  of* Electrical;  Engineering 

04-703  Warsaw.  Pozaryskiego  28,  POLAND 


Abstract  -  The  analytical  approach  presented  in.  the 
paper  provides  a -better  under  standing  of  the, behavior 
of  power  transformers  under  radial  short-circuit 
forces/  The  inner  (LV)  winding  was  taken  into  account 
as  a’ multi-layer  structure  and  i exemplary  results  are 
enclosed.  '  » 

•  «  INTRODUCTION. 

The  current  carrying«conductors.'of  the  .transformer 
windings  are  situated  in  the  region  of  magnetic  leakage 
flux  and  experience  mechanical  forces.  The  excessive 
■increase  * in^  KVA  ratings  of  large  power  transformers  in 
the  last  years  accompanied  .by  a  parallel'  increase  in 
the  forces  generated  in  tKe, 'windings  under  short- 
circuit  conditions/ are  the  main  reasons. of <■  mechanical 
damages  of  transformers.  Statistic  docuaents^rcport- ah 
Increasing  number  of  power  transformers  failures  under 
mechanical  forces  produced  by  fault  currents  (5,81. 

In  concentric  winding  the  axial  component  of  the 
leakage  flux  produces  radial  forces  (Fig. 1). 

The  radial  forces  may  cause  the  well-known  buckling 
phenomenon  of  inner  (LV)  windings.  Previous  concepts,  of 
calculating  those  mechanical  damages  were  based  on  the 
assumption  of  the  average  value  of  the  stress  in  each 
conductor  (31  and  were  far  from  expected  ones.  The 
proposition  placed  in  the  paper  increases  the  possi¬ 
bility  of  the  mathematical  model  of  transformer  coll 
and  lets  to  observe  the  extreme  values  of  stresses, 
strains  and  displacements  in  each  conductor.  The  cir¬ 
cumferential  component  of  stress  In  conductor  expresses 
mechanical  withstanding  of  a  coll  due  to  radial  forces 

LIST  OF  SYMBOLS 


Transformer- coil- Is  a' wult 1-layer  ring  consisting  of 
conductor  and' insulation  materials  (Fig. 1  B) 

The  considerations  for  the"  Inner  winding  may  be 
reduced' to- the  .one*  coil  >  chosen,  from  the  middle  of  the 
winding 

We  call  a  “zone"  each  conductor  and  Insulation  layer 
of  the  multi-layer  coil  structure. 

The  inner*  winding  (LV)  experiences  a  force  acting 
inward  tending  to  crush  or  collapse  it  (8).  It  is 
especially  difficult  problem  since  the,  windings  are 
supported  from  the  core  by  the  distance  spacers  round 
the  circumference.  Failure  nay  'occur  by  bending  in  of 
the  conductorsbetwecn  supports  producing  a  character¬ 
istic  star-shape. 


Fig. l.A)  Axial-  component  of  magnetic  flux  density  and 
radial  forces  in  simple  concentric  windings, 
B)  Tho  multi-layer  transformer  coil  structure. 


B^-  axial  component  of  the  magnotlc  flux  density 
E  -  modulus  of  elasticity 
Fr-  radial  component  of  the  density  of 
electromagnetic  force 
J  -  current  density  in  conductor 
r,0  -  polar  coordinates 

outer  and  inner  radii  of  a  winding 

u,v  -  radial  and  circumferential  components  of 
components  of  displacement 
$  m  (  I  **  for  conductor  material 
\  0  -  for  insulation  material 
c  ,c.  -  radial  and  circumferential  components  of 
'  *  strain 

K  -  tangential  component  of  strain 
k  -  stiffeness  of  support 
v  -  Poisson’s  ratio 

<?r>«re  -  radial  and  circumferential  components  of 
stress 

t  -  tangential  component  of  stress 


All  considerations  are  based  on  the  assumption  that 
the  stress  does  not  exceed  tho  proportional"  limit;  this 
mean  that  strain  must  be  proportional  to  stress,  in 
accordance  with  Hooke’s  law  (2),  The  material  of  the 
ring  must  be  elastic,  thus  capable  of  sustaining  stress 
without  permanent  deformation. 


BASIC  EQUATIONS 

2-D  Hooke’s  Law  (in  polar  coordinates) 

1  1  E 

c  ■  -  (<r  -w  ),  c«  -  (c-  -  wr  1,  y  "  -  t  (2) 

'  E  r  9  9  E  8  r  2(1*,) 

Cauchy’s  relationships  (stralns-dlsplacemcnts): 

Su  1  5w  u  i  5u  5w  w 

C  ■  - ,  C«  -  -  ♦ 

r  3r  Q  r  50 


r  * - - -  (3) 

r  50  5r  r 


FORMULATION 

Since  the  generated  forces  are  electromagnetic  in 
origin  they  are  termed  here  as  the  ■electromagnetic* 
forces.  The  radial ly-actlng  components  of  the  forces  in 
a  transformer  windings  are  easily  and  accurately 
calculated  by  elementary  methods  (2,7,8).  From  the  the¬ 
oretical  point  of  view  we  say  here  about  "weak-coupled' 
electromagnetic  and  mechanical  fields. 

The  electromagnetic  force  density  is  determined  as 
follows  (7) 

F.  ■  J  *  8>  <ll/»’>  (1) 


From  the  theory  of  elasticity  we  get  2-D  Kirchhof’s 
problem  which  is  described  by  two  differential 
equations  of  static  equilibrium  as  follows  (I) 


ftr 

1 

5r  c  - 

r 

♦  _ 

_ ♦  _L 

5r 

r 

50  r 

5t 

2t 

1 

♦ 

Sr 

r 

r  50 
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* BOUNDARY ^ COND IT IONS: 

U  continuity -of*  stresses  ahd<- displacements for  each 
"J"  and  “J+l*  zones' 

«rJ  -  pi* 
t  <r 


-  U>** 

VJ  -  VJ*‘ 

2. -zero  stress  condition  for 
><r  «»0 

-r 

T  •  0 

3. .zero  Stress  condition  for 
t  »;o 


(5a) 


(5b) 


(Sc) 


Fig.  3,  Distribution  of  an  average  and  extreme  values  of 
stresses  V  in  the  inner  conductor,  of  a  coll. 


4.  dual  stress-displacement  condition  for  r  *  R  (7J 
representing  a  contact  problem  w 

.  1  ^ 

u  *  ~  <?'  "  for  support  region  .. 

I  (5d) 

*rm  0  “  outside  the  support  J 

Substituting  (2)  and  (3)  into  (4)  ve  obtain  differ¬ 
ential  equation  'for  'radial  displacements  u.  the 
solution  of-whlch  and  boundary  conditions  (5)  allow  us 
to  reduce  the  contact  problem  to  Fredholm  integral 
equation  of  the  first  kind.  The  solution,  of  the  inte¬ 
gral  equation  Is  the  base  of  the  numerical  algorithm. 

RESULTS 

The  following  parameters  of  the  coll  voro  taken  for 
the  calculations: 

5  conductors  In  a  coll,  inner  radius  R  <■  650  mm, 
36  spacers  around  circumference.  ** 

conductor  width  1.6  am.  Insulation  width  0,3  am. 
spacer  width  20  *a, 

Poisson’s  ratio  for  conductor  v  ■  0.32, 
modules  of  elasticity  E:  -  conductor  1. 125M05  MPa, 

-  Insulator  250  MPa, 

-  spacers  350  MPa. 

The  maximum  electromagnetic  force  has  been  assumed 
as  the  Unit  quantity. 


Fig. 2  Distribution  of  circumferential  stresses  In  the 
support  region  (the  middle  and  the  end  of  spacer) 


Fig. 4.  Distribution  of  the  extreme  values  of  circum¬ 
ferential  stresses  c r  in  every  conductor  of  a 
coll.  8 


CONCLUSION 

It  has  been  shown  (Flg.2.3)  that  the  mechanical  "over- 
stresses"  occurring  In  conductors  of  a  coll  near  the 
end  of  the  spacers  are  even  few  times  greater  then  the 
average  value  of  stress.  The  extremely  hard  conditions 
appear  in  the  inner  conductor  of  a  coll  supported  on 
the  stiff  spacers. 
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A  GENERAL  SCHEME  FOR  CUTTING  THE,  MAGNETIC  SCALAR 
REGION  IN  MULTIPLY  CONNECTED  3D  EDDY  CURRENT  PROBLEMS 


P.J.’ Leonard  and  D.' Rodger 
School  Electrical  Engineering 
-University  of  Bath.  UK 


Abstract  —  The  efficient  solution' of  eddy  current  problems  can 
be  achieved  using  the  formulation.  This  docs  however  re¬ 
quire. cuts  to, be  made  in  the  magnetic  scalar  region  whenever  the 
conductor  is  multiply  connected.  This  paper,  present's  a. method 
for  modelling  cuts  for  a  variety  of  topologies  including  interlinked 
conducting  circuits. 

I.  INTRODUCTION 

In  a  previous  paper  (1)  a  scheme  was  described  for  cutting  the 
magnetic  scalar  potential  over  the  spanning  surface  of  a  hole  in 
a  conductor.  The  scheme  was  limited  to  surfaces  which  did  not 
intersect  thcmseUes.  In  this  paper  wc  generalises  the  scheme  to 
allow  intersecting  surfaces  and  folds. 

II.  THEORY 

A.  Modelling  a  single  hole 

The  formulation  is  used  to  model  eddy  current  problems. 
The  conductor  is  modelled  using  the  magnetic  vector  potential, 
whilst  surrounding  non-conductors  modelled  using  the  magnetic 
scalar  potential.  The  usual  finite  element  approximation  for  is 
a  continuous  function.  Reference  (1)  describes  how  the  represen* 
tation  can  be  modified  to  allow  a  jump  in  the  value  of  ^  as  we 
pass  through  a  surface  spanning  a  hole. 


=  r  (cut) 

lo 

V 

nA 

(jfsa 0 

(Figure  1.  Cross  section  of  conducting  circuit  showing 
the  simple  cut  in  0^ 

The  definition  of  the  magnetic  scalar  is  modified  in  a  layer  of 
elements  that  spans  the  hole  (see  figure  1); 

H  r=  (1) 

The  w,  are  the  noi  trial  basis  functions  fur  a  continuous  scalai 
potential;  is  an  additional  discontinuous  basis  function,  that 
lives  within  the  layer  of  elements  spanning  the  hole. 


The  additional  basis  function  is  defined  to  be  zero  on  one  side  of 
the  layer  and  unity  on  th’c  other  With  this  new  basis  function 
the  value  of  rj;  jumps  as  it  passes  through  the  =  1  surface, 

=  f  +  h  (2) 

The  finite  element  model  follows  the  usual  development.  Howc\cr 
we  have  an  extra  equation  corresponding  to  the  new  unknown, 

-  j  (Vu(  x  A)  ■  n  dr  +  j ;<V«£  ■  V0<m  =  0  (3) 

In  this  equation  can  be  regarded  as  a  test  function  which 
(weakly)  ensures  that  the  line  integral  of  A  around  the  cut  is 
equal  to  the  magnetic  flux  flowing  through  the  surface. 

It  is  easily  seen- that  any  number  holes  can  be  modelled  using  this 
technique  providing  the  surfaces  do  not  interfere  with  each  other. 

B.  Conducting  bar  piercing  cut 

Figure  2  shows  a  slightly  more  complex  situation,  thcie  is  a  con¬ 
ducting  bar  which  interferes  with  the  cutting  surface. 


Figure  2.  Showing  a  loop  with  a  bar  which  pierces  the 
_ cutting  surface _ 

In  fact  this  case  does  not  need  any  special  treatment.  The  re¬ 
duction  of  the  i  itcgral  in  the  region  should  be  equal  to  the 
increase  of  the  integral  over  the  P(A  interface  (equation  3. 

C.  Interconnected  circuits 

If  two  circuits  link  each  other  as  shown  in  figure  3,  then  the  two 
basis  functions  for  the  cuts  interfere.  However  the  principles  re¬ 
main  the  same,  we  just  have  two  extra  equations  Each  equation 
will  see  a  contubution  from  the  other  through  the  definition  of  v 

0=w(i&>  W 
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Figure  3.  Two  interlocking  circuits 


D.  Self  intersecting  or  folded  cutting  surfaces 

Some  shapes  of  circuit  have  holes -which  are- more  difficult  to 
cut<  For  example  a. helical  winding  is-' topologically  equivalent 
to  a  doughnut  but  a  cutting  surface  which  docs  not  interfere  with 
itself  is  more  difficult  to  construct. 

Allowing  the  surface  to  fold  and  intersect  iUcJf  helps- the  con* 
struction  of  the  cutting  surface  but  requires  a  modification  of  the 
basis  function  The  correct  strategy  is  to  add  the  contributions 
when  these  folds  or  intersections  occur.  In  practice  it  is  easier  to 
construct  the  cutting  surface  which  makes  the  problem  simply 
connected  and  then  deduce  the  require  jumps  (2). 

For  example  consider  the  loop  of  conductor  shown  in  figure  4,  and 
the  cutting  scheme  shown  in  figure  5. 


Figure  4.  Loop  of  conductor 


-  Figure  5.  Loop  of  conductor  shown ‘the  cutting  surface 
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Figure  6.  Slice  through  the  loop  showing  the  required 
_ basis  function _ 


III.  CONCLUSION 

We  have  presented  a  scheme  for  cutting  the  magnetic  scalar  region 
in  multiply  connected  eddy  current  problems  and  given  a  simple 
example  of  U*s  application.  The  scheme  can  be  used  even  when 
cutting  surfaces  overlap. 
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The  outermost  surface  requires  a  jump  equal  to  the  current  in 
the  loop  whilst  the  inner  surface  requires  twice  this  amount  The 
basis  function  required  to  produce  this  jump  is  illustrated  in  figure 
6.  The  integers  are  the  the  nodal  values  of  the  basis  function  W( 
within  a  particular  region,  thus  the  value  of  the  basis  at  a  given 
node  will  depend  on  which  element  we  are  looking  at.  Note  that 
this  could  be  regarded  as  the  sum  of  two  overlapping  surfaces. 


Three-dimensional  Electromagnetic  Computations 
in  Quasi  Axisymmetric  Structures 


R.  Albanese* 


Introduction.  ^Several  Numerical  procedures  and  rela¬ 
ted  computer  ,  codes  :  are  now  ,  available  for.  the  analysis 
of  electromagnetic  ^  (e.g.  magnetostatic  and  eddy  cur¬ 
rent)  problems  in  three-dimensional  geometries.  However, 
there  are  a  number  oh  applications  for  which  fully  three- 
dimensional  models 'can' hardly,  b'e  used.  An  example  of 
this  kind  is  furnished  by  the  analysis  of  a  plasma  discharge 
in  a  Tokamak.  In  such  devices  the  hypothesis  of  axisym- 
metry  is  acceptable  and  even  mandatory  for  thw  plasma. 
On  the  other  hand,  the  hypothesis  is  not  directly  applica¬ 
ble  to  the  patterns  of, the  eddy  currents  and  electromag¬ 
netic  fields  in  the  metallic  structures,  due  to  their  seg¬ 
mentation  along  the  toroidal  direction.  Nevertheless,  the 
interest  is  often  focused  on  the  plasma  region.. Here,  the 
electromagnetic  fields  produced  by  eddy  and  magnetising 
currents  localised  in  the  metallic  structures  in  the  pres¬ 
ence  of  typical  axisymmetric  excitations  (currents  flowing 
in  poloidal  field  coils,  plasma  motion  and  disruptions)  are 
substantially  axisymmetric.  In  this  case,  it  is  possible  to 
approximate  the  quasi- axisymmetric  metallic  structures 
by  means  cf  equivalent  axisymmetric  models  connected 
to  a  fictitious  electric  network.  Parameters  (resistances 
and  inductances)  of  this  network  and  material  properties 
of  the  axisymmetric  model  can  be  selected  such  as  to  man- 
tain  the  input/output  behavior  as  close  as  possible  to  that 
of  the  three-dimensional  system. 

Three-dimensional  model.  The  eddy  current  prob¬ 
lem  in  a  three-dimensional  conducting  region  V  can  be  de¬ 
scribed  by  expanding  the  eddy  current  density  as  J  (x,  t)  = 
w^ere  fh*  basis  functions  J*;(x)  are  con¬ 
strained  to  be  solenoidal  in  V  with  J*(x)  •  n  =  0  on  5V. 
For  nonmagnetic  conductors,  the  time  evolution  of  the  co¬ 
efficients  /*(t)  can  be  obtained  by  means  of  the  following 
linear  initial  value  problem  {!]: 

LL+£L=Z  (1) 

with  L,k  =  “/v/v  dV  dV,  R,k  =  /v  J,(x) 

r)Jt(x)  dV  and  V,  =  Jv  J,(x)  .  E,(x,t)  dV.  Hers  ^  is 
the  vacuum  permeability,  tj  the  resistivity  and  E«  the  ex¬ 
ternally  applied  electric  field.  The  strict  analogy  with  a 
lumped  parameter  approach  is  evident:  £  and  R  play  the 
roles  of  the  inductance  and  resistance  matrices,  respec¬ 
tively,  whereas  l  and  V  play  the  roles  of  currents  and 
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applied  voltages. 

In  basically  axisymmetric  . systems -like  the  tokamak  re¬ 
actors,  it  is  interesting  to  analyse^  the  influence  of  the 
quasi-axisymmetric  structures  on  the,  mutual  interaction 
between  axisymmetric  conductors.  In  this  case,  the  ap¬ 
plied  electric  field  is  usually  given  by  a  sei  of  axisymmetric 
currents  /«*  (input  quantities): 

L=-Met  (2) 

where  Me, fc  is  the  magnetic  flux  linked  with  due  to 
the  unit  vector  current  density  shape  function  J,;  the  rel¬ 
evant  output  quantities  to  be  computed  are  generally  the 
magnetic  fluxes  ^  linked  with  a  number  of  axisymmetric 
coils: 

!£0  =  M0-Z+M0'-I<  (3) 

Monk  is  the  flux  produced  by  the  unit  axisymmetric  cur¬ 
rent  lck  and  linked  with  the  circumference  I\,  whose  cen¬ 
ter  is  on  the  toroidal  axis  of  symmetry;  Moik  has  a  similar 
meaning  but  is  due  to  the  degree  of  freedom  /*  associ¬ 
ated  with  the  ehape  function  J,.  Therefore,  in  complex 
notation,  the  syetem  can  be  put  in  the  form: 

(»i  +  g) W 

<5> 

where  t  indicates  a  variable  in  the  s  domain. 

Quasi-axisymmetric  model.  Toroidally  continuous 
structures  can  well  be  schematiied  as  axisymmetric  con¬ 
ductors  having  equivalent  sections  and  resistivities.  This 
is  for  instance  the  case  of  the  vacuum  vessel  of  a  toka¬ 
mak.  However,  there  are  structures  like  the  tokamak  first 
walls  which  consist  of  a  number  of  toroidally  insulated 
sectors  (see  Figure  l).  As  suggested  in  (2,3),  structures 
of  this  kind  can  be  schematired  as  a  set  of  axisymmetric 
coils  connected  to  an  external  network.  The  parameters 
(inductances  and  resistances)  of  the  branches  of  this  ficti¬ 
tious  network  can  be  selected  to  approximate  the  effects  of 
the  nontoroidel  currents.  The  behavior  of  such  a  system 
can  be  described  by  the  following  equations: 

(z-,+d+i)-L  =  ->M<‘L  («) 

^o-Lx+M O'-L,  (7) 

Here  •  indicates  the  matrices  due  to  the  axisymmetric  con¬ 
tributions  and  to*  indicates  the  axisymmetric  independent 
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Figure  i:  Current;  distribution" in  one ; half,  of  a  blan¬ 
ket/first  wall  module. 

** 

‘  currents. the  admittance  matrix  of  the  external  ficti¬ 
tious  network.  The  topology  of  this  network' is  chosen  such 
as  to  have  a  close  relationship  with  a  phisical  interpreta¬ 
tion  of  the  circuit  parameters' involved.  These  unknown 
parameters  can  be  initially  computed, using  approximate 
analytical  expressions,  which, in  case  of  a  tokamak  first 
wall  are  reported  in  [2,4].  Of  course  the  I/O  behaviors  of 
the  3D  system: 

=  (8) 

and  of  the  approximate  2D  quasi  axisymmetric  system: 


will  result  to  be  different.  However,  their  difference  can 
be  reduced  by  using  a  procedure  similar  to. the  Levy’s 
hfethod  [5].  In  fact,  the  unknown  coefficients  of  the 
polinomials  at  the.  numerator  and  denominator  of  Hid 
(//2d(3)  =  B(s)/A(s))  can  be  computed  as  the  min¬ 
imum  of  the  quadratic  form  U  =  E*Ei  where 
Ei  —  A(;w,)  —  B(;w,),  w,  are  a  number  of  suit¬ 

able  frequencies  and  ■*  denotes  adjoint.  Of  course,  the 
relationship  among  coefficients  and  circuit  parameters  is 
non  linear  and  can  be  obtained  by  a  classic  procedure  of 
circuit  synthesis.  In  a  more  direct  numerical  way,  resis¬ 
tances  Jl/y  and  inductances  L/y  of  the  fictitious  external 
network  can  be  tuned  in  order  to  minimize  the  functional: 


A  I  g.n(M-g,n(w)r 

h  i2 


(10) 


where  w,  are  again  a  number  of  suitable  frequencies.  R/j 
and  Lf3  should  be  positive,  so  that,  in  order  to  have  an  un¬ 
constrained  minimization  problem,  they  shoud  be  rewrit¬ 
ten  as  R/j  =  L/j  =  Lq •  The  problem  is 

nonlinear.  However,  there  is  the  advantage  that  the  phys¬ 
ical  interpretation  is  not  lost,  and  that,  apart  the  I/O 
behavior,  other  pi  op  cr  ties  are  somehow  preserved  by  the 
quasi-axisymmetric  model. 


Results  and  conclusions.  As  an  example,  we  studied 
the  blanket/first  wall  geometry  described  in  Figure  1. 
Due  to  the  symmetries  of  the  system,  the  3D  numerical 
model  (72  elements,  62  degrees  of  freedom)  is  limited  to 
one  half  of  one  of  the  48  modules.  The  conductors  car¬ 
rying  the  currents  L*  are  symmetrically  located  with  re¬ 
spect  to  the  equatorial  plane.  Two  possible  inputs  are 
considered:  the  first,  symmetric,  produces  on  the  equato¬ 
rial  plane  a  vertical  field  BJt  the  second,  antisymmetric, 


Figure  2:  BR  as  a  function  of  time.  The  curves  refer  to 

the  output  of  3D. model  (- - ),  2D  approximate  model 

(  -“■-)  and  2D  optimised  model  ( - -). 

gives  at  the  same  position  a  radial  field  Br  .  Accordingly, 
the  outputs  are  the  flux  linked  with  the  plasma  and  two 
flux  combinations  giving  the  field  BM  »  Aip/2nRAR  and 
Br  «  -*A0/2xi2Az. 

The  2D.  system  is  made  out  of,  16  axisymmetric  conduc¬ 
tors,  properly  connected  to  an  external  fictitious  network, 
as  previously.described. 

The  antisymmetric  output  (the 'field  Br .  due  to  an  anti¬ 
symmetric  input  linearly  rising  to  1  MA  in  10  ms  and  then 
constant),, is  shown  in  Figure  2,  where  a  comparison  is 
made  among  the  3D  case,  the  approximate  2D  case  whose 
network  parameters  have  been  analitycally  estimated,  and 
the  optimized  2D  case. 

In  this  simple  case,  it  can  be  seen  that  the  axisymmetric 
model  works  in  a  satisfactory  way  also  without  the  opti¬ 
mization  procedure.  Of  course,  more  complex  real  cases 
cannot  be  handled  with  a  similar  accuracy  without  adopt¬ 
ing  a  systematic  approach  like  the  one  here  described. 
The  I/O  behavior  is,  preserved  with  a  little  effort  (uncos- 
trained  minimization  of  a  nonlinear  functional).  However, 
the  physical  interpretation  is  not  lost.  Other  local  and 
global  quantities  (other  than  those  used  in  the  optimiza¬ 
tion  process)  can  be  obtained  with  sufficient  accuracy.  In 
this  respect,  the  optimization  plays  the  role  of  a  tuning 
of  the  parameters  of  the  fictitious  network.  Finally,  it  is 
worthwhile  to  notice  that  the  relevant  3D  transfer  function 
can  also  be  obtained  by  means  of  a  set  of  measurements. 
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Abstract  -  Three-dimensional  computations  of  electromagnetic 
fields  are  presented  with  the' aid.,  of*  the  method,  of  finite 
elements.  Magnetostatic,  eddy.,  current  and  microwave  fields 
are  treated  Various  formulations  are  discussed  with  some 
advantages  .and’ disadvantages  {pointed  out. 

INTRODUCTION 

fhe- differential /equations  of  electromagnetics',  can ‘be¬ 
set  up  in  terms  of  various*  variables:  It. is  possible; to ;  use 
the  field  quantities  directly,  or  to*  introduce  potential  func¬ 
tions.  In  the  course  of  their  numerical  solution,  the  differen¬ 
tial  equations  are  replaced  by  sets,  of  algebraic  equations. 
If  nodal  finite,  elements  (l]lare  used'to  set  up  these  equa¬ 
tions,  the  potentials  are  advantageous,  since  they  are  continu¬ 
ous.  Recently, .vectorial. or. edge  clements.are  also  frequently 
employed  [2,3],  they  fit  well  to  a  rcpresenUtlon  by  field 
vectors  whose  tangential  components  only  are  continuous. 

For  -three-dimensional  computations,  the  numerical 
stability  of  the  method,  is,  extremely  important.  Due  to  the 
large  number  of  degrees  of  freedom,  Iterative  methods  are 
bound  to  be  used  for  the  solution  of'  the  equations,  the 
most  frequent  one  .being  the  method  of  conjugate  gradients 
[4].  Unless  the  uniqueness,  of .  the  ^variables  Is  ensured,  the 
number  of  iterations  needed  becomes  unbearably  high. 

Tn'theToHowIng  discussion  of  various  3D  computations, 
the  authors’  experience  with  respect  to  the  above  two 
points  wlH.be  outlined. 

MAGNETOSTATIC  FIELDS 

Magnetostatic*  fields  can 'be.  represented  with  the- aid 
of  .a.  reduced  scalar  potential.  [5j,  be.  as  the  sum  of  an 
Impressed  field  satisfying  Ampere’s  law  and  of  the  gradient 
of -a  scalar  potential.  Nodal  finite  elements  suit  Well  for 
the  approximation  of  this  scalar.  Serious  cancellation  errors 
occur  In  highly  permeable,  parts,  where  the  Impressed  field 
calculated  accurately  by  Blot-Savart’s  law  is  several  orders 
of  magnitudes  higher  than  the  resultant  field,  obtained ;  by 
subtracting  the  gradient  of  the  reduced  scalar  potential,  an 
operation  Involving  numerical  differentiation.  The  most 
widely  used  method  to  overcome  this  difficulty  Is  to  set 
the  Impressed  field  to  zero  in  regions  with  high  permeability, 
thus  arriving-  at  a  total  scalar  potential  here  [5].  Another 
possibility  is  to  compute  the  Impressed .  field  under  the 
assumption  of  Infinite  permeability  [6],  In  both  cases,  the 
discontinuities  of  the  Impressed  field  give  rise  to  additional 
Interface  conditions  to  be  satisfied  by  the  scalar  potential. 
The  authors  have  recently  experimented  successfully  with 
a  continuous  Impressed  field  represented  with  the  aid  of 
edge  elements  [7],  A  similar  method  was  suggested  In  (8], 


Fig.  I  Flux  density  distribution  in  ferromagnetic  plates 


Recent  results-  of  a  TEAM  Workshop  problem  by 
several  groups  around  the  world  by  widely  differing  methods 
[9,10]  have  pointed  "out  that  scalar  potential  methods  .-.may 
yield  too  high  field  values  in  feriromagnetlc  regions.  The 
-  problem  consists-of  iron  channels  around  a  racetrack  shaped 
coll.  The  flux  density,. Is,  shown  In  .Fig.  I  In,- one  -  eighth  of 
the  model  as' obtained1  by  total  and  reduced  scalar-potentials 
by.  C.  Magele  flO]/  The'  results  seem '"completely  acceptable, 
butf  the*  value' of  the  .flux  density  .In 'the  Iron  j  plates -is 
about  10  .  percent  higher  than  ..the  .  measured  value  even  If 
the  discrittization  Is  extremely  fine. 

An  alternative  to  the;  scalar  potential  ‘  representation  Is 
to  use  a  magnetic  vector  potential.  In  case  of.  nodal  elements, 
111- Conditioning) -occurs  If  no  gauging  Is  used  [H].  The 
Coulomb  gauge  can  be  enforced  on  the  vector  potential  to 
ensure  Its  uniqueness  ,and  thus  numerical'  stability  [123, 
however,  too  low  flux  densities  are  obtained  In  ferromagnetic 
/parts  [ll).  This  problem  has  been  overcome  by  the  authors 
by  freeing  the  normal'  component  of  the  vector  potential 
over  iron/alr  Interfaces  [ll,7j.  The  alternative  of  employing 
edge  elements  to  represent  the  vector  potential  have  yielded 
encouraging  results  (see  results  by  T.  Nakata,  [9]).  but  It 
seems' that  the  numerical  stability  becomes  inferior  .and  it 
can  be  little,  Improved  by  gauging  [13]. 

It  seems’  that  the  computation  of  magnetostatic  fields 
in  three  dimensions  is  by.  no ’means'- a  problem  that-can  be 
considered  to  have  been  solved  long  ago. 

EDDY  CURRENT  FIELDS 

Due  to -the  coupling  between  the,  electric  and  magnetic 
field,  eddy  current  fields  cannot  be  described  by  a  scalar. 
Various  continuous  potentials  can  be  introduced  to  represent 
them,  in  the  most  general  case  both  a  vector  and  a  scalar 
potential  are  necessary  In  eddy  current  carrying  conductors. 
One  possible  pair  is  a  magnetic  vector  potential  and  an 
electric  scalar  potential  (A.V),  the  other  Is  a  current  vector 
potential  and  a  magnetic  vector  potential  (T.i^).  The  static 
magnetic  field  In  the '  surrounding  nonconducting  region  can 
again  be  described  either  by  a  magnetic  scalar  potential  or 
by  a  magnetic  scalar  potential.  All  these  formulations  In 
conductors  and  nonconductors  can  be  coupled  to  each 
other  in  a  symmetric  way  [14].  In  order  to  achieve  numerical 
stability.  It  is  necessary  to  gauge  the  vector  potentials.  A 
possibility  is  to  enforce  the  Coulomb  gauge  on  them  [12]. 


1 

& 

Fig.  2  Eddy  current  density  in  a  liquid  nitrogen  shield 
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Ia  so= sr  problfisj,  tbe  fdiy  carts:  aojfcf  coodoctow 
are  tHa  sheets  with  s*tfi{24c  variation  of  the  eddy  carts! 
•JfsjKy  1 a  the  direction  soraal  to  Lbdr  ssiface  aod  with 
tbe  waul  component  of  tbe  carts:  deaHy  practically 
zero.  la  tbs*  case  the  etc  of  roleae  tltatslx  to  describe 
tbe  eddy  entreats  leads  to  oeaeica!  probleats  dse.to  tbe 
large -differences  la  elcatst  sizes  la  tbe  vidssty  of  tbe 
coad^cw*.  A  possibUty  is  to  model  tbe  coadcctors  by 
sheets  of  zero  thickness  aad  to  describe  tbe  eddy  carrrats 
by  a  slagle  component  current  sector  potential.  A  method 
to  cocple  this  stream  function  to  *  scalar  potential  la  tbe 
so?Tcwn<£cg  nonconducting  region  has  beea  preseated  *a 
[l SJ.  Tbe  aatbors  hare  recently  developed  *  method  to 
correct  tbe  above  description  with  a  vector  potential  for rela¬ 
tion  of  tbe  sorrooodlag  static  field  [16J.  T Ms  method  bas 
beea  applied  'to  tbe  competition  of  tbe  transient- eddy 
ctiiTeet  field  of  a  large  superconducting  coil  In  an  experi*nental 
setup  for  future  TOKAMAK  reactors  [f7j.!  Tbe  distribution 
of  tbe  eddy  current  density  la  a  liquid  nitrogen  shield 
around  such  a  D-shaped  coil  dse  to  a  discharge  Is  shown 
In  Kg-  2 

3D  CAVITIES 

la  tbe  analysis  of  3D  cavities,  the  problem  is  to  find 
tbe  frequencies  and  the  corresponding  field  patterns  that 
can  exist  without  external  excitation.  In  contrast  to  the 
previous  deterministic  problems,  this  leads  to  an  eigenvalue 
problem.  "P'e  finite  element  computation  of  such  problems 
has  long  been  plagued  by  so  called  spurious  modes  which 
are  nonphysical  solutions  that  approximate  no  real  mode. 
They  are  now  recognized  as  approximations  of  gradient 
fields  corresponding  to  the  zero  frequency  eigenvalue  that 
has  infinite  multiplicity  unless  the  vector  variable  used  is 
gauged  properly  [18]. 

One  possibility  to  eliminate  spurious  inodes  '  is  to 
employ  nodal  elements  but  enforce  the  Coulomb  gauge  on 
the  vector  potential  with  a  scalar  potential  also  used  [19]. 
This  method  leads  to  as  many  zero  eigenvalues  as  there 
are  unknown  scalar  potential  values.  The  second  mode  of  a 
rectangular  cavity  partially  filled  by  a  dielectric  slab  [20] 
is  illustrated  In  Fig.  3  by  the  distribution  of  the  electric 
field. 

Spurious  modes  can  also  be  avoided  by  using  edge 
elements  and  a  field  quantity  as  system  variable.  In  this 
way,  the  space  of  gradient  fields  Is  projected  to  a  subspace 
of  finite  dimensions  in  the  space  spanned  by  the  edge 
elements.  Therefore,  similarly  to  the  above  method  used  in 
conjunction  with  nodal  elements,  the  number  of  zero  eigen¬ 
values  is  known  in  advance,  with  the  rest  of  the  modes 
approximating  physical  solutions  [l8,2i]. 


reviewed.  Tbe  importance  of  tbe  appropriate  choice  of  tbe 
system  variables,  of  tbe  gasge  conditions  and  of  tbe  elements 
esed  bas  beer  pointed  oct. 
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ASsrrart-Ix^uifigthe  magnetic  vee  cor  potential  for 
dij  cur rent  calculations.  there  otllW  the  problem  of 
selecting  the  gauge  for, it.  As  typical  gasses,  there 
are  the  Coulcsb  and  Lorentz  gauges.  As  is  veil  know, 
ty  the  gauge  transformation  tberfoxner  transforms  to 
the  latter,  and  vice  versa!  1]. 

Io  this  paper,  the  gauge  transformation  between  the 
Cooloah  and  Lorentz  gauges  *s  investigated  clerically 
for  2D  and  3D  eddy  correct  problems,  csing  the.  finite 
difference  ret  hod  for  the  conductor  region  and.  the 
boundary  ictegral  equation  netbod  for  the  free  space 
region. 

It  is  concluded  iron  the  computed  results  that  the  two 
formulations  yield  almost  the  same  results  provided 
that  aa  appropriate  mesh  is  used  for  the  conductor  re¬ 
gion. 

INTRODUCTION 

As  is  veil  known,  the  electric  field  intensity  is  ex¬ 
pressed  in  teres  of  the  magnetic  vector  potential  aad 
electric  scalar  potential  as: 

E  -  -JA/3t  -  W  (1) 

Physical  quantities  such  as  the  electric  field  inten¬ 
sity  and. current-  density  resain  unchanged  under  the 
following  gauge  transformational: 

A’  -  A  +  VC.  V*  -  V  -  3t/3t  (2) 

where  C  is  an  arbitrary  scalar  function.  It  follows  as 
a  consequence  that  there  exist  infinitely  cany  formula- 
Cions  for  the  eddy  current  calculation  vhich  use  the 
cagnetlc  vector  potential. 

For  exasple,  let  us  assume  that  A  satisfies  the  Coulonh 
gauge  (9-A  -  0).  Then,  if  ve  choose  the  scalar  function 
C  **: 

V2Z  -  yoK/3t  +  yoV  -  0  (3) 

the  corresponding  A*  and  V*  satisfy  the  Lorentz, gauge: 
V-A*  +  yoV*  -  0  (4) 

In  this  paper,  the  two  typical  gauges,  nasely  the  Cou- 
losb  and  Lorentz  gauges,  are  investigated  nuserlcally. 
In-the  cunerical  conputatlon,  the* finite  difference 
cethod  is  used  for  the  conductor  region  and  the  bound¬ 
ary  Integral  equation  cethod  for  the  free  space  region, 
and  the  Causs-Seidel  cethod  Is  used  as  the  solution 
cethod. 

FORMULATION  USING  THE  COULOMB  GAUGE 

For  slsplicity,  ve  assuse  that  the  peraeablllty  and 
conductivity  are  constants  in  each  region.  The  formula¬ 
tion  is  written  asl2J: 

In  the  conductor  (Oj): 

VJAt  -  po3Aj/3t  -  uoW  -  0,  V2V  -  0  *5) 

In  free  space  (Q2): 

V2A'  +  mjJo  •  0  *  (6) 

On  the  interface  (r):  *. 

£1  -  A2,  n  x  1/y  V  x  A|  -  n  x  I/y0  V  x  Aj 
v‘Ll  •  n-(3Aj/3t  +  Wj  •  0  (7) 

Boundary  condition -at  infinity: 

A2(r,t)  -  0(l/|r|2)  (8) 


Initial  condition  at  C  •  0: 

Aj  -  0.  V,-  0,  Aj  -  0  (9) 

FORMULATION  USING  TEE  LORENTZ  CADGE 

As  is  veil  known.  the  Lorentz  gauge  magnetic  vector  po¬ 
tential,  and  electric  scalar  potential  arc  written  In 


the  conductor  as(3): 

-  p=3Aj/5t  -  0,  V2V  -usiV/St  -  0  (10) 

Applying  the  following  gauge  transformation: 

V2e  “  vs3(/3t  -  0  in  2|,  and 

J£/;a  -  f  3V/3n  dt  on  T  (U) 

0 

to  Eq. (10)  gives: 

V*A|'  -  po3Aj73t  -  0  In  Cj  (12) 

V2V'  -  poJV/3 t  -0  in  Qj,  and 

3V'/?n  -  0  on  f,  V'(r,t)  -0  at  t  -  0  (13) 


Fros  Eq.(13),  ve  obtain  V*  •  0  In  flj,  therefore,  A|* 
reduces  to  the  sodlfied  cagnetlc  vector  potential. 

In  the  following  formulation,  ve  use  the  modified  vec¬ 
tor  potential  in  the  conductor  and  the  magnetic  scalar 
potential  in  free  space. 

In  the  conductor  (fij): 

V2A  -  po3A/3t  -  0  (14) 

In  free  space  (fl2): 

H  -  H0  -  V#,  V2$  -  0,  and 

H0(r,t)  -  Jfl2  J0(r')  x  (l  *  "  l'l*  dQ2  <15> 

On  the  interface  (D: 

n-A  -0,  o  x  1/u  V  x  A  -  n  x  (H<>  -  V#),  and 
n-V  x  A  •  n*Uo(Ho  ”  W)  (^) 

Boundary  condition  at  infinity: 

*(r,t)  -  0(l/|r|2)  (17) 

Initial  condition  at  t  •  0: 

A  -  0,  $  -  0  *18) 

NUMERICAL  COMPARISON  OF  THE  FORMULATIONS 

Ve  consider  a  3D  tice-haraonlc  eddy  current  problc  *  •**  r 
an  aluminum  block- shown  in  Fig.l,  placed  in  a  uni  »*i 
cagnetlc  field. 

Physical  properties  used  arc  as  follows: 


Fig.l  Aluminum  Block 
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V  -  4t  X  10‘7  B/a;  c  -  2.5  x  107  S/s, 
f  -  50  and  100  Hz,  39  -  1  +  JO  T  (19) 

The  discretization  of'tfce  block  is  shown  ia  Fig. 2.  The 
total  member  of  codes  for  chef inice  difference  equa¬ 
tions  is  2,168  and  that  for  the  boundary  Integral  equa¬ 
tions  Is  832.  The  confutation  is  carried  out  for  the 
1/8  geometry,,  using  a  symmetry  relation.  The  shape  of 
the  element  for  FPH  is  a  cube  yhosc  size  is  4  sa  x  4  ea 
x  4  so,  and  that  for  BID!  Is  a- square  whose  size  is  4 
era  x  4'sss.  Zero-order  interpolation  is  used  for  BIEM- 
The  solution  nethod  used  is  the  Causs-Seidel  cethod. 
Confuted  results  for  the  eddy  current  distribution  in 
the  block  are  shown  in  Fig-3  for  the  frequency  of  50  Ez 
and  in  Flg.4  for  the -frequency  of  100  Hz.  Computer  used 
is  EP9000-360.  The  confutation  times  for  the  Coulomb 
gauge  and  Lorentz  gauge  formulations  are  respectively 
74  sin  55  sec  and  57  min  54  sec. 

It  is  seen  In  Fig. 3  and  Fig. 4  that  there  exists  a  dis¬ 
crepancy  of  10-15  percent  between  the  two  formulations. 
The  cause  for  the  discrepancy  is  low  discretization  of 
the  block.  The  discrepancy  becomes  smaller  with  higher 
discretization.  This  fact  his  already  been  proved  for  a 
2D  case. 

CONCLUSION* 

In  this  paper,  the  Coulonb  and  Lorentz  gauge  magnetic 
vector  potential  formulations  for  3D  eddy  current  cal¬ 
culations  are  Investigated  numerically  for  a  3D  eddy 
current  problem.  From  the  computed  results,  -it  may  be 
concluded  that  the  two  formulations  yield  almost  the 
same  results  provided  that  an  appropriate  discretiza¬ 
tion  Is  applied  to'  the  conductor  region. 

For  the  problem  having  a  constant  conductivity,  the  Lo¬ 
rentz  gauge  formulation  has  an  advantage  over  the  Cou¬ 
lonb  gauge  formulation  In  the  computation  time.  However 
this  advantage  falls  for  the  problem  having  a  changing 
conductivity. 

This  work  was  partially  supported  by  Crant-iii-Aid  for 
Scientific  Research  01302031  from  the  Ministry  of  Edu¬ 
cation,  Science  and  Calture  in  Japan 
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Fig. 2  Discretization  of  The  Block 
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Fig.3  Computed  Eddy  Currents,  Frequency  *  50  Hz 

■  !•*  «/•*  m  W*  «/.’ 


Fig. 4  Computed  Eddy  Currents,  Frequency  ■  100  Hz 
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Introduction 

In  electromagnetic  field  computation,  the  magne¬ 
tic  scalar  potential  is  usually  employed  in  the  air 
space.  However,  the  use  of  a  scalar  potential  brings 
the  multi-valued  problem  when  the  studied  domain 
is  multiply-connected.  The  cutting  surfaces  or  the 
cutting  lines  should  be  introduced  for  solving  this 
problem  [1][2]. 

It  has  been  shown  that,  in  a  hybrid  finite  element 
-  boundary  integral  formulation  in  terms  of  electric 
field  [3],  to  avoid  the  multi-valued  problem  due  to  the 
-scalar  potential,  one  can  take  the  electric  field  e  as 
working  variable  in  the  conducting  region  as  well  as 
in  the  air  space.  The  electric  field  in  the  air  space  is 
expressed  by  a  boundary  surface  current  distribution. 
The  use  of  a  spanning  tree  technique  [4]  permits  to 
ensure  the  divergence  free  of  the  surface  current  den¬ 
sity. 

In  this  paper,  we  show  that  the  multiply  connected 
problem  can  also  be  solved  in  a  magnetic  formulation, 
provided  that  the  vector  variable  h  is  used  as  working 
variable  instead  of  a  scalar  potential  in  the  air  space. 
The  magnetic  field  h  in  the  air  can  be  expressed  by 
a  boundary  surface  current  k  as  we  do  in  the  electric 
formulation.  The  vector  variables  h  and  k  on  the 
boundary  arc  discrctiscd  by  boundary  edge  elements. 
The  cmploycmcrit  of  the  boundary  tree  technique  [4] 
ensures  the  curl-free  of  h  and  the  div-frec  of  k.  The 
number  of  boundary  unknowns  is  associated  with  the 
number  of  branches  of  the  tree  which  has  the  same 
order  as  the  boundary  nodes.  Comparing  with  the 
eletric  formulation,  the  number  of  boundary  unkowns 
is  less  important. 

Hybrid-Magnetic  FormnlMion 

Solving  weakly  the  Faraday’s  law  curl  e  =  -db/dt, 
we  get  the  following  variational  formulation  in  terms 
of  magnetic  field  h  : 

Jn  ^curl  h'  •  curl  h  dft  +  —  Ja  ;i  h'  ■  h  d(l 

+^,h'-n  x  erdr  = -^.h' -u  x  eo</T  (1) 

where  fl  is  the  conducting  region  bounded  by  F,  a 
and  it  arc  the  conductivity  and  the  permeability  of 
the  conductor,  li'  is  a  test  function.  In  exterior  re¬ 
gion,  the  electric  field  e  is  the  sum  of  a  reduced  field 
c,  and  a  source  field  co  due  to  the  excitation  current. 

The  volume  integral  of  cq.(l)  is  discrctiscd  by  the 
tetrahedral  edges  elements  as  in  (5]. 

The  exterior  region  of  H  is  taken  into  account  by 
the  surface  integral  term  on  the  boundary  F.  Instead 


of  introducing  a  magnetic  scalar  potential  in  the  air 
space  [5],  we  use  the  reduced  magnetic  field  hr  as 
unknown  variable.  Eq.(l)  can  be  solved  if  a  relation¬ 
ship  is  established  between  'he  tangential  component 
of  er  and  the  tangential  component  of  h,. 

Introducing  a  surface  current  distribution  k  on  T, 
nxe,  and  n  x  br  at  an  any  point  X  on  T  arc  given 
by 


.  .  d  r  nx  x  1 

n,xe,(x)  =  -^-/r  — 


|x-y| 


-dT 


(2) 


and 


n,  x  h,(x)  =  -k(x)  4__ 


1/ n, 

lx  dr 


x((x-y)xk(y)) 


|x-yp 


dV 

(3) 

where  n,  is  the  outward  unit  normal  at  the  boundary 
point  x  and  |x  -  y|  is  the  distance  between  x  and  a 
boundary  point  y. 

Eqs.(2)  and  (3)  arc  discrctiscd  by  a-variational 
method  by  multiplying  test  functions  h'  to  (2)  and 
k'  to  (3).  Elimilating  the  surface  current  density  k 
from  (2)  and  (3)  after  the  boundary  discretisation, 
we  relate  er  and  hr  by  a  matrix  equation. 

Substituting  er  by  hr  in  the  finite  element  ma¬ 
trix  equation  obtained  from  eq.(l),  we  get  the  whole 
matrix  system.  The  degrees  of  freedom  are  the  circu¬ 
lation  of  the  magnetic  field  on  the  inner  edges  of  the 
conductor  and  on  the  independent  boundary  edges 
determined  by  the  spanning  tree  technique. 


The  boundary  of  the  studied  domain  is  meshed  by 
triangles.  The  reduced  magnetic  field  on  the  boun¬ 
dary  is  approximated  by  using  the  triangular  edge 
elements  preserving  the  tangential  continuity.  In  a 
triangle,  we  have 


br 


3 

~  2  wmhm 

m= 1 


(4) 


where  the  unknown  coefficient  hm  is  the  circulation  of 
h,  along  the  edge  m.  hm  =  fm  hr  •  I  dl  and  wm  is  a  vec¬ 
tor  interpolation  function  defined  by  thi  barycenter 
coordinates  Apr 


wm  =  Ar,  grad\r ,  -  Ar,  grad\u  (5) 

with  i,  j  the  two  extremities  of  the  edge  m. 

Similarly,  the  surface  current  density  in  a  triangle 
is  intcrplolated  by 
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k=  £  v„ 

m=  1 


(6) 


with  measured  ones  in  the  Fig.l.  A  good  agreement 
between  the  calculated  and  measured  values  is  ob¬ 
served. 


where  the  unknown  coefficient  is  the  line  integral 
of  k  along  the  edge  m :  km  =  Jm  k  •  s„  dl  with  sm  the 
unit  vector  in  the  plane  of  the  triangle  normal  to  the 
edge  m,  and 

vra  =  n  x  (Xu  gradXrj  -  ‘Xrj  gradXfi)  (7)  • 


The  coefficient  ^  represents  in  fact  the  surface  cur¬ 
rent  across  the  edge  tn.  The  so  defined  triangular 
edge  element  ensures  the  normal  continuity  of  k  on  a 
common  edge  of  two  adjacent  triangles  if  the  normal 
vector  sm  is  well  defined. 


The  previously  described  triangular  edge  elements 
provide  the  tangential  continuity  of  h  or  the  normal 
continuity  of  k,  but  not  the  curl-free  of  h  or  the  div- 


Calculated 
Fora,  in  E 


Calculated 
Fora,  in  H 


free  of  k. 

The  curl-free  of  h  and  the  div-free  of  k  on  the 
boundary  surface  is  equivalent  to  the  zero  curvilin¬ 
ear  integrals  on  a  closed  simply  connected  path  (the 
closed  path  do  not  link  the  conductor  and  do  not 
around  the  hole  of  the  multiply  connected  domain): 

fh  ■  ldl  =  £  ■  Idl  =  £  =  0 

fk-sdl  =  ^J^k-smdl  =  £A‘ra  =  0 

The  above  expressions  mean  that,  on  the  triangular 
mesh,  only  a  set  of  edges  is  independent. 

It  can  be  shown  that  the  number  of  independent 
edges  is  the  number  of  branches  of  a  tree  plus  two 
times  the  number  of  holes  of  a  multiply  connected 
region.  An  automatical  tree  generation  algorithm  is 
dcvclopped  to  identify  the  independant  edges  [4). 


The  following  example  is  a  benchmark  problem 
of  TEAM  Workshop  [6).  It  concerns  the  analysis  of 
eddy  currents  in  a  asymmetrical  aluminum  plate  with 
a  hole  when  the  excitation  coil  is  supplied  with  a 
sinusoidal  current. 

We  solve  this  problem  by  the  described  hybrid  for¬ 
mulation  in  terms  of  magnetic  field  h.  The  conduc¬ 
tor  is  meshed  by  2367  tetrahedra  with  3471  edges. 

The  boudary  mesh  lias  452  nodes,  906  triangles  and 
1356  edges.  The  number  of  degrees  of  freedom  on  the 
boundary  of  the  domain  is  the  number  of  independent 
edges  which  is  equal  to  453. 

Tlie  samc.problem  with  the  same  mesh  has  been 
solved  by  the  electric  formulation  (7J,  where  the  boun¬ 
dary  unknowns  arc  associated  with  the  boundary 
edges  which  is  equal  to  1356.  The  boundary  matrix 
calculation  is  more  consumed  in  this  case. 

The  z-componcnt  flux  density  along  a. line  (y  = 

72  mm,  2  =  34m'm)  obtained  by  two  dual  methods 
(magnetic  and  electric  formulations)  are  compared 

1625 


Fig.l.  z-componcnt  flux  density  (real  part) 
along  the  line  y  =  72 mm,  z  -  34 mm. 

Conclusion 

In  a  hybrid  magnetic  formulation,  the  multiply 
connected  -problems  can  be  solved  by  using  the 
magnetic  field  as  unknown  variable  in  both  the  con¬ 
ducting  regiou  and  the  air  space.  The  use  of  the 
boundary  spanning  tree  technique  permits  to  ensured 
the  curl  free  of  the  magnetic  field  and  leads  to  a  re¬ 
duction  of  boundary  unknowns.  The  numerical  ex¬ 
ample  shows  the  validity  of  the  method. 
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ABSTRACT:  The  analysis  of  electromagnetic  inlets  under  the 
Influence  of  eddy  currents,  in  particular  shielding  structures,  is  a 
well  Inown  and  developed  art  as  of  today  [1-4J.  Design  synthesis 
methods  as  opposed  to  analysis  methods  have  recently  emerged  as  a 
viable  tool  in  Pu  optimisation  of  device  performance  in  order  to 
meet  specified  goals .  This  paper  applies  finite  element  optimisation 
techniques  and  a  newly  developed  geometric  parametrization  method', 
to  the  shape  optimisation  of  electromagnetic  shielding  systems. 


INFINITE  ELEMENTS  AND  OPTIMIZATION  TECHNIQUES 


In.  tbe  optimal  design  of  shielding  device*  we  generally  .-want  to 
achieve  a  desired  Jinx  density  distribution  within  the  shielded 
region  by  appropriately  varying  the  design"  parameters  such  as  the 
material  and  geometric  properties  of  tbe  shield  itself.  For  this 
purpose  an  object  function  F  is  defined,  which  is  at  it*  minimum 
when  the  design  of  the  shield  is  optimal.  For  example,  the  object 
function 

F.£(Bi-B0pi.,)J  O) 

i 

describes  the  deviation  of  the  computed  flux  density  B,  as  obta’ned 
for  a  given  trial  geometry,  from  the  desired  field  distribution  B0p{ 
at  given  sampling  points  i  within  the  shielded  region. 

To  approach  the  minimum  of  the  object  function  systematically, 
gradient  methods  have  been  found  to  perform  fastest  (S).{6).  In  the 
course  of  these  powerful  gradient  methods,  the  optimum  is 
approached  in  successive  line  searches:  at  each  iteration  the 
improved  parameter  vector  p‘+*  it  found  from  the  previous  one  by 
minimizing  the  object  •  along  a  search  direction  S,  performing 
essentially  a  search  foe  the  ideal  value  of  the  scalar  a 

P**I  *  p'+aS  (2) 


Which  way  the  search  direction  itself  is  chosen,  depends  on  the 
gradient  method  emplojed  (conjugate  gradient  or  steepest  descent 
method),  but  it  basically  relies  on  the  gradient  of  the  object  function 
f  in  eq.  (1)  with  respect  to  all  parameters  p.  In  ' this  essential  part  of 
the  optimization  procedure,  the  gradient  dF/dp  is  obtained  as 
summation  of  terms  of  the  following  form 
dF  3F  dFdB  SB 


dp  *  dp  *  3B3p  *  2(!>  ■  Bopi)  Sd 


Sp 


(3) 


Here  the  first  term  of  the  partial  derivatives  generally  vanishes  due 
to  the  nature  of  tbe  object  function,  which  does  not  explicitly 
depend  on  p.  The  magnetic  flux  density  B.  related  to  tbe  magnetic 
vector  potential  A  by  the  differential  operator  “rot Vi*  expressed  in 
the  finite  element  discretization  within  an  clement  in  terms  of  the 
first  order  differentiation  vectors  (d)  acd  the  nodal  potential  values 
(A)  as  B  ■  {d)t(A).  IS).  Thus  the  parameter  sensitivity  of- the- flux 
density  within  a  finite  clement  is 


8B  3fdH 
dp  "  dp 


(A)  ♦  (d)* 


a(A) 

ap 


(4) 


with  the  first  term  as  the  sensitivity  of  the  first  order 
differentiation  vectors  and  the  second  one  as  the  sensitivity  of  the 
nodal  values  of  the  magnetic  potential.  In  general  we  specify  the 
desired  field  values  Bopj  in  the  shielded  region,  which  is  not  part  of 
the  parameter  dependent  shielding  structure  <  itself,  so  that  the  first 
term  vanishes.  The  gradient  of  the  object  function  thus  merely 
depends  on  the  parameter  sensitivity  of  the  magnetic  potential'  A. 
This  potential  sensitivity  may  be  evaluated  directly  from  the  finite 
element  solution  without  the  need  for  an  additional  field  solution 
P!«  In  tbe  case  of  eddy  current  analysis,  the  complex  valued  finite 
element  equation 


([R(p.x.rt)  ♦JlQ(«,o.«.rt))  (A)  .  (P)  {A)  .  (I)  (5). 


as  derived  front  the  diffusion  equation  for  the  magnetic  vector 
potential  A.  is  differentiated  with  respect  to  the  design  parameters 
p  to  yield 


The  coefficient  matrix. (P)  has  been  previously  assembled  during  the 
standard  field  solution  of  cq.  (5).  In  applying  Cholcski 
decomposition  schemes  as  matrix  solver s,  we  can  use  this 
decomposed  matrix  (PJ;-  the  only  computational  effort  for  obtaining 
the  many  potential  gradients  is  then  just  tbe  assemblage  of  the  right 
hand  side' and  forward  elimination  and  back  substitution  for  each 
parameter  (5.8J. 

The  assemblage  of  the  right  band  side  can  be  classified  into  tbe.  two 
cases  of  material  optimization  and  shape  optimization.  In  the  first 
case  of  material  properties  as  parameters,  tbe  partial  differentia)  of 
the  material  independent  source  term  vanishes.  The  explicit 
dependence  of  the  coefficient  matrix  on  the  material  parameters 
renders  straightforward ’partial  derivatives  (7).  It  is  the  second  case 
which  is  'of  special  interest  in  this  paper,  where  the  optimization 
parameters  are  used  to  describe  the  shape  of  the  geometry  which  we 
want  to  optimize.  Then  the  derivatives  on  the  right  hand  side  of  (6) 
have  to  be  evaluated  for  each  element  as 


with  x|  as  the  spatial  coordinates  of  all  element  nodes.  The 
derivatives  of  the  dement  matrices  with  respect  to  the  locations  of 
the  vertices  follow  directly  from  the  finite  element  matrices  using 
the  first  order  differentiation  vectors  (7).  It  is  the  calculation  of  the 
gradient  of  the  nodal  locations  that  arises  in  shape  optimization  and 
that  requires  further  investigation,  as  outlined  in  the  following 
section. 

2)f»EQMEIRY  PARAMETRIZATION  AND.  DESIGN  .MODEL 

The  successful  application  of  gradient  algorithms  for  the  process 
of  shape  optimization  is  frequently  jeopardized  by  discontinuities 
in  the  object  function.  As  the  geometry,  of  the  device  changes  m  the 
iterative  search  for  the  minimum,  a  new  finite  dement  mesh  is 
necessitated  for  each  new  geometric  parameter  value.  In  employing 
free  meshing,  topologically  different  meshes  arise,  each  one 
representing  a  possible  domain  discretization  with  an  inherent 
error.  This  change  of  mesh  topology  and  discretization  error  occurs 
in  discrete  steps  and  leads  to  discontinuities  in  the  object  function. 
The  now  just  piecewise  continuous  object  function  obstructs  a 
smooth  convergence  towards  the  minimum  (9J.  Additionally,  we  have 
to  acknowledge  that  especially  in  eddy  current  analysis  sharp 
changes  of  field  strengths  within  the  often  relatively  small 
penetration  depths  have  to  be  modeled,  so  that  any.  finite  element 
mesh  is  an  approximation,  often  relatively  crude.  This  holds 
particularly  for  3D  analyses,  where  one  knowingly  accepts  a  larger 
approximation  error  in  order  to  keep  the  computational  effort 
affordable  (3). 

The  basic  requirements  for  the  successful  application  of  gradient 
methods,  however,  are  that  the  object  function  gradients,  the  mesh 
distortions,  and  the  finite  clement  discretization  errors  vary 
sufficiently  smoothly.  As  tbe  need  for  parametrization  is  rather  to 
describe  a  variation  to  a  given  geometry  than  to  define  it.  it  has  been 
shown  (101  that  all  requirements  are  ideally  fulfilled,  when  the 
finite  element  mesh  for  new  geometric  parameter  values  is  obtained 
through  a  mapping  technique,  which  is  based  on  similarities  with 
optimization  problems  in  civil  engineering  [II].  In  the  course  of 
applying  this  method,  we  enclose  the  area  to  be  parametrized  and 
some  surrounding  domain  within  a  .subregion,  for  which  we  perform 
a  structural  finite  element  analysts  (12],(13h  The  solution  of  the 
displacement  vector  (u)  resulting  from  applied  forces  (F)  to  a 
structure  described  by  the  stiffness  matrix  (KJ  is  obtained  from 

(K1  (u)  -  (F)  <8> 

With  this  solution  for  the  displacement  (u|  the  spatial  coordinates 
(x)  of  the  structure  deflected  from  us  initial  shape  (x0)  are  given  as 

(x}-{xo)+{u)  (9) 
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If  we  do  not  'excite*  the  structural  displacements  by- applied  forces 
to  the  structure,  but  rather  by  specified  displacements  u,  at  the 
surfaces  of  the  design  geometry,  we  obtain-  the  unknown 
displacements  by  appropriate  partitioning  of  the  stiffness  matrix  as 
solution  to 

IXinJ  (Uu)  -  [KuslM  (10) 

The  applied  surface  ^displacements  (u*)  are  now  employed  as 
optimization  parameters  p.  The  solution  of  eq.  (10)  is  then  basically 
the  mapping  of  the  parameters  as  surface,  displacements  u*  onto  the 
whole  structural  solution  -  domain,  giving  the  structural  deformation 
and  hence  the  smooth'  deformation  of  the  finite  element  mesh  for 
each  new  parameter  value  p.  The  solution  of  eq.  (10)  readers  a 
displacement  space,  from  "which  we  can  deflect  and  distort  our  finite 
element  mesh  during  the  parameter  changes  of  a  line  search  by 
linear  superposition,  replacing  the  need  for  free  meshing.  In 
addition,  if  we  choose  unit  applied  surface  displacements  uj,  the 
sensitivities  of  the"  nodal  coordinates  are  also  given  as' the  solution 
of  eq.  (10)  itself 


3xj  9(xo»a)  Au 

a?  “  ap  "  Ap 


Au  ■  u« 


(ID 


Thus,  for  the  cost  of  an  additional  structural  solution  per  lice 
-search  with  a  model,. containing  just  the  regions  of  variable  geometry, 
the  required  smooth  changing  discretization  error  Is  achieved. 
Furthermore,  smooth  shape  outlines  are  obtained  even  with  only  few 
optimization  parameters,  as  will ’-be  shown  in  the  following  example. 


For  the  purpose  of  demonstrating  the  methodologies  described  in  the 
previous  sections,  we  Investigate  the  - shape  optimization  of  an 
electromagnetic  shield.  Due  to  the  eddy  currents  induced  in  the 
shield,  the  magnetic  field  is  prevented  from  penetrating  into  the 
region  behind  the  shield.  In  this  example,  the  shield  itself  is 
interrupted  by  slots,  like  cooling  slots  in  the  housing  of  an  electric 
device.  The  field,  as  originating  from  the  source  coil  oa  the  left  side, 
shall  not  exceed,  or  in  the  case  of  minimal  shield  volume,  be  exactly 
the  desired  flux  density  of  001  T  at  the  right  side  of  the  shield 
(Fig.  I).  Without  using  a  shielding  at  all.  the  flux  densities  along 
the  line  of  object  evaluation  arc  in  the  range  of  0.12  to  0.15  T.  As 
initial  guess  for  the  shape  of  the  shield  a  rectangular  crossectlon  is 
chosen,  with  a  thickness  of  twice  the  penetration  depth  and  a  height 
equal  to  'the  distance,  along  which  the  desired  flux  density  is 
specified.  7  parameters  are  used  to  modify  this  shape,  where  the  7th 
one  is  the  uniform  displacement  of  the  top  shield  face.  The  location 
of  the  right  shield  face  and  the  location  and  width  of  the  slots  are 
fixed.  The  optimum  shape  of  Fig.  2  has  been  reached  within  only  14 
conjugate  gradient  line  searches  without  the  problem  of 
encountering  discontinuities  m  the  object  function.  As  expected,  the 
shield  crossection  narrows  towards  the  top,  where  the  need  for 
shielding  reduces.  Above  the  slot  at  the  line  of  symmetry,  the  shield 
crossection  approaches  a  thickness  of -zero,  so  that  this  slot  may  be 
manufactured- in  fact  wider  without “ violation  of  the  object  function. 
vThe  flux  density  within  the  shielded  region  approaches  the  desired 
value  closely  (Fig,  3), 


4)  SUMMARY’ 

A  robust  and  reliable  parametrtzation  method  has  been  presented 
for  the  shape  optimization  of  eddy  current  devices.  The 
discontinuities  in  the  object  function,  due  to  the  finite  element 
discretization,  are  eliminated  by  a  structural  mapping  technique, 
which  it  especially  useful  for  geometries  of  high  complexity. 
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Fig.  1)  Problem  definition  and  starting  geometry  of  the  shield 


Fig.  2)  Shield  geometry  and  contour  lines  of  the  magnetic  potential 
A  at  flie  optimum  shield  design  (contour  lines  displayed 
only  in  the  vicinity  of  the  shield) 


Fig.  3)  Flux  density  distribution  along  line  of  object  evaluation 
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Abstract  -  Two  different  formulations  foe  computation  of  eddy 
current  fields  are  presented.  These  are  representative  of  methods  that 
use  potential  functions  on  the  one  hand  and  field  variable  on  the 
other.  The  two  methods  also  contrast  the  finite  element  method  and 
the  coupled  finite  element,  boundary  element  methods.  In  terms  of 
approximations,  one  uses  standard  nodal-based  elements  while  the 
other  the  vector  (edge)  elements.  Results  typical  to  those  obtainable 
from  these  formulations  arc  shown. 


>  INTRODUCTION 

The  computation  of  3D  eddy  currents  is  normally, done  by  one  of 
two  basic  methods.  The  first  defines  potential  functions  to  represent 
the  magnetic  field(lJ.’Tne  most  common  method,  is  the  use  of  the 
magnetic  vector  potential  A  (defined  as  B=VXA).  To  this,  a  scalar 
function  is  often  added.  The  divergence  condition  on  the  vector  A  is 
specified  through  the  Coulomb  or  Lorenz  gauges.  In  essence,  this 
method  is  a  direct  extension  of  the  two  dimensional  formulations 
where  the  use  of  the  magnetic  vector  potential  is  almost  universal. 
The  advantage  of  using  A  in  the  two  dimensional  case  is  obvious: 
since  A  has  only  one  component,  it  is  considered  to  be  a  scalar, 
resulting  in  a  single  degree  of  freedom  per  node  in  the  finite  element 
mesh.  In  the  3D  case,  three  components  must  be  used  to  which,  in 
most  cases,  a  scalar  potential  must  be  added,  resulting  in  four 
degrees  of  freedom,  per  node. -These  formulations  use  almost 
exclusively,  first  or  second  order  scalar  (nodal-based)  finite 
elements. 

An  alternative  method  is  to  use  the  field  variables  B  or  E 
directly[2J.  In  this  form.  Maxwell’s  equations  are  written  in  terms  of 
one  of  the  variables  and  these  are  approximated  over  a  finite  element 
mesh.  Because  of  the  continuity  requirements  for  the  field  variables, 
vector  finite  elements  are  more  attractive  than  scalar  elements.  These 
elements  guarantee  tangential  continuity. 


to  use  the  current  continuity  equation  VJ5=f)  as  a  constraint  The 
field  equation  together  with the  constraint  equation  arc 

lVxVxA=J,-o(|^+VV)  V.(o?i+oVV)  =  0 

H  .«  3t  (3) 

The  divergence  of  A  is  enforced  using  Coulomb's  gauge  (V.A=0). 
With  this  equation  (3)  becomes  ‘ 

-J-V?A  =J,-o(^-+W)  V.(o~+oVV)=0 

P  dt  dr  (4) 

Equation  (4)  is  the  governing  equation  for  the  constrained  A-V 
formulation.  ' 

This  formulation  can  be  ihodified  to  kebunt  for  other  phenomena. 
First,  since  displacement  currents  have  been  deleted,  the  equation  in 
(4)  cannot  account  for  propagation  aspects  of  fields.  By  adding  the 
displacement  currents  back  into  the  formulation,  the  basic  equation  in 
(4)  can  be  used  to  compute  resonant  frequencies  in  cavities  as  well  as 
los$e$(3).  The  formulation  new  looks  as: 

V*A  «  pj,  -  co2pcA  -  op(j(oA+VV)  t  V.  (jooA  +  oVV)  *=  0  /jj 

Another  modification  consists  of  adding  a  velocity  term  to  equation 
(4).  This  yields  an  eddy  current  formulation  with  velocity  effects  that 
allows  the  computation  of  fields  of  moving  devices[4J. 

In  considering  the  inclusion  of  velocity  terms  in  the  eddy  current 
equations,  the  total  electric  field  is:  E=uXVXA-jcoA-VV,  where  u  is 
the  velocity  vector.Thc  governing  equations  with  the  velocity  term 
arc: 


Two  formulations,  one  a  finite  element  method  based  on  the 
magnetic  vector  potential  and  one  a  coupled  finite  clcment-bourxlary 
integral  formulation  based  on  E  (or  H),  as  representative  of  the  many 
formulations  that  exist,  arc  outlined  here. -The  first  uses  a  first  order 
(bnck)  nodal-based  finite  element  while  the  second  uses  a  first  order 
(tetrahedral)  vector  (edge)  element  and  a  corresponding  triangular 
edge  element  on  the  surface. 


FORMULATIONS 

Using  the  magnetic  vector  potential  A,  the  curl-curl  equation 
representing  eddy  current  phenomena  is 


Vx-l-VxA  -  V-^V.A  *  J,  +  cuxVxA  -  icoaA  -  oVV 

H  »  (6a) 

V.(cuxVxA  -  jcooA  -  oW]  **  0  (65) 

The  solution  of  these  equations  (4),  (5),  or  (6)  leads  to  the  correct 
field  quantities  based  on  the  general  field  representation  at  any 
frequency.  The  magnetic  flux  density  is  calculated  from  VXA  and 
the  electric  field  from  E*-(jo)A+VV)  or  Eo(uXVXA-jcoA-VV). 
Other  quantities,  such  as  the  Q-factor  of  a  lossy  cavity  can  be 
computed  from  the  fields. 


jLVxVxA  *J,  +  Je 


0) 


where  Js  is  the  source  current  density  and  Jc  is  the  induced  eddy 
current  density.  Displacement  currents  arc  normally  deleted  from 
eddy  current  formulations.  The  eddy  currents  arc: 


A  second  method,  based  on  the  field  variable  E  (or  alternatively, 
II)  is  to  solve: 

VxE  ■  -jcopH  t  VxH»jcoerE  wherte**€+<,/i<d  (7) 
or,  by  eliminating  II,  we  get  for  E: 


(2) 

where  V  is  the  electric  scalar  potential.  In  eidy  current  problems, 
because  eddy  currents  are  not  arbitrary,  it  is  important  to  restrict  the 
eddy  currents  m  the  solution  region.  This  can  be  done  by  introducing 
a  constraint  equation.  A  simple  way  to  constrain  the  eddy  currents  is 


Vxr^-VxE  +  itoeE  «  0 

m  (8) 

In  this  form,  both  eddy  currents  and  displacement  currents  are  taken 
into  account  through  the  complex  permittivity  e'  However,  since  a 
coupled  nvtthod  is  sought  we  assume  that  the  source  is  external  to  the 
solution  domain.  A  weak  form  of  this  may  be  written  as 
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j  (jirVxE}^XW“'dV+  J  icoe‘E-w“dv=j  (nxH).wnds 

where  wm  are  a  set  of  vector  weighting  function i,  The*'  veighting' 
functions  are  the  vector  element  shape  function$[5J  ana,* together 
with  the  Galeridn  method  for  discretization  of  the  weak  form  provide, 
a  method  for  solving  the  eddy  current  problem.  The  current  density. 

-  on  the  surface  J=nXH,  is  used  as  a  nonlocal  boundary  condition  to 
couple  the  boundary  integral  method  on  the  surface  and  the  finite 
element  method  in  the  volume.  Using  the  approximations  EsEEfcWfc 
and  nXHsJXnXHJkwk^rFkwic  for  an  element  k  in  the  volume, 
and  E=XEif|  and  nXH=Z(riXH)ifi=IFifi  for  a  boundary  element  1 
on  the  surface  of  the  solution  domain,  provides  two  coupled 
equations: 

(A](E)  =[B]{F}<  fa.'the  volume  mesh  (10) 

[C](FJ  =**{E'}  +{D](E}'  for  the  surface  mesh  (II) ' 

where  (E1)  is  the  incident  electric  field  on  the  surface  of  the  solution 
domain.  Solution  of  this  coupled  system  provides  a  method  for  an 
coupled  FEM-BEM  solutioa 


RESULTS 

As  examples  to  the  use  of  these  two  formulations,  two  problems 
are  solved.  The  first  consists  of  an  aluminum  bar  2.54cm  X  2.54cm 
X  20.32cm,  with  a  coil,  8.128cm  long  and  0  685cm  thick,  at  the 
center  of  the  bar.  The  bar  has  conductivity  of  3.54X1 07S/m.  The 
losses  due  to  eddy  currents  in  the  bar  are  calculated.  The  results, 
shown  in  table  1  are  obtained  using  equation  (3)  and  compared  with 
experimental  data  given  in  (6J. 


Table  1.  Losses  in  the  aluminum  bar  for  different  currents  in  the  coil. 


While  the  results  for  1A  agree  well,  the  results  for  higher  currents  do 
not.  This  is  due  to  the  hearing  during  the  experiment,  as  was  already 
pointed  out  in  (6J. 

The  second  problem  is  shown  in  figure  1.  A  perfectly  conducting 
cube,  of  elcctnc  size  ks*2,  where  s  is  the  side  width,  is  placed  in  an 
incident  field  with  the  electric  field  in  the  z  direction,  propagating  in 
the  +y  direction  as  shown.  The  looping  surface  current  on  the  surface 
along  the  line  abed  is  shown.  This  particular  problem  was  solved  for 
two  reasons:  one  is  because  comparison  data  is  available  in  [7J  and 
the  second,  because  ^demonstrates  the  capability  of  the  method  to 
calculate  scattering  problems.  The  same  problem  can  be  solved  with  a 
finite  conductivity,  or  with  a  lossy  dielectnc  replacing  the  conductor, 
The  only  difference  is  the  value  of  the  complex  dielectric  e* 


i  b  '  '  c  d 


Figure  1.  Magnitude  of  looping  current  in  a  conducting  cube. 


CONCLUSIONS 

The  formulations  and  results  presented  here  are  representative  of 
eddy  current  problems.and  their  formulation  using  finite  elements. 
While  no  particular  method  can  be  claimed  to  be  better  than  the 
others,  each  has  its  own  advantages.  However,  the  use  of  vector 
shape  functions  seems; to  be  more  efficient  and  better  suited  for 
computation  of  vector  fields. 
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Abstract 

The  suitability  of»a  3D  finite  element  code  (MEGA)  for 
modelling  electromagnetic  compatibility,, problems  (EMC),  in 
the  frequency  range  up  to  40  kHz  is  assessed.  Results  from  a 
simple  shielding  experiment  are  compared  with  predictions. 

Introduction  * 

The  problem  of  shielding  electrical  equipment  from 
electromagnetic 'interference  has  always  been  important,  but 
new  urgency  has  been  introduced  by  the  need  to  comply  with 
a  recent  EEC  directive*  82/499/EEC ‘which  ‘states*' Thar  'A 
permit  is  required  to  operate  any  equipment  or  installation 
producing  electromagnetic  oscillations  from  10  kHz  to  3000 
GHz*. 

Switching  frequencies  of  20  -  40  kHz  have  become  common 
in  power  electronics  equipment.  A  method  for 'modelling  the 
shielding  for  such  apparatus*  is  obviously  important;  -This 
paper  describes  work  done  to  validate  the  use  of  a  3D  finite 
element  electromagnetics  program  (MEGA)  Tor  modelling 
shielded  equipment  in  this  frequency  range.  An  experiment 
consisting  of  a  coil  shielded  by  copper  sheets  arranged  on  the 
faces  of  a  cube  was  devised  and  measurements  were  taken  of 
fields  outside  the  cube  at  various  frequencies  from  5  to  40 
kHz. 

Theory 

The  method  described  here  [1)  allows  nonmagnetic 
conducting  sheets  which  are  thin  compared  to  their  other 
dimensions  and  of  arbitrary  3D  shape  to  be  modelled 
economically,  the  use  of  standard  volume  finite  elements  to 
model  the  sheet  is  of  course  possible  but  .very  uneconomic  if 
the:  usual  requirements  for  aspect  ratio  of  elements  is  to  be 
met. 

Non  conducting  regions  surrounding  the  sheets  are  modelled 
using  total  or  reduced  magnetic  scalar  potentials  y«  so  a 
Laplacian  type  equation  of  tne  form  div  v  grad  \p  arises  (from 
div  B  »  0), 

If  eddy  currents  flow  in  a  conductor  which  is  thin  compared 
to  the  skin  depth  of  the  material,  the, distribution  of  current 
can  be  taken  to  be  uniform  across  the  sheet.  If  we  define  a 
surface  current  K  as  K  *  Jd,  where  d  is  the  thickness  of  the 
sheet  then,  as  K  vanes  only  along  the  sheet  surface.  K  can  be 
represented  by  means  of  a  scalar  quantity  T  which  exists  only 
on  the  sheet  surface; 

K  .  |.(tndT*ii)  (i) 

In  the  above,  n  is  the  normal  to  the  sheet  and  the  |j  term 
is  introduced  in  order  to  achieve  symmetry  in  the  final  matrix. 


Fig  *e  1  shows  a  typical  sheet  conductor,  surrounded  by  the 
magnetic  scalar  potention  <p,  If  the  potentials  on  either  side 
of  the  sheet  are  labelled  <p.  and  v>2.  at  an  arbrary  point  on  the 
sheet,  since  *H,  xa  ♦  xn  -  K.  then: 


n 


figure  I  A  typical  thin  sheet 


grad  V,  x  n  ♦  grad  xp2  x  n  -  ^  (grad  T  x  n)  (2) 


dividing  by  ad  and  taking  the  curl  of  equ  (2)  yields; 
curt  ~  (grad  V,  x  n  •  grad  ip2  x  “)  * 

curt^f  (grad  T  xn)  ja  (3) 


From  equation  (I)  and  since  curl  E  ■  -  ^ ,  then: 

curl  (grad  T  x  n)  a  -  -  B  ja  -  jq  jji“M 4) 

In  order  to  implement  this  method,  equation  (3)  must  be 
solved  on  a  conducting  sheet,  together  with  a  Laplacian  type 
equation  m  the  surrounding  non  conducting  volume. 

The  right  hand  side  of  equ  (4)  appears  in  a  Galerkin 
treatment  of  div  n  grad  ip,  so  this  term  links  v>,  and  v>2  to  T, 
while  yielding  continuity  of  B.n  across  the  sheet. 

Figure  2  shows  the  experimental  apparatus.  Six  copper 
sheets  (2S0  x  2S0  x  0.55  mm)  are  symetncally  arranged  on 
the  faces  of  a  cubic  (300  x  300  x  300  mm)  wooden  box.  The 
copper  sheets  (conductivity  35.5  x  106  S/m)  were  not  joined 
at  the  edges  for  this  test  as  the  resistivity  of  the  joints  would 
be  difficult  to  quantify.  Inside  the  box  at  the  centre  is  a 
solenoidal  coil  (inner,  outer  radii  20  and  40  mm,  thickness  15 
mm),  made  from  stranded  wire  and  carrying  100  A  turns. 
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coil  at  the  cube 


8S*Hi  RESULTS 
♦  18  «  Wf  RESULTS 
O  31  RESULTS 
X  49  Wc  RESULTS 
—  fijwc  dement 


line  1  from  (0.03, 0.125, 0.185)  to  (0.21. 0.125, 0.185) 
line  2  from  (0. 1 1 5, 0.1 575, 0  03)  to  (0. 1  i  5, 0. 1 575, 0.2 1 ) 


figure  3  Magnitude  of  Bx  along  line  1 


line  3  from  (0.1 15, 0.03. 0.185)  to  (0.1 15, 0.21, 0.185) 


figure  2  The  experimental  apparatus 
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Measurements  were  taken  along  the  three  lines  shown  on 
Figure  2.  Results  for  5.  10,  31  and  40  kHz  are  shown  on 
Figures  3-5.  The  calculated  values  of  B  did  not  vary  as 
much  as  the  experimental  values  with  frequency,  so  that  only 
one  representative  calculated-curve  is  shown  on  the  figures. 
The  calculated  results  tend  towards  the  5  kHz  measured 
results,  this  a  reasonable  as  the  assumption  of  the  0.55  mm 
sheet  being  thinner  than  the  skin  depth  is  more  valid  at  5 
kHz  (084  mm)  than  at  40  kHz  (0.3  mm).  The  correlation 
between  measured  and  calculated  values  is  quite  good. 
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It  has  been  shown  that  3D  finite  elements  should  be  useful  in 
low  frequency  EMC  calculations, 

llstenc.es 
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3D  eddy  current  flow  in  thin  conducting  sheets'  Proc 
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figure  4  Magnitude  of  B  along  line  2 
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figure  5  Magnitude  of  Bz  along  lme  3 
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Abstract -This  paper  describes  numerical  analysts  using  T-mcthod  and 
an  application  to  eddy  current  testing '  The  code  using  twenty-node 
isoparametric  elements  was  applied  to  a  standard  problem  proposed 
by  the  TEAM  •  Workshops  "and  the  results  were  compared  with 
measured  ones.  The  formulation  using  twelve-edge 'elements  was 
also  introduced  and  a  new  code  has  been  developed. 

,  I.  INTRODUCTION 

We  have  already  proposed  the  T-mcthod  [1],  where  the  magnetic 
scalar  potential  Q  of  T-Q  method  is  not  included.  Advantages  of  the 
T-method  are:  (1)  no  variables'^  space  and  (2)  easy  treatment  of 
external  current  and  field.  But  it  has  a  disadvantage  that  a  large  core 
memory  ts  needed  due  to  a  dense  matrix.  In  order  to  overcome  this 
difficulty,  we  have  already  proposed  ah  iterative  solution  technique 
for  the  T-method  [1],  The  code  has  been  developed  using  twenty- 
node  isoparametric  elements  and  it  was  applied  to  an  eddy  current 
testing  problem  (a  standard  problem  proposed  by  the  TEAM 
Workshops  (2})  for  two  frequencies  (500Hz,  I  kHz)  (1). 

In  this  paper  wc  show  improved  results  for  the  same  eddy,  current 
problem  and  also  present  a  new  formulation  using  twelve-edge 
elements. 

II.  EDDY  CURRENT  ANALYSIS 

Since  current  conservation  is  secured,  current  vector  potential,  T, 
is  defined  as,  J*VxT,  The  relation  between  induced  magnetic 
induction  and  current  vector  potential  is  given  like  {1), 

+  (1) 

where  Bt  and  p0  are  induced  magnetic  induction  vector  and  magnetic 
permeability. 

We  obtain  the  governing  equation  for  an  AC  eddy  current  problem 
using  imaginary  unit,/,  and  angular  frequency,  to. 

Vxivxr+/o)n0r+>j)^/,T0v^i)S'+>fl<«o  <2) 

We  must  solve  cq(2)  with  the  Coulomb  gauge  (cq.(3))  and 
boundary  condition  (eq  (4))  oh  conductor  surface. 

VxT*  0  (3) 

Txn* 0  on  conductor  surface  (4) 

In  the  previous  paper  (1)  we  solved  eq.(2)  and  satisfied  the 
Cbubmb  gage  and  the  boundary  condition  by  using  a  penally 
method.  When  we  used  a  large  penalty  number,  good  convergence 
characteristics  could  not  be  obtained  in  the  iterative  matrix  solution. 

Hence  we  here  propose  a  new  formulation  using  twelve-edge 
elements  in  order  to  avoid  such  difficulty.  The  current  vector 
potential,  T,  is  expressed  in  the  local  coordinates  (£,  rj,  £)  using  the 
shape  functions  proposed  by  K.  Sakivama  [3], 

T-{T5,rr).r?>r.|jvir,  <5) 


where  ,  Wni ,  Wt,,}r  These  shape  function;  arc  defined  like, 

fVjD-nXi-U  %  = 

%=i<i«iXi+S),  %*3<i-»iXu?).  (6> 

and  arc  expressed  in  the  same  manner.  Using  the  above 
shape  functions  the  Coulomb  gage  (eq.(3))  is  automatically  satisfied 
The  boundary  condition  (eq  (4))  is  also  easily  satisfied  by  prescribing 
the  current  vector  potential  on  the  surface  to  zero.  The  code  has  been 
developed  here  and  it  was  applied  to  a  simple  3-D  problem.  Fig.l 
shows  eddy  current  distribution  (z»00,  -00169m)  in  conductor 
when  transient  external  field  was  applied.  The  results  was  favorably 
compared  with  those  by  the  code  "ELMES"  using  twenty-node 
elements. 


Fig  1  Eddy  current  density 


«;m*  (iic) 

Fig  2  Time  variation  of  eddy  current  density 
(x=0.073  ,  y=0.0  ,  zsO.O) 
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>UL  RESULTS  AND  DISCUSSION* 

ELNfES  was  applied  to  an  ECT  problem.  Ufc  adef*cd  a  suodard 
one  proposed  in  lie  TEAM  Wxfahops  (2).  The  test  pace*  is  2 
rectangular  block  330x285x30mm  with  a  40x0-5x1  Oesa  crack  00 
the  center  of  ooe  of  (be  larger  faces.  It  ts  made  of  aastcakic  sleet 
Relative  permeability  pf  »  I  and  electrical  conductivity  o  is 
O-MxKFS/ra. 

A  differential  probe  moves  00  the  surface  of  the  block.  Tbepcobe 
is  a  cylinder  with  an  active  solenoid  and  two  smaller  receptive 
solenoids.  The  experiments  were  carried  o«2  fer  a  parallel  movement 
'to  the  crack  under  four  different  frequencies  (505Hz.TkHz,  2kHz, 
5kHz)  (4  j. 

Numerical  analysis  was  also  carried  out  foe  the  same  problem. 
For  ibe  symmetry  the  region  to  be  meshed  is  only  a  half  of  the  block. 
The  crack  is  treated  as  a  low  conductive  region  and  included  in  the 
analysts  domain.  According  to  each  cod  position,  we  used  a  different 
type  of  mesh  division.  In  order  to  get  higjj  accuracy  of  magnetic  field 
calculation  we  adapted  the  mesh  division  to  the  positions  of  cods  and 
a  crack,  and  used  the  symmetric  mesh  division  in  the  region  near  the 
coiL 

Figs34  show  the  numerical  and  experimental  results  for  two 


faytada.  T^ljhwtfejyrbniioftbcrccN.  T ’bsaccuSna 
b  the  tafcSc  is  same  2s  that  ofVatzs  [l\  as  shown  b  %5. 
Nrrrrx^  resets  of  the  Ushcaky  of  b  the  u65c  were 
obC2bcd  from  the  xefatsas  [1.5J.  The  resets  show  abaa  good 
agrrr-aacjr  with  ojerbaa]  cecs  both  b  shape  (especially  the 
locasku  of  tsrsbgpoigsof  the  signal)  and  chase  aerie  b  the  rtss&s 
ofSOOHz'aod  1kHz. 

Assets  wocldEfcc  to  acknowledge  Mr.  SKawaaara  and  Mr.  X 
Sciuawa  for  their  help  a  cibbtea.  AD  edeehtea  wg  performed 
Cray-YMP&'4128  of  tie  Sepcrcccnpcter  Center  of  the  ln«x=te 
of  Ffcid  Scko9e.Tbhoks  Uahtniy. 
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(b)  Experimental  result 


Fig3  Signal  trajectories  (500Hz) 


Fig.5  Definition  of  parameters 


Table  I  The  results  of  'Parallel  movement  to  the  crack' 
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.FAST  EVALUATION  C?  3-D  EDDY  CUS3EKT  LOSS  DISTaiSUTIGH 
IK  TRMJSFOaXZB  TAKES  ' 
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Xb^rtr*.  -  72s*  ectbor's  acdcl  end  method  5305-3D  used 
to  ;*r  ter  3-9  utlysis  of  leakage  field  distribctlos 
b£5  been  rev  developed  for  *  fest  simplified 
mlcnira  of  tfiy-carest  loss  tfistrihstioa  oa 
fstermsl  smrfece  of  tank  wall  of  large  three-phase 
power  cressferatrs.  Field  ceapoeests  H^erd  loss 
distribetioo  ore  calculated  autceati colly.  with 
Kalytic  preproetssor  end  graphic  postprocessor.  Odaj 
to  eligible  sizplifi-  cations.  only  15-20  #  oo  #  PC 
ccojsttr  ere  sufficient  for  simulation  of  one 
structure  configuration,  An  influence  of  teak 
screening  end  general  formula  for  tbs  total  power  loss 
in  task  wall  is  presented. 


I.  introduction 

Eddy  currents  induced  in  inactive  ports  of  large 
three-phase  power  transformers  produce  highly 
non-unifora  distributed  losses  and  local  heat i no. 
which  are  important  factors  of  total  energy  saving  end 
power  system  reliability.  This  distribution  has 
extremely  three-dimensional  character,  especially  when 
various  screens  and  slants  are  applied.  The  most 
popular  approach  to  eolution  of  such  problem  ie  to 
apply  3-D  finite  element  method  111.  However  full 
three-dimensional  solution.  taking  into  account 
nonlinear  magnetic  permeability,  heating,  etc.,  has 
proved  as  "too  costly  in  terms  of  both  een-hours  and 
computer  time  end  not  attractive  for  design  use"  JU-- 
K oreover.  the  3-D  FDi  approach  is  not  suitable  for 
model  ling  on  a  personal  computer,  what  is  necessary  in 
practice. 

To  overcome  this  problem  the  author’s  simplified 
reluctance  network  3-D  method  12.5J  CFig.2a).  together 
with  analytical  formulae  14)  was  applied.  Leakage 
field  of  threi-phase  power  transformer  in  Fig. 2  is 
simulated  and  resolved  with  the  help  of  the  RXM-3D 
computer  program,  containing  dedicated  analytical 
preprocessor,  general  network  solver  and  graphic 
postprocessor.  Such  an  approach  enables  to  eodol  and 
analyse  field  in  one  3-D  structural  variant  (configu¬ 
ration)  within  ca.  15  s  CPU  time  on  a  personal 
eccputcr  IBM  PC/AT.  In  the  papers  published  so  far 
(2.31  only  one  1^  coevonent  was  analysed.  In  the 
present  paper  both  1^  and  field  components  as 
well  as  resultcnt  field  -  V H^x  ♦  H^y  on  the 
Internal  surface  of  tank  wall  were  cc.ifidered.  The 
absolute  value  l^l2  1®  responsible  for  local  losses. 


11.  LOCAL  EDDY  CURRENT  LOSS  IN  STEEL  WALL 

Going  out  frcca  Maxwell’s  equations  for  sinusoidal 
coop lex  expressions  and  at  V  “  const 

7  “  Km  “  r  Eo  ond  7  “  Eo  ”  iUfi  <l) 

one  can  obtain  |4J  the  classical  formula  for  eddy 
currwnt  loss  in  a  solid  metal  wall,  W/o^ 


where  at  pi  -  const  a^  -  1,  but  at  steel  with  u  - 
/i(H)  *  const  ap  %  1.4  is  the  inward  (in  the  Z 
direction)  linearisation  coefficient  (41.  <*>  ■  2nf. 
ps  "  *Vprs*\s*  "  magnetic  permeability  on  steel 
surface;  r  -  metal  conductivity. 

To  consider  iron  nonlinearity  along  surface  ona  can 
usa  author’s  analytical  approximation  (41  of 


segaetisstica  curve  (Fig.l)  of  solid  steel 

/pr  H2  -  CjH  ♦  c .  0  3  B  1  1B3-10 2  1 

ubtr.  Cj  -  310-102  1  -a!  C2  »  7.9  . 

Ome  can  see  free*  Fig.l  that  even  more  simple 
approximation 

H  *  .1.3  cj?  -  10.27  H2  M> 

with  corresponding  error  is  possible. 


Fig.l.  Analytical  approximations.i3)  end  (4) 
of  magnetisation  curve  of  solid  steel. 
XXX  -  measured  points. 


III.  COMPUTATION  OF  3-D  FIELD  AND 

LOSS  DISTRIBUTION  ON  TANK  WALL 

Leakeys  field  in  a  three-phase  transformer  has  been 
simulated  with  the  3-D  reluctance  network  (Fig. 2a). 


V 


Fig. 2.  Three-phase  power  transformer 
e)' computational  RNM-3D  model, 
b)  investigated  example  with  Cu  or 
laminated  Fe  partial  screens. 

where  elements  were  calculated  analytically  in  the 
preprocessor.  In  irrotationa!  field  area  they  were 
calculated  from  a  geometry  of  corresponding  3-D  boxes 
(5| .  In  conducting  solid  steel  elements  -  from  complex 
value  in  1/H  (4) 

Bpl  *  (0.37  +  j  0.61) 

which  takes  into  account  nonlinear  steal  permeability 
end  eddy  currant  reaction.  Copper  or  aluminium  screens 
were  also  represented  by  corresponding  complex 
formulae  (31 .  The  model  in  Fig. 2a  has  4  »  186  nodes. 
Some  more  in  windows  than  that  in  papers  (3)  end  (6) . 
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As  ctw?lt3.  tt^cs  vle»ct  cr  with  eoetissoes  Co  or 
lcsisit*i  F#  jcreess  (Fij.S)  w*r#  considered.  K;*  cf 
tb*  screen  is  situated  fa  discrete  nod*  point*.  The 
reluctance  ca  tb*  edge  is  celcaleted  ea  parallel 
c3c^:‘.Jm  of  two  caaoscu 

JSo.-aU«»,*a  *=*  Mi  -  M 

Therefore  tbe  first  of  these  values  was  neglected.  ccd 
th*  second  -  o  little  decreased  os  3:  2pJ.  giving 
e  relatively  nsll  error.  One  rectengalar  elecent  was 
subdivided  fcto  four  small  rectangles.  Timid  end  less 
in  each  of  them  wtr#  cel  evicted  frea  the  adjacent 
ec=««!.=t*  <€„  •»«  >4r  '  w«y  tb#  Mtaorfc  km 
p.-eciically  made  feur  time  denser  than-  in  previous 
works  (3.61. 

A  total  loss  cceputaiica  should ^cecsidtr  all  kinds 
of  elements.  according  to  121.  end  corresponding 
screening  coefficients  end  p=  fer  electromagnetic 
end  magnetic  screens  respectively.  Tbo.  total  less#* 
in  V  er#  th#r#fcr# 


-  »a  jj/^l  >4  <“=  *  «p  >4  "st  ]  <» 

where  it  was  supposed 


x 


1 


/; 


0-2*10~3  p 

dCu 


TZ  * 


6p  x  1.4  17) 


d^  -  thickness  of  Cu  screen  in  o.  n  -  r.unber  of 
sh##ts  in  flat  laminated  magnetic  scr«#n.  eg.  n  -  15. 

Influence  of  screens  on  th*  timid  and  '  tosses  is 
shown  in  figures  3  to  5. 

Th#  loss  distributions  P*  In  Fig*.  3c,  4c.  5c  are 
expressed  in  relative  units  referred  to 


where 


WSl 

*5— 


(8) 


Wh  -  rated  ampere-turns  of  MV  winding;  hR  -  h/K. 
h  -  windings  height,  K  •,  1  -  (a^aj^f/tnh) 
-  Rogouski's  coefficient  -151. 


Fig, 3.  Field  components  (a),  (b), 
and  eddy  current  losses  (c)  on  tahlc  surface 
of  a  315  kVA  transformer.  without  screens. 


Fig. 5.  Field  components  1^  (a),  (b), 

end  eddy  current  losses  (c)  on  tank  surface  of 
a  315  kVA  transformer  with  the  partial  Cu  screen. 


IV.  CONCLUSION 

Application  of  the  network  approach  with  the  KNM-3D 
computer  program  has  shown  high  effectiveness  of  this 
method  at  evaluation  of  3-D  field  and  eddy  currant- 
loss  modelling  and  analysis.  A  lot  of  information  can 
be  obtained  within  seconds  on  a  personal  computer. 
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as  iterative  ten »  roe  solvisg 

TK  ISTEXSC  ttOH£H  OT  ELECTRICAL  LOG 

KXASG  2MEX  A3®  LID  JIA  01 
Dept.  of  Math..  farbia  Institute  of  Techmology.  Cl  In 


Abstract:  !■  this  paper,  the  proble*  of  electrical 
lor  ia  geophysics  Is  redaced  to  tJ»e' i averse  proble* 
of  aa  elliptic  differeatlal  e*iaatioa.  Aa  iterative 
•etbod  for  solving  the  iaverse  proble*  is  presented. 

1  JSTRODCCTItW 

Ike  technique  of  eletrical  lor  is  very  iaportaat  to 
production  of  petroleaa  la  oil  field.  It  is  pat' into 
practice'  by.  means  of  a  systea  of  electrodes  below 
veil.  The  electrodes  transmit  stead/  currents  to  the 
earth  so  tkat  an  electrical  field  is  created.  By 
receivinr  the  electric  responses,  one  can  identify 
various -formation.  If  the  earth  is  synmetrized  with 
respect  to  the  well  axis.  the  problea  of  electric 
lor  can  be  redaced  to  a  tvo-dimeasioa  problea  ia  a 
aeridlan  plane.  The  distribution  of  nndervell  aidioa 
is  deaonstrated  in  Fif.l.  where  Oa.Q1.Q2  and  Qa 


Fir.l 


da- 

•  -  =0.  (r.z)£  T 1 

do 

<2> 

u—bto  (r.z)€  T i.  i-1,—  ,p 

(3) 

a=0.  <r.z>€r2 

(4) 

where  l«ro  :Ti  is  the  insulation  boaadary,  a  is  the 
oater.  normal -direction  of  T i'.Vz  i*  the  itb  elec¬ 
trode's  ho«adary,-bi  is. the  potential  <m  T:,  p  Is 
the  aamber  of  the  electrodes:  T a  is  the  infinite 

p 

boaadary.  Xotate  TiS  U  T;  then  V  i.Y  z  and  T a 

i=l 

fora  the  whole  boaadary  of  Q . 

Let  T «  be  the  demarcation  plane  between  the  zones 
of  respective  median.  On  Pc»  there  exist  join  con¬ 
ditions 

da  do 

a.=  o»,  (  o  — -).=  (  o - (r.z) 6  F«  (5) 

d  n  do 

where  n  is  r«*s  nornal  direction  and  subscript  ***  , 
represent  two  sides  of  P «. 

In  addition,  a  also  satisfies  measured  conditions 
(correct  conditions) 

r  d” 

J  o - ds=l|;  i»l,  —  P  <6) 

rl  do 

where  Ii<£«I . —  p>  are  the  known  current  values. 

Thus,  the  model  of  electrical  log  is  reduced  to  the 
inverse  problea  constituted  by  eq.U). boundary  con¬ 
ditions  (2)^(4), join  conditions  (5)  and  aeasureaent 
coaditioa  (6). 


represent  the  respective  area  of  auddy  zone.  Invaded 
zone, original  foraation  and  adjacent  foraation.  Let 
o  (r.z)  be  distributive  function  of  electric  conduc¬ 
tivity.  It  can  be  assumed  to  be  a  piecewise  constant 
function  (l).I.e. 

o(r,z)so<l>,  (r,z)(  Oi,  i»0,I,2,3. 
vhtr«  -0  0  <l>,  a  <2>  and  o  w>,are  the  respective 

conductivity  of  and.  invaded  zone, original  foraation 
and  adjencent  foraation.  0  <#>  is  known,  a  0  <2>  and 
o<J>  are  unknown  parameters  characterizing  the 
unknown  foraation  in  the  electric  field.  Generally, 
the  electrode’s  currents  and  the  electric  potential 
on  the  electrode's  surface  can  be  aeasured.  Ihe  In¬ 
version  of  electric  log  is  to  recovejr  au),o<2>  and 
a<J>  by  electrode's  and  potential  values. 

11  .MATHEMATICAL  MODEL 

The  potential  distribution  function  u(r,z)  in  the 
underwell  electric  field  satisfies  the  following 
partial  differential  equation  (1) 


d  du  d  du 

— —  (K--— )+ - (K — )=0, 

dr  dr  dz  dz 

and  boundary  conditions 


(r,z)€Q=U  Qi 
1=0 


(1) 


HI .  ITERATIVE  METHOD 

In  order  to  deteraine  o  .  the  iterative  procedure 
is  introduced  by  letting  [2] 
k,.i  =  M6k»,  u»«t=  u»*  6  Ui,  6k,=  r6  o«. 

n-0,1-  (7) 

where  Is  the  initial  guess.  Ik,l»l  6k* I  and 
I  u» I  » I  6  u.  I  are  necessary  tor  the  convergence 
of  the  iterative  algoritba.  Upon  substituting  (7) 
into  (l),  one  obtains 

d  d<o»*0o«>  d  d(ot*6o») 

dr  dr  dz  dz 

(8) 

Upon  collecting  equal  order  tcras  and  neglecting 
second  order  tcras  of  6  ,  one  obtains  a  systea  for 


d  du»  d  do. 

- -<k» - >+ - lk,— *>  =  0 

dr  dr  dz  dz 


•^-^■=0.  (r.z)£  T 1 

dn 

u,=  bi,  (r.z) £  P 
o,  =  0.  (r.z)€  P i 
and  a  systea  for  6  0, 
d  d6cw  d  d6«h 

1  — (k. - >+r-(k-— >= 

\  dr  dr  dz  dz 


(9) 
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:  >*  ? 

(-  - >*  - 

Tr  dr  ?z  ?z 

d  6  a, 

- =0.  (r.z)  6  r i  <10) 

da 

V6«.=  0.  (r.z)€  r:ur3 

By  asiag  the  Lr  ten’s  faactioa  method.  (10)  caa  be 
traasfor*  la  to  a  ad  explicit  Fredholm  iategral  eqoa- 
tioa  of  the  first  kiad  for  the  oakaova  6  k* 

6m,(r.z)  =  X  J  G,(r.z.r'.z*)V«(r';z*)dr’dz'  (1I)‘ 

<2 

d  do*.  d’  do* 

where  W,(r.z)=-I - ( 6 k. - >+— (6k, - ))* 

dr  dr  dz  dz 

and  6»(r.z,r*,z*)  satisfies, 
d  d  d  d 

C I - Ob - )+— *(k*~— )Xi(r.z,r*,z')=:6(r**T’,z-z’) 

1  dr  dz  dr  dz 


followiag  aamerical  sloalatioa  procedure  is  carried 
oat: 

First.  o*e  chooses  paraaeter  a  (r.z)  which  stpposed 
to  represeat  the  correct  co*d activity  of  aa  n;tk 
aad  also. tie  boaadary  coaditioas  bi.i-1.— ,  p.  Then 
tie  system  (l)— (5)  with  tie  chosen  a  (r.z), and  bj. 
i‘l,~»  p  is  solved  by  numerical  method.  tics  oae 
caa  generate  tie  measured  data  Ii.i-1,— ,  p.  Xext, 
tie  initial  gaess  o*(r.z)  Is  assomed.  Cpoo  solving 
(7).(9)  and  06).  0\  is  obtaiaed.  Then  in  a  simi¬ 
lar,  manner  oj  is  obtaiaed.  Oae  continues  this  pro* 
cedare  aatil  finally  a  aaaerical  lisit  o  %  is  rea¬ 
cted.  Tie  computational  results  for  various  cases 
are  tabulated  in  tie  following  table 


- =0.  (r.z)  €  T  i  (12) 

|  do 

vg*=o.  <r.z)er2urj 

Approximately  replacing  6  o.  In  (11)  by  u-o,  and 
taking  the  nornl  derivative  with  respect  to  Tj, 
one  obtains 

do  do*  dG*(r.  z.  r\  z’) 

- -  /  / - Vi(r'.z')dr'ds’  . 

dn  do  Q  dn 

(r.z)eri,  03) 

By  integrating  (13)  on  T 2  and  according  to  (6). (13) 
is  written  as 

a* 

Jr  I - 4s=  f  f  Q»(r‘  .2*>Wr'  .z’Mr'dz'  .  <14> 

r5  3o  a 

where  1 1  =  1 1  /  ^  <a> .  *od 

dG.Cr.z.r'.z1) 

Cl!<r'.z,)=  J  - ds  05) 

rj  a» 

The  right  hand  side  of  (14)  can  be  further  simplified 
I  f  QlO’.zOW.O'.z’JdrMz' 

a 

3 

=  2  If  q!< 

J=0  Qj 

3  a  an.  a  a.< 

=  -  2  «.?’  f  I  «'.<r’,z')i - (i - )+ - <. - )]drdz 

j=o  Gj  ar  ar  az  az 

3 

=  -  sdiSol1’  .  <es  «<>'!’•<» 

J=1 

a  du.  a  a* 

where  di=-/  f  ojtr'.z’)! - <i - )+ - <1 - Mrfz. 

Qj  dr  dr  dz  dz 

du. 

Lctdi~Ir  J  — . dz.  (1*1, 2,«*/ -p).  Then, (14)  can 

ri  dn 

be  farther  written  as 
3 

-2  dl«  oS'^d1.  ,1-1 .  ,  06) 

j=I 

(16)  is  a  sjstea  of  linear  algebraic  equations  with 
respect  to  6  o*4>.  Solving  (16),  one  can  obtain 
6  0<J>. 

Now,  eqs.<7),(9)  and  (16)  fora  the  basic  structure 
for  each  iteration  in  the  numerical  iterative  algo- 
rltha. 

In  order  to  test  the  feasibility  of  the  method,  the 


initial  value 

truth  value 

iterative  soloticn 

_a> 

<T*  U»  O. 

o®  a®  o® 

'  _<D  a  CD 

<*  •  o  •  o*. 

1  5 

2  2  2 

10  15  2D 

10X00 15X0 19.98 

2  8 

5  5  5 

S)  VO  10 

50X0099X2 10X0 

3  4 

1  1  1 

3  10  1 

3X0  9.99  1X0 

4  4 

11  1 

5  20  3 

5X9  19X0  3X0 

The  siaolations  show  that  the  iterative  method  for 
solving  the  inverse  problem  of  electrical  log  is 
effective. 


Ill  D.Q.li. Apply  the  finite  element  method  to  elec¬ 
trical  log.oil  industry  publisbinghoase.  1980 
(in  Chinese) 

[21  Y.M.Chcn.J.coapat.phys.vol.dS.N’o^.^l 


CONVEX -ANALYSIS  Ain)  SU2D1FFERSNTIAL  METHODS 
I H  ELECTROMAGNET I Stt 


Stanislav  X.  KrzemiAski 

Institute  Electrical  Theory  and  Measurements 
w_.saw  Technical  University 
Koszykova  75,  00-661  Warsaw 


Abstract:  A  new  method  of  formu ) at ing  boundary 
and  initial  boundary  value  problems  with  uni¬ 
lateral  constraints  for  t leld.vectors  is 
sented.Such  problems  of  electrodynamics  were 
described-with -a' system  of raultieqvations  and- 
variational  mequaJ  lties- 


1.  INTRODUCTION 

In  the  last  years  much  was  done  in  appli¬ 
cation  of  mathematical  'physics  egua-  tions 
to  fietd  problemsa3  regards  mathematical  mo¬ 
delling  of  initial  boundary  value  problems 
with  constraints  for  field  vectors. The  fun¬ 
ctions  of  state  satisfy  not  only  the  condi¬ 
tions  resulting  from  the  given  boundary  value 
problem  but  also  additional  ones  entailed  by 
unilateral  constraints,  that  is  inequality 
conditions. 

Such  problems  may  be  described  naturally 
with  elements  of  convex  analysis  and- subdif fe- 
rential  calculus  11.31.  This  approach  to  pro¬ 
blems  with  unilateral  constraints  results  in 
their  being  described  with  multiequations,  or 
differential  inclusions  or  variational  inequa¬ 
lities  11.21.  Tnis  paper  presents  a  mathemati¬ 
cal  model  for  an  initial  boundary  value  pro¬ 
blem  with  unilateral  constraints  for  the  sour¬ 
ce  current  density  vector  and  the.  magnetic 
induction  vector. 


ELEMENTS  Or  SOBDIFFERENTIAL  AND 
CONVEX  ANALYSIS 

Tne  mathematical  basis  of  convex  analysis 
and  subdif ferentia I  calculus  are  given  in  (31, 
Here  will  be  given  only  those  elements  of  this 
theory  which  arc  relevant  to  the  subsequent 
considerations. 

(Dl);  If  functional  J:V— -*R  is  convex  and 
lower  semicont inuous.  then  its  coupled  fun¬ 
ctional.  of.  in  other  words,  the  polar  fun¬ 
ctional  J  : V  (fi) - >R  is  of  the  following 

form. 

J  (u  )  *  sup  <  <.u  ,u>  -  J(u>  >  <i) 
u  «  V 

(D2)s  We  call  the  element  u  cV  tO)  a  subgra¬ 
dient  of  the  convex  lower  semiconti-nuous 
functional  J.V - >R  in  point  (u)  when  it  sa¬ 
tisfies  the  Fenchel  inequality  (3j, 

<-u* ,  v  -  u>  +  J(v)  £  J(u)  ,Vv  «V  (2) 

We  denote  the  set  of  subgraaients  <u  )  of  fun¬ 
ctional  J:V— — *  R  by  <U(u>«  The  subdifferen¬ 
tial  dj<uj  nas  the  following  properties. 


u  e  dj ( u ) 

H  U  O  <IJ  <U  ) 

13) 

d(J.LMu) 

•  L  dJ(L*u) 

(4) 

An  initial  boundary  value  problem  with 


unilateral  constraints' for  field  vectors  wiii 
now-be  formulated. 

MODEL  INITIAL  BOUNDARY  VALUE  PROBLEM  OF 
ELECTRODYNAMICS 

A  model  of  initial  boundary  valueproblem  for  a 
medium  with  dominant  current  conductance  pro¬ 
perties  will  be  considered. For  such  medium  the 
canonical  equations  [2]  are  given  by 


L**  H  - 

(5) 

B  - 

H  •  H 

(6) 

L  •  A  - 

B 

(7) 

where  L  -  L  •  (Vx)  are  the  operators, 
vector  of  source  current  density  j(x,t) 
the  vector  of  magnetic  field  intensity 
have  two  components. 

The 

and 

H(x.t) 

J(x.t) 

-  J(X.t)  *■  3(x.t) 

(8) 

H(x.t) 

-  H(x.t)  ♦  H(x,t) 

(9) 

Components  j  and  H  are  nonlinear  and  de¬ 
pend  on  the  vector  B.  Constraints  for  the 
induction  vector  are  contained  in  the  defined 
set  K(O). 

K(fi)  -  <  At  A  oV(0).|  VxA  |  i  Bo  >  (10) 

The  indicator  function  of  set  K(O)  is  of 
the  form. 

0  for  As  K(O) 

indj.(A)  •  p(A)  •  (11) 

+«>  for  A  e  K(O) 

The  normal  cone  of  set  £(0)  in  point  A 
is  a  set  of  subgradients  A  or  3  of  the 
form.  Accordingly,  one  may  write  that  the 
controlled  component  of  source  current  3  be¬ 
longs  to  the  cone. 

3  «  -  NK(A>  -  -dindK(A>  -  -*>(M  (12) 

Tne  cnaractenstic  of  the  magnetic  medium. 
H(B)  is  given  by, 

f  for  B<  0 


H(B>  -  i*oB  for  0  <  B  <  Bo  (13) 
i>oB5  tor  B  >  Bo,  V'-  B^/H^ 
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m"1  -  If-  e  <W3)  U4) 

The  nonlinear  component  H(3)  belongs  to 
the  subdifferential  of  the  indicator  function 
of  the  epiH. 


H(3)  «  dindepiH{B.H(B>>  -  ^r(B)  115) 

Thus,  the  dynamics  of  electromagnetic 
field  in  region  Q  with  unilateral  const¬ 
raints  for  field  vectors  contained  m  (12) 
and  (15)  was  described  by  two  inclusions, 
cultiequations  of  the  following  form. 


<U-A.l’-  H-j  +  r- 1|->*  f>(U)  -  p(A)  <  0 

(19) 

<V  -  H  .  L-A>  +-  v*(V)  -  P(H>  S  0  (20) 

Eliminating  the  vector  of  magnetic  field 
intensity  H  from  (20),  one  gets  the  parabolic 
type  evolution  variational  inequality. 


<r-g£-  ,U-A>  *  a(A.U-A)  5  <j,U-A>  (21) 


satisfied  for  every  vector  U  «  K(O)  and 
A  o  K(O). 


(16) 


INEQUALITY  MODEL  OF  THE  BOUNDARY 
VALUE  PROBLEM 


L  A  -  K  fi-  »f(L  A)  (17) 


The  subdifferential  ^H(B)  of  the  chara¬ 
cteristic  (13)  has  the  following  analytical 
form. 


<?H(B) 


-2  i>o 

(-2  »>o.  vj 
v 

o 

l  v  ,2  v  ) 

o  o 

2  poB 


for  B  <  0 

for  B  *  0 

(18) 

for  0  <  B  <  B^ 
for  B  -  Bo 
for  B  >  B^ 


The  model  initial  boundary  value  problem 
of  electrodynamics  is  of  the  form, for  given 
curient  sources  j.  r  and  material  parameters 
Y  ,  n  in  region  Q,  determine  the  potential 
distribution  A  «  K(O)  satisfy-  mg  variatio¬ 
nal  inequality  (22)  and  the  initial  condi¬ 
tion  (23), 

Ah 

<r-5~-  ,U-A>  *  a(A,U-A)  ^  <j.U-A)  (22) 

A(x.O)  -  Ao(x).  x  O.VU  -a  K(O)  (23) 

In  the  case  of  a  magnetostatic  field,  the 
boundary  value  problem  is,  for  given  current 
sources  j  ,  r  and  material  parameters  r  .  a* 
in  region  (O)  determine  distribution  of 
magnetic  potential  A  a  K(O)  satisfying  the 
variational  inequity 

(A.U-A)  <  <J.U-A>  ,  V  U  a  K(O)  (24) 


Fig.l.  presents  the  subdifferential  of 
the  characteristic  H(B) . 


Fig.l.  Subdifferential  0H(B) 

In  virtuo  of  the  subdifferential '3  pro¬ 
perty  (D2)  the  system  of  mul tiequat; ons  (16)- 
(17)  Is  equivalent  to  a  system  of  variational 
inequalities  (19). (20)  II],  in  which  (*>)  and 
(V>)  are  indicator  functions  jf  sets  K(O)  and 
epiH  respectively. 


The  variational  inequality  (24)  was 
solved  by  approximating  function  A(x.y).  (x.y) 
a  O.  with  a  finite  element. 


CONCLUSIONS 


Subdifferential  calculus  and  convex  ana¬ 
lysis  have  never  previously  been  applied  to 
descriptions  of  nonlinear  problems  of  electro¬ 
dynamics  with  unilateral  constraints  for  field 
vectors. 

This  approach  enables,  in  a  natural  man¬ 
ner,  the  construction  of  a  mathematical  model 
consisting  of  elliptic  and  parabolic  variatio¬ 
nal  inequalities. 
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Abstract-  N-body  problem  of  classical  elec¬ 
trodynamics  is  considered.  The  Newtonian  in¬ 
stantaneous  action  at  a  distance  is  replaced 
by  an  action  at  a  distance  propagated  with 
finite  velocity.  Fixed  point  approach  allows 
to  formulate  "escape"  conditions  fo-  the 
charged  particles- 

1.  INTRODUCTION 

The  two-body  problem  of  classical  elec¬ 
trodynamics  without  radiation  term  has  been 
posed  by  J-L. Synge  C13;  using  the  relativis¬ 
tic  expressions  for  the  Lorentz  ponder mot i ve 
force,  due  to  Ziirauli  C23.  The  main  feature 
of  his  considerations  is  that  the  Newtonian 
instantaneous  action  at  a  distance  repla¬ 
ced  by  an  action  at  a  distance  propagated 
with  finite  velocity-  The  problem  stated  has 
been  investigated  by  several  authors  using 
different  approaches  (cf.  C33-C53). 

A  natural  extension  of  the  problem  menti¬ 
oned  is  a  N-body  problem  of  classical  elec¬ 
trodynamics.  From  the  physical  point  of  view 
this  problem  is  based  on  the  linear  supei — 
position  principle  of  the  fields.  This  means 
that  the  Lorentz  force  can  be  calculated  by 
summation  of  the  retarded  fields  produced  by 
all  other  particles.  In  C63  one-dimensional 
case  it  has  been  considered. 

In  the  present  paper  we  formulate  ah  ini¬ 
tial  value  problem  for  equations  of  motion 
corresponding  to  three-dimensional  N-body 
problem.  By  means  of  a  fixed  point  theorem, 
proved  in  C53,  we  formulate  "escape"  condi¬ 
tions  for  the  moving  charged  particles. 

XX.  FIXED  POINT  THEOREMS 

Let  X  be  a  separated  uniform  space  with  a 
uniformity  generated  by  a  saturated  family 

of  pseudometrics  $  >  where  «B 

is  an  index  set.  Let  be  a  mopping 

with  iterates  defined  as  follows:  * 

WP*^*)),  </<>>-?  <k-l,2, ...).  Let  <«>  be  ^ 
family  of  contractive  functions«k<+> : 
with  the  properties:  7 

<®1>  is  continuous  from  the  right, 

strictly  increasing  and  0<4^<i)<^  for  -fc>0; 

<«2>  .  0  <or 

n— ^ 
every  >0  and  . 

The  operator  T  :  X  ->  X  is  said  to  be  <t>- 
ccntractive  if  for  ever y  x,<cX  and  left  the 
inequality  v 

jyTx,1\j>i<£f<  fw  «,3» 

is  satisfied. 


THEOREM  2.1  C53  Let  T  3  X  — >  X  be- ®rcon trac¬ 
tive.  For  each  there  is  <J^(i)e  such 

that- -for-  i>0 

and<&(i>/-t  is  non -deer  easing.  In  addition, 
there  is  x+eX  such  that 

#(J,X*,T*o>< 

<k-0, 1,2, ... ) . 

Then  T  has  at  least  one  fixed  point  inX. 

It  is  easy  to  verify  that  a  sum  of  O- 
contractive  and  completely  continuous  opera¬ 
tor  *is  no  longer  densifying  one,  because  of 
that  we  use  the  notion  of  O-densifying  opera¬ 
tor,  introduced  in  C53- 

The  operator  T  is  said  to  be  fc-densi- 
fying  if  for  every  bounded  set  and 

Jf<T<£l»i4yy  *,,<&», 

where  coincides  with  Kuratowskii’s  or 

Hausdorff's  measure  of  noncompactness. 

It  is  known  that  every  separated  locally 
convex  (linear  topological)  space  possesses  a 
uniformizable  topology  being  completely  regu¬ 
lar.  We  suppose  E  has  the  property  <C>:  the 
convex  closure  of  every  compact  set  is 
compact  in  E.  When  E  is  a  Banach  space  (C)  is 
satisfied  in  view  of  Mazur's  theorem.  If  £  is 
a  locally  convex  one, (C)  is  satisfied  if  E  is 
complete  or  even  quasycomplete. 

THEOREM  2.2  C53  Let  T:J1->M  be  continuous 

densi fying  mapping.  For  every  bounded  set  Q. 
with  <&)><>  the  inequality  holds 

<j,Si><*><n*0,l,2,...> 

for  some  positive  constant#.  Then  T  has  at 
least  one  fixed  point  in  M. 

XXX.  ON  THE  N-BODY  PROBLEM 

Wo  formulate  N-body  problem  taking  into 
account  that  every  charged  particle  is  under 
the  influence  of  another  N-l  particles.  The 
right  hand  sides  of  the  equations  of  motion 
we  calculate  by  Lienard-Wiechert  retarded 
potentials  following  the  technique  due  to 
Synge  £13  and  Pauli  C23.  So  the  right  hand 
side  of  every  equation  turps  out  a  sum  of 
retarded  fields  produced  by  the  last  N-l 
particles. 

Introduce  the  denotations  <c.f.C53>:  the 
space-time  coordinates  are  £„  =  <**» JC^Wi  )  (C- 

the  speed  of  light),  where  Latin  suffices  run 
over  1-4,  while  Greek  suffices  run  over  1-3 
with  Einstein  summation  convention;  the 
scalar  product  is  ;  the  proper 

masses  are  ***<!  *1,2, . . .  ,N> ,  the  charges*#*  , 
unit  tangent  vectors  A* , the  elements  of  the 
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/  w  OO  «>  l*) 

proper  timeo5,the  velocities  U d  ^  = 

*  s  <  >', 

81  (*; 

fe _ , 

Q***V  *  are  isotopic  vectors 

Vvi* 

?‘r“'“=  -*«,(«) ,  «««*,«»  j 

where  the  retardations  satisfy  the 

functional  equations 


•**„«  =  j-lj 

Equations  of  motion  for  N  charged 
particles  are: 


oU 

‘5s 


<*) 


,<W)  («.«> -,*«•>  p«.w]  , 

+  *«c 


(*) 

<n 

<**V 


**T 


-retarded  potentials. 


<r.»J 

(k-l,2,...,N),  where  =rf, 

^Xn 

And  4>'“1  . 

^ (  -  <A«>  f'-’y 

The  system  (1)  is  a  neutral  functional  dif¬ 
ferential  one  with  respect  to  the  unknown 
velocities.  After  usual  substitutions  we  can 
reduce  (!)  to  the  following  one: 

w>=2|C“; 

xwi*  (2) 

wjrt)  =  <^'X 

where  are  prescribed  initial  acce¬ 

lerations. 

The  following  theorem  is  based  on  Theorem 

2.2. 

THEOREM  3.1. Let  the  function*  lW  4:> :  CO,*)  -> 
(0,«*>)  satisfy  the  inequalities  <k*l,2, . . .  ,N> 

22  M  "  Mi 

a;1.  ,  1 

1-62  ~  $2  — |] 
wil  **•»  I  I  I  Xc  Ij 

**♦*  «*♦« 

then  for  every  function  W«2j  <  ^  ;  £*,) satisfying 
the  inequalities 

i,  0,f, 

sj*. 

£  w,  (4)  •£ 


such  that  |  v/^e  i'flj-  W0ct), there  exists  a 
solution  of  the  initial  value  problem  (2)  be- 
0*  _  j  a*i 

longing  to  C  £»/,*<*  >3  such  that  (r>  are 
the  distances  between  the1 particles  for  -t  20. 
It  is  easy  to  see  that 

lim  Y^iV)  =  ®, 
t->» 
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Abstract-rThe  paper  presents-a  method- of  opti¬ 
mization  the  constructional  paraceters  consi¬ 
dering  maximization  of  the  selected  work’s  pa¬ 
rameters  of  the  multidimensional  electromecha¬ 
nical  system.  The  problem  of  optimization  has 
been  solved  using  8ox-Wilson’s  method  and  the 
analysis  of  absolute  function  of  sensitivity 
of  the  first  order.  This  paper  presents  appli¬ 
cation  of  the  above- method  and  results  of  op¬ 
timization. 


I.  INTRODUCTION 


With  the  analysis  of  complex  multidimensional 
objects,  especially  there  wheri  phenomena  that 
are  mechanic  in  their  character  are  accompa¬ 
nied  by  magnetic  and  electromagnetic  phenome¬ 
na,  the  task  of  optimization  an  object  on  the 
basis  of  analysis  of  physical  phenomena  is  ve¬ 
ry  difficult,  sometimes  impossible.  Most  im¬ 
portant  problem  for  the  electrical  transducers 
for  the  measurement  of  displacement  (for  ex. 
for  the  inductive  transducer)  is  optimization 
construction’s  parameters  considering  maximi¬ 
zation  of  the  metrological  characteristics  pa¬ 
rameters.  The  Box-Wilson’s  method  has  been  used 
to  solve  this  problem  [l]  .  The  problem  of  op¬ 
timization  has  been  reduced  to  the  classical 
problem  of  nonlinear  programming  with  limita¬ 
tions  (maximization  output  quantity  y  through 
changes  of  the  Input  parameters  xi,x?,...Xn 
in  the  preset  intervals  of  permissible  varia¬ 
tions).  Use  has  been  made  of  the  experimental 
design  techniques,  approximation  of  the  object’s 
properties  and  hypersurface  of  the  I-st  degree 
by  the  formula; 


y  «  b0  ♦  Vi—Vf.  a> 

where  b0,bj,...bn  are  the  coefficients  of 
this  relation  In  order  to  identify  the  model 

CO  .CO  ■ 

The  analysis  of  absolute  function  of  sensiti¬ 
vity  of  the  first  order  £2]  by  the  formula: 


•#x 


3  b 


Qy  , 

~  >1,2 . n 


i 


(2) 


has  been  given  information,  which  constructio¬ 
nal  parameters  are  more  important  than  another 
in  this  optimization. 


II.  APPLICATION  OF  THE  METHOD 

fhe  above  method  has  been  applied  in  the  ana¬ 
lysis  of  an  inductive  transducer  system  used 
for  the  measurement  of  displacements  which 
works  in  the  systems  £3]: 

1.  of  a  hybrid  transformer  and 

2.  of  a  resistance  bridge. 


The  measuring  range  has  been  taken  as  an  output 
quantity  y  but  it  has  been  observed  sensiti- 
vitytfm  mV/mm)  and  residual  voltagetfin  V) 
this  same  tine.  The  range  of  the  linear  cha¬ 
racteristic’s  part  in  the  separate  experiments 
has  been  defined  for  the  three  values  of  the 
relative. error  (for  0;5%,  IV and  1,5%)  based 
on  the  linear  approximation  £33- 
The  target  of  the  analysis  and  experiments 
is  obtaining  of  the  maximal  linear  range  (mea¬ 
suring  range)  shown  in  Fig.  1. 


Fig.  1.  Dependence  of  the  output  voltage’s 

amplitude  of  the  measuring  system  from 
the  core’s  displacement 

All  parameters  characterised  the  inductive 
transducer  has  been  selected  on  the. 

1.  measurable  input  quantities  which  are  *he 

subject  of  th°  optimization  process 

(xl.x2 . x„) 

2.  and  fixed  Reasonable  Input  quantities  [3J. 
The  constructional  parameters  regarded  as  the 
input  quantitres,  have  been  presented  in  Ta¬ 
ble  1. 


TABLE  I 

MEASURABLE  INPUT  QUANTITIES 


Deno¬ 

ta¬ 

tion 

The  name  of  the  parameter 

Ground 

level 

Permissible 
range  of 
variations 

X1 

Frequency  of  the  power 
supply 

10  kH2 

0,5-20  kHz 

x2 

Number  of  turns  of  each 
windings 

1360 

H5300 

windings 

x3 

Distance  between  the  win¬ 
dings 

2,5  era 

0,05*33  cm 

xi 

Length  of  the  ferrite  core 

22,5  cm 

1*30  mm 

x5 

Inside  diameter  of  carcass 

4,9  era 

3, 4*5, 4  cm 

X* 

Outside  diao.  of  ferr.  core 

3,5  era 

3*5  cm 
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Thus  the  problem  is  reduced  to  the  maximiza¬ 
tion  of  y  through 'the  changes  of  the  parame¬ 
ters  xi,X2,...,xg  in  the  preset  intervals  of 
permissible  variations  for  two  systems  of  the 
-transducer's -work. 


III.  RESULTS 

Owing  to  the  realization  of  the  experimental 
design  technics  (according  to  a  two-level  half 
design)  32- experiments  have  been  made  for  each 
transducer's  system  of  the  work.  The  results 
have  been  presented  in  Table  H  and  Table JH  [3] . 

Next  experiments  has  been  executed  to  finding 
the  local  maximum  based  on  the  Box-Wilson’s 
method  [3] .  The  local  maximum  has  been  found: 

1.  in  the  system  of  a  hybrid  transformer  in  37 
experinent  (6,2  ram  for  0,5%;  7,2  ran  for  1%; 

7,9  mm  for  1,5%;  sensitivity  220  mV/raa;  resi¬ 
dual  voltage  0,015  V), 

2.  in  the  system  of  a  resistance  bridge  in  36 
experiment  (7,7  ma  for  0,5%;  8,7  mm  for  1%; 

9,3  mm  for  1,5%;  sensitivity  200  mV/ram;  residual 
voltage  0,049  V). 

That  were  the  results  of  optimization.  Identi¬ 
fication  of  the  coefficients  of  the  form  (l) 
basing  according  to  the  principle,  of  the  least 
sum  square  [31.  This  coefficients  shows  Table 
IV  and  Table  \T. 


TABLE  V 

RESISTANCE  BRIDGE 


:rror 

M 

ki 

Coefficients 
k2  |  k3 

k4 

k5 

k6 

l  ! 

! 

0,5 

-0,353 

-0,147 

-0,047 

0,191 

0,009 

-0,009 

t 

1 

-0,441 

-0,078 

-0,028 

0,266 

0,041 

-0,041 

1,5 

-0,503 

-0,078 

-0,003 

o;29i 

0,053 

-0,022 

I 

! 

8.  The  local  maximum  has  been  found:  Analsysed  the  absolute  .function  of  sensitivi- 

2  system  of  a  hybrid  transformer  in  37  ty  of  the  first  order  (2)  for  system  of  ahy- 
nt  (6,2  ram  for  0,5%;  7,2  mm  for  1%;  brid  transformer,  results  has  been  got.  This 

or  1,5%;  sensitivity  220  mV/raa;  resi-  analysis  informs,  that  outside  diameter  of  the 

tage  0,015  V),  ferrite  core  is  parameter,  which  makes  up  the 

2  system  of  a  resistance  bridge  in  36  least  sensitivity,  for  system  of  a  resistance 

at  (7,7  ma  for  0,5%;  8,7  mm  for  1%;  bridge  this  analysis  informs,  that  this  same 

ar  1,5%;  sensitivity  200  mV/ram;  residual  parameter  makes  up  the  least  sensitivity. 

3,049  V). 

re  the  results  of  optimization.  Identi- 

of  the  coefficients  of  the  form  (l)  IV.  CONCLUSIONS 

:cording  to  the  principle,  of  the  least 

re  [3l.  This  coefficients  shows  Table  The  gradient’s  optimization  method  has  been 
\f.  appeared  to  be  very  effective  to  the  optimiza¬ 

tion  and  structural  design  of  the  metrological 
TABLE  IV  properties  of  an  inductive  transducer  for  the 

HY8RX0  TRANSFORMER  measurement  of  displacements.  The  analysis  of 

_  sensitivity  can  help  in  the  selection  of  the 

~  |  _  |  more  and  less  important  constructional  pararne- 

toemcients  ters  to  the  next  experiments  and  analysis.  It 

k.  k,  k,  k.  kr  K  seems  that  it  can  be  successfully  generalized 

1  1  *  456  for  other  systems  especially  for  electromucha- 

-  nical  measuring  transducers. 

»«  "0,078  -0,166  0,247  0,091  -0,020 

159  -0,041  -0,141  0,334  0,078  -0,022  „ 

O  me  „  ,,o  „  M  Box  0.E.P,  Wilson  K.B. :  On  the  experimental  at- 

175  -0,025  -0,119  0,394  0,081  -0,019  taintment  of  optimum  conditions.  Journal  of  the 

- - - - - -  Royal  Statistical  Society,  Ser.  Bj  No  1,  1951 

[2j  Frank  P.M. :  "Intrudocution  of  system  sensitivity 
theory”.  Academic  Press,  Hew  York  1978 
[3j  Jasirteka-Choromafcka  0.:  "Mathematical  model  of  in¬ 
ductive  transducer  for  measurement  of  displacement”, 
doctor’s  dissertation,  Warsaw  University  of  Techno¬ 
logy,  Warsaw,  1983 

TABLE  U  AN0  TABLE  III  [3] 

RESULTS  OF  EXPERIMENTS  OF  A  HYBRID  TRANSFORMER  AND  OF  A  RESISTANCE  BRIDGE 


Error 

w 

kl 

k2 

k3 

k4 

k5 

k6 

0,5 

0,116 

-0,078 

-0,166 

0,247 

0,091 

-0,028 

1 

0,159 

-0,041 

-0,141 

0,334 

0,078 

-0,022 

1,5 

0,175 

-0,025 

-0,119 

0,394 

0,081 

-0,019 

ractors  x. 

Q 

xi 

*2 

x3 

x4 

x5 

x6 

Sasic  level  xR 

10 

1360 

2,5 

22,5 

4,9 

3,5 

ixpccloental  step  Axn 

3 

144 

0,5 

2,5 

0,5 

0,5 

i 

Jpper  level  (♦) 

13 

1504 

3 

25 

5,4 

4 

.ower  level  (-) 

7 

1216 

2 

20 

4,4 

3 

Standardized  variables  t0 

% 

*2 

s 

s 

t^ 

tfi 

Experiment  l 

+ 

4 

4 

♦ 

♦ 

2 

4- 

4 

4 

♦ 

- 

3 

♦ 

4 

4 

- 

4 

4 

4 

♦ 

4 

- 

♦ 

- 

5 

4 

4 

- 

- 

4 

♦ 

6 

4 

4 

- 

4 

- 

4 

7 

4 

4 

- 

4 

4 

t 

8 

4 

4 

- 

• 

- 

- 

1 

9 

4 

“ 

“ 

4 

4 

♦ 

i 

? 

38 

13,0 

1294 

2,20  26,7 

5,08 

3,38 

y0,5Y 

yn 

yl,5Y 

y4 

y5 

5,6 

5,8 

6,3 

234,8 

0,015 

6,1 

7,1 

7,8 

.231,3 

0,065 

4,9 

6,8 

257,5 

0,010 

4,4 

5,1 

6,0 

238,2 

0,020 

5,6 

6,5 

7,3 

220,9 

0,053 

5,2 

6,1 

6,6 

239,6 

0,031 

A, 7 

5,7 

6,3 

270,1 

0,004 

5,? 

6,8 

7,4 

216,0 

0.0S1 

4,1 

5,0 

5,6 

278,4 

0,016 

5,2 

6,2 

6,9 

235,8 

0,031 

6,2 

7,0 

T£ _ 

216,5 

0,013 
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NEW  METHODS  AND  RESULTS  IN  3D-DEVICE-SIMULATION 


II.  CljctmaM.  w.  Kli>«  R.  DillmnnW. 

R.  Rcutn^’.U.Siciucfo 

(a)  University  of.'1  cchnology  Dresden 
Institute  of  Ii!ccirotcdinics/I.2ccirnjiics 
Mommsenstr,  13, 13-8027  Dresden 
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(c)  University  of  Transport  and  Communication 
Institute  of  rjcctrotcehnks 
rricdrich-Iist-PIat/^  1, 13-8010  Dresden 

Abstract  In  ibis  paper  we  present  cfTicicnt  methods  for  llic  solution  of 
large  systems  of  linear  equations  arising  within  the  3D-IX’vicc  Simulator 
'SIMBA,  and  simubtion  results  for  trend)  cells  Kill  Is  and  an  IPG- 

rcr. 

(.  Introduction 

We  describe  numerical  methods,  which  arc  now  implemented  in  the 
3D-Dcvicc  Simulator  SIMBA.  and  results  of  lire  simubtion  of  new  se¬ 
miconductor  devices,  ‘Hircc  dimensional  rimubtions  l»ccomc  very  im¬ 
portant  for  the  development  and  optimization  of  nxxkrrn  microelectro¬ 
nic  devices  and  VI.SMCs, 

lire  first  rdcasc  of  SIMBA  was  avaibblc  for  industrial  applications  in 
1982  An  analytical  model  and  an  Interface  to  the  Process  Simulator 
DUPSIM  arc  included  m  SIMBA  for  nonpbnar  surfaces  and  the  do¬ 
ping  profile, 

'I he  behavior  of  the  semiconductor  device  (Si.  (la  As.  Ini’)  is  modelled 
by  the  well-known  system  of  semiconductor  equations; 


div(t*  gratty)*  — (p  —  zi  h  CN) 

(1) 

—  Op 

div.9p  q  *(/?**«  O  4— ) 

(2) 

«liv.s.«  «»< K-a+%) 

(•') 

>■-«  Z? -» 

(2) 

a  m-1  *  0‘,  *  nit:-  n,  *  frail /|) 

(5) 

By  using  the  band  parameter  model  |l|,  it  is  povubk-to  swuibtc.  Intern 
devices,  fhc  recombination  model  includes  lire  SHOCKI  JiY/KlvAD/ 
HAM.  recombination  with  doping  dependent  life  tunes  and  live  Auger 
recombination,  A  modified  model,  according  to  (TIAUOI.Y/1  MO* 
MAS  is  used  for  live  carrier  mobility  (2J. 

2,  Numerical  McdxxU 

Ihe  set  of  equations  (|)  (J)  is  Iransfonncd  with  tlx.  GAUVSun  Ltw  and 
then  discretized  by  a  seven  point  difference  scheme  in  a  rectangular 
nonunifonn  god,  Ihc  backward  I.UI.F.K  formula,  combined  with  a 
predictor,  is  used  for  (lie  time  discretization. 

To  describe  the  current  densities,  the  GUMMMJM  MARI  UTlhR 
method  is  used.  Ik  haw,  eel!  for  the  discretization  is  ,i  cube  with  an 
inner  gnd  point,' At  tlx:  outu  planes  of  the  device  some  parts  of  the  cell 
arc  removed  ansi  tlK  gnd  point  is  located  at  I  lie  surface  of  the  tfcvtec. 
It  is  possible  to  describe  a  nonpbnar  surface  by  means  ol  tins  tcclmx|uc. 
The  Poisson  and  the  continuity  equations  arc  solved  consecutively  ac¬ 
cording  to  GUMMlil.'s  mctlwxl.  Normally  we  use  for  the  discretized 
form  of  the  Poisson  equation  p  ■*  zi  «  /i(^)  ami  solve  it  nonli¬ 
near  by  using  NBWION's  method.  Sometimes  it  is  useful  to  consider 
p,n  in  (I)  as  independent  on  <>.  Ihcn  a  system  of  linear  equations  re¬ 
sult*.  Ihc  discretized  continuity  equations  arc  treated  linearly.  I1»c  set 
of  unknowns  consists  of  the  electrostatic  potential  and  the  comer  den¬ 
sities.  The  Urge  systems  of  linear  equations 
A  *x  »  b 

arc  solved  with  a  mrxhficd  K’CG  method  |3]  and  iftc  ICBCG  method 
(4J.J5)  for  the  Poisson  equation  and  the  continuity  equatin'.*.  respec¬ 


tively.  In  principle  llic  algorithms  read  as  follows. 

-  initialization 

r„  -  b  -  A  *  Xa  ;  r  V,  «  r„ 

Ih  “  V-Uf*  *  <b  :  (tf'f'  *  '<□ 

*  loop  until  convergence 


,  ■  r**n 
*  "  /V(dl7»,) 

rM  **  r,  -  n*  (A  */>) 

(«/)-'  *rm 

xin  +  «  *n 

rsM  -  n,  -a*  VT*r\) 


dli  =  (/.to-1  *  z.f/+| 

0  <H**(AT*ps{) 
/yt 

Pin  «  dl  +  ft  tp, 

/><„,  »  <th  +  ft  f  ptp 


where  (/,(/)  and  (lA1)  itenote  incomplete  dconmpmiliotis  of  the  matri¬ 
ces  A  and  AT,  respectively. 

SIMBA  runs  on  VAX/VMS,  IBM  3090.  CONVI  X,  and  SUN-worl- 
slations.  Essential  parts  of  live  code  arc  vectorized,  which  results  in  a 
remarkable  decrease  m  GPU  time  (a  factor  of  ca,  4  was  measured  for 
typical  examples).  It  is  pbnncd  to  study  live  possibilities  of  further  im¬ 
provements  in  order  to  exploit  the  vector  and  parallel  capabilities  of  llic 
IBM  3090  Multiprocessor,  As  the  solution  of  (Ik  s\ steins  of  linear 
equations  is  the  most  lunc  consuming  part  (about  SO  %  of  CPU  time) 
it  will  be  the  favorite  candidate  for  parallelization.  Nome  of  possible 
technique,*  for  llic  parallelization  of  the  above  slated  algorithm  may  be. 


•  spread  llic  simple  vector  operations  ( v~z4  » tx  and  ywjrfv) 
across  several  processors, 

•  in  a  matrix-voctor-product  partition  llic  tnitnx  honzonUlly  .ami 
compute  each  corresponding  part  of  the  resulting  vector  separately 
in  one  processor, 

•  apply  block  teshniques  in  order  to  gel  a  parallel  vilver  for 
(l.U) » *x and  (/.//)  '*v. 


3.  Results 


1  he  punch-through  be  Jxivkii  ol  Iniuli  cells  m  219-  and  31)  uiscisu  first 
result  of  the  simulations  performed  with  MMBA,  llw  sketch  of  tlx. 
device  and  the  eum.nl  volt  ige  re bt urns  are  shown  m  I  ig.  1  and  2.  I  he. 
difference  between  (lie  two  eateubted  punch-through  voltages  is  ibout 

2  Volts,  caused  by  the  influence  of  llic  back  area  m  3D  ease.  About 
200000  grid  points  were  used  for  this  simulation. 

‘Ihc  structure  of  a  n-channel  IGBF  (insulated  gate  bipolar  transistor) 
is  shown  in  Pig  3  ’llic  thickness  of  the  gate  oxtd  is  300  nm.  'llic  Kill  F 
was  biased  to  a  gile  voltage  of  10  V  to  calculate  tlx>  output  clwMctcn- 
Stic,  *3*own  in  Fig,  4, 

By  means  of  SIMBA,  it  was  possible,  to  simulate  v>  called  IN-Pbnt- 
Gated  (IPG)  field  effect  transistors.  WlliCK  am!  PI -OOG  developed 
these  devices  J6],  which  luve  a  quasi  one-dimcuuwul  (Q|D)  tunable 
earner  channel.  I  >g  3  shows  the  structure,  whxh  "as  used  for  lire 
3D  simubtion,  Ihc  hetcro-slrueturc  consists  of  an  undoped  GaAs 
layer,  a  20  nm  undoped  AlG.aAs  spacer,  and  a  50  nm  doped  AlGaAs 
bycr  The  channel  is  insulated  laterally  from  the  two  dimensional  elec 
tron  systems  (2DF.S),  which  serve  as  gates,  Ihc  conductivity  m  'he 
Q1D  clianncl  is  controlled  via  a  gate  to-sourcc  voltage  Vrv  which  is 
applied  between  the  channel  and  adjacent  2DIiS  regimes.  Because  the 
regions  near  the  msubted  paths  luve  lower  tarnci  densities  and  mobi 
lilies,  they  were  included  m  our  simubtion.  Hie  calculated  1,-anjfcr 
cliaraetcristic  at  room  temperature  is  represented  m  Fig  6,  which  «*x)\vs 
a  threshold  voltage  of  about  7  V  and  a  dram  current  of  116  /iA  at 
Vf,s  *  A-  Ihc  results  of  live  simulation  agree  well  with  (hose  obtained 
cxpcnmcntally  in  |5), 
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4  Condudons 

♦  'I Tic  ICBCG  method  is  an  efficient  algorithm  fur  tlic  .solution  of 
large  linear  systems  arising  in  device  simulation. 

♦  The  use  of  a  vector  processor  accdcralcs  the  overall  performance 
of  the  simulation  njns  by  a  factor  of  ca  4. 

♦  The  simulation  of  trench  cells  and  an  11*0-1*171'  compares  very 
well  with  experimental  results  obtained  by  other  authors. 


Tig  I.  Sketch  of  the  simulated  trench  cells  with  equipotential  lines 


Mg  3  Structure  sketch  of  tlic  ICiBT  Mg  4  Oulput  diaractcristic 
of  the  KJBT 

JiwOtor 


Hg  5,  Slradutc  slcttli  of  ilir  Il'CS  ficll  cited  trwnlor 


VOS=  1.S  V 


V6S  /V 


llg  6.  Cilcclitol  'I  ranefer  cKvadcridic  of  llw  IPG  field  effed 

irandstor 
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SIMULATION'AND  OPTIMIZATION  OP  CIRCUITS 
BY- MEANS  OF  WAVEFORM  RELAXATION  METHODS 


KLAUS  R.  SCHNEIDER 
Kf.-I-Weierstrafi-Inslitute  ol  Mathematics 
MohrenstraBe  39 
Beilin, 0  -  1086,  F.R.G. 


Abstract  -  The  waveform  relaxation  method  (WRM)  is  an  impor¬ 
tant  tool  to  compute  the  solutions  of  Cauchy  problems  for  very 
large  systems  of  differential- algebraic  systems.  By  means  of  Pon- 
trjagm's  maximum  principle,  problems  of  optimal  control  can  be 
reduced  to  the  solution  of  boundary  value  problems.  In  case  of 
large  structured  systems  it  is  efficient  to  compute  the  solution  of 
the  corresponding  boundary  value  problem  by  WRM.  We  give 
conditions  for  the  convergence  (both  global  and  quadratically 
local)  of  the  WEM. 

1.  GLOBALLY  CONVERGENT  WRM 

The  transient  analysis  of  electronic,circuits  is  equivalent  to  the 
solution  of  the  Cauchy  problem  for  the  differential-algebraic  sys¬ 
tem 


The  proof  of  the  following  theorem  can  be  found  in  [3]. 

THEOREM  1.1.  Assume  hypothesis  (V)  holds.  Then  the  Cauchy 
problem  (1)  has  to  anyT  >  0  a  unique  solution  and  the  iteration 
scheme  (2)  converges  for  any  initial  guess  to  the  unique  '  lution. 

2.  QUADRATICALLY  CONVERGENT  WRM 
In  an  appropriate  Banach  space  we  introduce  the  operator 

F(tvX0  :=«(<)  -  [  1  mAld’A’l'V’, 

*(<;  -  sKW.'WiW.o- 

Then,  the  Cauchy  problem  (I)  is  equivalent  to 


|=7((,d(/dt,i),t),  i)=s((,d(/dl,i),t)  (I) 

((«.)  =  (.,<  6  S-.=  (tt,h  +  n 

In  case  dim  (  +  dim  j?  >  1  circuit  simulation  is  a  time  con¬ 
suming  task.  In  overcoming  this  difficulty  WRM  plays  a  central 
role,  it  is  an  iterative  method  for  determining  the  solution  of 
(l)  which  is  based  on  the  decomposition  of  the  large  system 
into  subsystems  and  on  the  independent  (effective)  solution  of 
each  subsystem,  WRM  is  immedeately  suitable  for  parallel  tech¬ 
niques.  The  corresponding  canonical  scheme  reads  (2) 

~  =  /(x'.x'-'.dl'-'/dl, 
a*  =  (2) 

**(<«)  =  I  6S.  *  =  ».V- 

Concerning  f  and  g  we  suppose 

(V).  / :  RnxRnxR*xRmxR  -  R\  j :  RrxFrxPTxRrxR  — 
Rm  are  continuous  and  satisfy  Vw,  tZ>  €  where 
w  :=  (*,,  *,,**,  a),  ti  ;=  (ii.Ij.Is,  l),‘£R 

l/(w,  t)  -  /(t5,  !)l  <X>,|i.-*.l  +  k«|s  J| 

0  “  01 5  ]C  -  ril  +  *<l*  “  *1 

1=1 

where  the  spectral  radius  o(L)  of  the  matrix 


obets  o{L\  <  1 . 


?(ur)  =  0  (3) 

In  general,  the  operator  F  has  a  special  structure  which  can  be 
exploited  for  a  block  iterative  scheme.  In  the  simplest  case  we 
represent  (3)  in  the  form 


t\(w,,w,)  =  0  (4) 

F)(uii,wa)  =  0 


Concerning  the  system  (4)  we  assume 
(VI).  (4)  has  a  unique  solution  u»*« 

(Vi).  ?  is  twice  continuously  Fr^chet  diffentiable  near  w\ 

(U 


A*.  \ 

ft...  > 


have  a  bounded  inverse  at  w  =  w*. 


Consider  the  block  iteration  scheme 
= w‘ + 

w}+1  =  w\  +  “!)  -  (5) 


THEOREM  2.1.  Assume  the  hypotheses  (V,)  -  (V5)  hold.  Then 
the  iteration  scheme  (5)  converses  with  /-step  Q-order  2  to  the 
uniyue  solution  u>‘  oj  (4)  provided  the  initial  jtiess  to0  is  suffi¬ 
ciently  near  w*. 

The  iteration  scheme  (5)  is  related  to  a  similar  procedure  due  to 
Hoyer  and  Schmidt  |lj  in  case  of  finite  dimensional  space.  The 
general  case  will  be  treated  in  14]. 
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3.  OPTIMIZATION  PROBLEMS  FOR  LARGE  CIRCUITS 


The  problem  to  minimize  the  functional 

j[.]  = 

where  lie  control  function  u  bekejs  to  some  set  ff  and  i  satis£es 


^=7(1,1,*),  x(0)  =  it, 

:can  be  reduced  bj  Pontljipa’rmnriram  principle  to  tie  bound- 
ary  value  problem 


dx 

dt 

dX 

dt 


dH, 
d\ 
dH , 
"  dx 


Ml 

V). 


x(0)  =  xo,A(0)=0. 


(6) 
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where  H  is  the  corresponding  Hamiltonfunctioo. 

In  case  dim  x  =  dim  X  »  1  we  may  in  general  assume  that  H 
has  a  special  structure.  This  snggests  the  use  of  block  diagonal 
procedures  for  the  numerical  solution  of  (6),  especially  WRM. 
The  corresponding  iteration  scheme,  related  immedeately  to  (6), 
can  be  written  in  the  form 


**(l)  =  ic+jfrA,(i‘,i‘-‘,A‘1it-,,i)dJ 

+  j'  A^**,**'1,  A1,  A1*',  s)cfj,  (J) 

A*(t)  = 

+  A*,A‘-*.u)*. 

With  respect  to  tbe  function s  A,  and  A,  we  assume  that  there  ate 
positive  numbers  k^  ,  i  j  =  1,2,  such  that  V  Xi,Xj,  Aj,  A),7i, 

(H). 

|/,(x„  u,  A,,  Aj,  I)  -  /,(!,, Is, X,J,,  l)|  <  (8) 

<D*.0)i*.--i.i+'?’ia.-a;i). 

«=i 

Furthermore  we  introduce  the  matrices 


(  *<l)  tx)  \  . 

«:=(bi»  ?>)■  ,=1'2- 

Now  we  are  ready  to  formulate  the  following  theorem. 

THEOREM  3.1.  Assume  the  hypothesis  (li)  holds.  Additionally 
we  assume 

Tc((I  -  TMi)’lM3)  <  1.  (9) 

Then  the  iteration  scheme  (7)  converges  to  the  unique  solution 
oj  (6)  for  any  initial  guess. 

Theorem  3.1  can  be  interpretedd  as  follows:  Under  the  given 
conditions  the  successive  optimization  of  the  subsystems  leads 
to  the  optimal  control  of  the  full  system.  In  case  that  (9)  cannot 
be  fulfilled  we  may  apply  the  approach  described  in  section  2. 
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Abstract  Mathematical  modeling  of  luaped 
physical  systems  such  as  electronic  circuits 
and  aechanical  systems  leads  frequently  to 
higher -index,  differential -algebraic  equations 
CDAEs?.  In  aany  cases  a  cosputati  onal 
analysis-  of  these  DAEs-  is  of  '  interest. 
However the  well-known  numerical  methods, 
like-  BDF  cr  Runge-Kutta,  are  applicable  in 
general  to  Index  one  problems  only.  In  this 
paper  we  shew  that  any  higher-index  DAE  can 
be  transferred  to  an  index  one  DAE  such  that 
the  set  of  solutions  Is  kept  invariant  during 
the  transformation. 


FCx.x'3  -  O.  C33 

only.  We  associate  with  DAEs  of  this  type  for 
any  non-negative  integer  a.  as  long  as  the 
involved  cappings  are  defined  and 
differentiable.  the  nonliiei/  system  of 
equations 


F0Cx.p5  -  O 


f_cx.  p?  «  o 


C3> 


i  introduction 

DAEs  are  frequently  identified  as 
implicit  equations 

FCt.x.x'3  ■  0.  F: E«K"*R"->Kr'  C13 

for  which  X*  cannot  be  expressed  explicitly 
as  a  function  of  t  and  x.  Such  DAEs  arise  in 
cany  areas  of  science  and  engineering.  In 
particular  aechanical  systems  and  electronic 
circuits  cay  be  modeled  using  equations  of 
the  type  Cl?. 

The  problem  Cl?  is  structural  quite 
different  from  an  ODE.  and  existing  ODE 
aethods  are  in  general  not  applicable  to  DAEs 
without  extensive  nodi fi cations.  The 
variation  in  the  structure  of  Cl?  from  an  ODE 
is  usually  quantified  by  an  integer  called 
the  index  of  the  prcMea  Cl).  While  Index 
zero  and  index  one  problems  can  be  solved  by 
the  well-known  aethods  like  BDF  or  Runge- 
Kutta.  it  Is  difficult  to  solve  higher-index 
problems  numerically.  For  that  reason  several 
techniques  for  the  reduction  of  higher-index 
DAEs  to  DAEs  with  a  lower  index  are  discussed 
in  the  literature  (1 ) .  (2) .  (3) ,  14).  However, 
aost  of  these  techniques  introduce  additional 
degrees  of  freedom  during  the  reduction.  This 
results  in  certain  Integral  invariants  for 
the  reduced  DAE  which  the  numerical  method 
may  not  keep  constant  during  integration. 

In  this  paper  we  discuss  a  technique  for 
the  reduction  of  higher -index  problems  to 
DAEs  of  index  one  that  do  not  introduce  any 
additional  degrees  of  freedom.  integral 
invariants  respectively.  To  distinguish 
reduction  techniques  with  this  property  from 
arbitrary  reductions.  we  call  them.  as 
suggested  in  (2).  Index  transformations. 


2  MATHEMATICAL  BACKGROUND 

To  simplifiy  the  notations,  we  rewrite 
Cl?  as  a  time-invariant  DAE 

FCz.x.x’?  -  0 
z*  «  1 


with 


F^Cx.p?:*  FCx.p?  . 

Fk+lCx'P>:"  Vx,p)D/kCx,p5p 

where 


C^Cx.p? 

is  a  projection  along 


n-1? 


and 


imCD^F^Cx.p?) 
DxFkCx.p>.  DpFfcCx.p> 


denote  the  partial  derivatives,  with  respect 
to  x  and  p.  We  call  C3?  the  der  1  vatjve  array 
of  order  m.  and  we  denote  the  solution  set  of 
C3?  by  LCm?  <3  RnxKn  anc  the  projection-  of 
LCra?  onto  the  first  component  by  MCm?  «  Rn. 

Let  us  assume  now  that  there  exists  a 
non-negative  integer  1  such  that 

Cl:  MCI?  is  a  differentiable  manifold. 

C2:  for  any  x  e  MCI?  there  exists  a 
unique  p  <1  Rn  with  Cx.p?  a  LCi?. 


If  such  an  integer  exists,  then  the  smallest 
integer,  that  satisfies  the  conditions  Cl  and 
C2.  is  called  the  jindex  of  the  problem  C5J . 

Let  C2?  be  a  DAE  of  index  1*  then  the 
condition  C2  defines  a  unique  mapping 
v; MCi?—>Rn  which  is  a  vector  field  on  MCI?. 
We  call  tht  manifold  M.  *  MCI?  the  configura¬ 
tion  space  and  the  mapping  v  the  corres¬ 
ponding  vector_ field  of  the  problem  Obvious¬ 
ly,  the  solutions  of  the  corresponding  vector 
field  are  identical  with  the  solutions  of  the 
given  DAE.  Thus,  in  this  case,  the  well-known 
existence  and  uniqueness  results  for  vector 
fields  are  applicable  to  DAEs  IS). 

In  several  papers  Cl), (3),  it  is  assumed 
that  the  derivative  arrayof  order  i  can  be 
rewritten  in  the  form 


and  consider  DAEs  of  the  typo 
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F1:  Kr'^r->E" 
F*:R"->S!" 


F*<x.p>  -  O, 
J^CsO  -  O. 


with 

arid 


rank(Dj;F1<x.p33  «  n 

ranfcliy^CSOJ  -  m. 


C4.13 

C4.23 


Thus  C4-23  yields  the  configuration  space  M 
of  the  problem  and  C4.13  can  be  considered  as 
the  implicit  description  of  an  COE 

x*  ■  g<x3.  giU-Mfc"  <53 

which  is  called  the  unde*  1  yj nq  1CCE  of  * >e 
pr  obi  eat.  Clearly*  the  restriction  of  the 
sapping  g  to  the  configuration  space  M  is 
identical  to  -  the  corresponding  vector  field 
v.  Thus  the  configuration  space  M  is  an 
invariant  sanifold  of  the  underlying  ODE  <53. 


3  INDEX  TRANSFORMATIONS 

For  the  computational  analysis  of  higher 
index  problems  It  seems  reasonable  to 
integrate  the  underlying  ODE  instead. 
However,  cost  numerical  methods  do  not  keep 
the  configuration  space  -M  Invariant  during 
the  integration.  This  results  In  the 
well-known  numerical  drift-off.  Therefore  we 
derive  lower  index  formulations  which 
maintain  the  configuration  space  of  the 
problem  In  this  chapter. 

First  we  generalize  an  index  transfor¬ 
mation  technique  which  was  proposed  by  GEAR 
in  £31.  Let  us  consider  the  formulation.  <43 
of  the  derivative  array  <33  which  is  an 
over  deter  mined  system  of  equations.  Then  the 
generalization  of  the  stabilization 
technique!,  as  proposed  by  GEAR,  results  in 
the  DAE 


TWCI3  -  NCx3 

where 

denotes  the  tangent  space  of'  MCI3  at  x  £S3. 
Let  PCx3  be  a  projection  onto  NCx3-  Ve 
replace  now  X’  in  C23  by  PCx3x’  and  obtain 
the  new  DAE 


FCx,PC5px*3  -  *  6.  <73 

This  substitution  makes  sense  because  any 
solution  of  <23  has  to  satisfy' 

X*  «  TxMCi3 

and  thus 

x'  ■  PCx3x*. 


Consequently  we  can  formulate  the  following 


Theorem:  The  DAE  <73  is  a  DAE -of  index  one 
and  has  exactly  the  same  solutions  as-  the 
given  DAE  <23  of  index  1. 

Proof :  Let  LCm3*.  MCm3#  be  the  sets 
associated  with  the  derivative  array  of  order 
m  corresponding  to  the  DAE  <73.  Then,  by 
definition  of  the  DAE  <73.  we  obtain  that 


iff 


<x,p3  q  L<03# 

<x,z3  e  LX 03  and  z  *  P<x3p  <83 


However .  because  C  51 


iff 


<x,p3  q  1X13 

<x,z3  <3  L<03  and  z  q  TxMCi3. 


we  rewrite  the  condition  <83  to 

<x,z3  q  L<13  and  z  ■  PCx3p. 


F,Cx.x--DxFiCx3t(j3  ■  O 
F*C>0  •  0 


C6) 


where  n  q  £Rm  and  t  denotes  the  transposed 
of  a  linear  mapping.  This  DAE  is  a  DAE  of 
index  2  and  has  exactly  the  same  solutions  as 
<23  with  respect  to  the  variable  x. 
Furthermore,  we  have  p  ■  0  for  any  solution 
of  <63. 

In  many  cases  it  might  be  difficult  or 
even  impossible  to  reformulate  the  derivative 
array  <33  in  the  form  <43,  Therefore  we 
propose  here  an  index  transformation 
technique  which  does  not  require  the 
formulation  <43. 

For  any  <x,p3  lot  us  consider  the  linear 
subspace 


NCx,p3;  ■  ker 


90<x.p3DxF0<x.p3 


Qi<X.p3DxFi<x.p3 


If  dim£N<x.p31  ■  const,  for  all  <x.p3.  then 
NCx.p3  does  not  depend  on  the  variable  p  and 
ve  have 


This  Implies  that 

MC03®  «  MCD 

and  „  , 

LC1)  "  LCO)  n  TMCO)* 

which  is  equivalent  to  (51 

LCD  -  LCD*.  ■ 
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Abstract  Solving  large-scale  linear 
systems  of  equations- by  conjugate 
gradient  (CG)  or  related  methods  requires 
a  good  preconditioning, of  the  original 
coefficient  matrix  to  obtain. an 
approximate  solution  in  a  reasonable 
number  of  stcp3.  In  can/- applications 
(higher  order  difference  schemes, 
singularly  perturbed  problems)  the 
matrix  is  not  symmetric  and  the  standard 
CG  method  not  applicable. 

In  this  case,  the  so-called  biconjugate 
gradient  method  will  be  interpreted  a3 
an  ordinary  CG  method  applied  to  an 
extended; system, with  symmetric  but 
indefinite  matrix.  This  idea  can  be  used 
to  derive  other  methods. 


X.  INTRODUCTICW 


Consider  the  linear  system  of  equations 

Ax  a  b  (1 ) 

with  symmetric  matrix  A.  Let  a  (possibly 
indefinite)  scalar  product 
(u,v)  =  uFcv 

be  defined  by  another  symmetric  matrix  C. 
Then  the  conjugate  gradient  method  for 
this  system  with  respect  to  this  scalar 
product  is  given  by 


Xj  starting  vector  ,  r1=b-Ax1  ,  p1=Cr1 


*k+1 


with 


■  *kPk 
rk+1  ”  rk"  °^kApk 
pk+1  "  v‘k+1 


Cr-"  +  (3kpk 


(2) 


CKy. 


9 

Pk^k 


and 


rklcrk 


The  choice  of  C  affects  the  convergence 
rate  of  the  method.  It  is  dost  natural  to 
take  as  0  an  approximation  to  the  invoroc 

A"1  by  means  of  incomplete  Cholcaky 
factorization. 

The  method  terminates  when  rk=0;  then  x«k 
is  a  solution  of  (1).  Howovor,  it  broako 


down  whenever  P^Apkc0  or  rj;Crk=0  .  In  this 


situation  the  process  is  ropeated  with 
another  starting  vector  Xj. 


II.  THE  BCG  METHOD 

Assume  now  a  system  (l)  with  a  not  necessa¬ 
rily  symmetric' matrix  A  together  with  the 
transposed  system 


AAy  =  c  . 

Both  equations  can  be  collected  together 

(:  :1  (;)■(;)  » 

and  solved  by  the  above  method  with  A, C 

replaced  by  (°  AQ  )  ,  (°T  q)  where 

the  new  C  is  again  an  approximation  to 

A”1  obtained  from  an  incomplete  LU 
factorisation  of  A. 

The  resulting  process  computes  iteratively 

three  sequences -of  vectors  (  ]  ,  [  *  J 

(  Py\  \yk/'  \sk  ) 

and  I  ‘I  according  to 

Uk  ) 

X|9  starting  vectors,  =  b-Ax1  , 

s1  =  c-A^y,  .  ?!  =  CP,  .  1, ■  CTS1 
xk+1 =  xk*  -*kpk 

yk+i  =  yk+ ot^k  (->) 

rk+i"rk"  ^Pk  >  pk+i=  Crk+i  +  Pkpk 

sk+1=  ak“  >  ^k+1"  C?3k+1+Pkqk 

with- 


°kCrk 

T* - 

qkApk 


and 


sk+lCrfcH 


This  method-  (with  C=I)  is  known  a3  the 
biconjugato  gradient  (BOG)  method.  The 
present  approach  now  suggests  to  study 
other  choices  in  (2)  for  solving  the  sai 
system  (3). 


III.  ANOTHER  SPECIAL  CASE  0?  METHOD  (2) 


in 


Again  A  in  (2)  is  replaced  by  f®  AJ  j 

ordor  to  solve  (3),  but  C  is  taken  as"* 

("T  *i-1 

V  EB^J  w^cro  is  an  approxi¬ 
mate  factorization  of  A  with  easily 
invertible  factors  E  and  P.  In  this  case 
the  updating  of  Xj^y^r^s^  proceeds  as 
in  (4),  but 
p,=.?-VT0i 
and  „  . 

q, .  E-V’r, 

With  y. 

Cl  -  ^k 

ak‘ 


pk+l“  p  1?  ^8k+t +  ^kpk 


2qk  Apic 


qk+1 =  B-‘ff 

Pk  2Tk 


k+1 


fk<>k 


T  -1  -T 

Oj^P  P  ov 


>  Y,Tp— Tp— 1 M 

+  rkE  E  rk 


Jfk“  “V  1  “k 
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Abstract  -  According. to  the  special  feature 
of  power  electronics, -systems  a  siaulation 
program  has  been  developed  and  tested  provid¬ 
ing  a  separate  modeling  of  the  sub-components 
on  an  interdisciplinary  basis. 

While  the  ac  supply  voltages  (1),  the  power 
electronics  circuit  (2)  and  load  (3)  are 
primarily  described  on  «,  network  model- basis, 
the  modeling  of  control  units  (5)  is  made 
with  block-oriented  functional  modules, 
whereas  for  modeling  the  gating  circuit  (4) 
a  control  graph  is  used  (Fig.  1). 

The  analysis  program  is  an  Integral  part  of  a 
design  system  equipped  with  a  data  base  which 
can  be  installed  in  UNIX  compatible  operating 
systems.  Through  its  implementation  in  the 
programming  language  C  the  demand  for  e  good 
portability  is  fulfilled.  As  regards  analytic¬ 
al  results  required  for  design  purposes  which 
are  not  directly  computed,  a  postprocessor 
with  graphic  representation  is  provided. 


J-i  INTRODUCTION 

The  design  of  power  electronics  systems  is  an 
iterative  process  in  which  the  satisfaction 
of  the  function,  being  on  its  turn  determined 
by  the  automation  process  aimed  at,  is  linked 
with  the  requirement  of  harmonizing  the  per¬ 
missible  static  end  dynamic  stress  limits  for 
active  and  passive  components.  For  design 
evaluation  purposes  an  analysis  model  is  necot 


sary  which  does  not  only  reflect  the  system¬ 
relevant  properties  but  is  also  compatible 
with  engineering  aspects.  In  order  to  include 
into  modeling  the  interaction  between  the 
switching  behaviour' of  the  power  electronics 
devices  (thyristor,  transistor,  GTO, 

IGBT)  and  the  actual  system  behaviour  (load) 
it  is  necessary  to  make  use  of  the  analytical 
methods  designed  for  microelectronic  circuits. 

II.  HY8RI0  SYSTEH  HODELING  AND  ANALYSIS 

In  Fig.  1  the  structure  of  a  power  electronics 
system  is  shown. 

Electric  circuits  are  usually  designed  on  a 
network  basis  while  control  systems,  owing 
to  their  mainly  functional  aspocts,  are  not 
currently  designed  on  the  level  of  their 
olectrical  networks. 


Fig.  1:  Structure  of  a  power  electronics 
system 


A.  Model  Properties 

The  network  aodel  of  an  electric  dynamic 
circuit  is  a  Mixture  of  algebraic  and  dif¬ 
ferential  equations-  It  is  non-linear 
especially  if  continuous  nodels  for  power 
electronics  devices  are  included.  Moreover 
these  Models  are  responsible  for  the 
rigidity  of  the  differential  equations  caused 
by  the  dynaaic  components  of  the 'aodel 
(input/output  behaviour) . 

The  network  equation  systea  is  set  up  on  the 
basis  of  the  Kirchhoff  laws,  the  current/volt¬ 
age  functions  of  the  eleaentary  networks  and 
circuit  topology  inforaation  according  to  the 
modified  aodal  approach.  Owing  to  the  pos¬ 
sibility  that  thore  is  a  choice  of  represent¬ 
ing  the  two-pole  relations  either  in  an  adait- 
tance  or  iapedance  fora  this  hybrid  network 
description  is  especially  suitable  for 
realizing  a  aodel  of  an  "ideal  switch". 

The  differential  equation  systea  which  can  be 
read  froo  the  block  structures  is  treated 
according  to  the  fiunge-kutta  oethod  of  the 
fourth  order  due  to  its  insensitivity  to 
discontinuities  in  the  behaviour  of  the 
systea  paraaeters.  The  block  cooputation 
sequence  is  fixed  according  to  the  signal 
direction  by  aeans  of  the  tree  sorting 
aethod.  The  causality  inherent  in  the  gating 
oethod  of  power  electronics  systeos  is  noted 
down  in  the  fora  of  state-event  sequences. 

The  interpretation  of  steady  state  graphs 
m  the  case  of  changing  states  gives  rise  to 
a  nodification  of  the  steplength  of 
integration. 


8.  Wuaerical  aspects 

Considering  the  special  properties  of  the 
algebro-dif ferential  equation  systen  (net¬ 
work)  it  is  solved  step  by  step  by  aeans  of 
the  Newton-Raphson  iteration.  The  energy 
storage  equations  are  discretized  on  a  two- 
teroinal  network  by  8DF  (8ackward  Differ¬ 
entiation  Foraulas)  or  alternatively  by  the 
trapezoid  foraula. 


For  iteration -the. linearized  equation  systeas 
are  repeatedly  solved. 

An  isproved  convergence  behaviour  of  the 
Newton  aethod  with  regard  to  the.  switching 
point  can  be  reached  by  ceans  of  a  defect- 
reducing  eabedding  of  the  network  equation 
systea. 

A  special  daaping  of  the. Newton  correction 
vector  Bakes  sure  that  the  order  of  the 
iterated  Coes  not  change  if.  the  behaviour 
of  the  function  is  very  flet.  The  break-off 
lisit  of  the  Newton  iteration  is  adapted 
to  the  aoaentary  order  of  the  unknown  in 
the  equation  systea.  To  maintain  the 
stability  of  8DF  which  is  controllable  with 
regard  to  its  steplength  and  order,  order- 
dependent  boundaries  of  the  steplength¬ 
changing  factor  are -necessary. 

Owing  to  the  possibility  of  changing  the 
order  of  integration  foraulas,  the  integ¬ 
ration  aethod  can  be  started  by -itself,  and 
aoreover  there  is  also  the  possibility  of 
suppressing  the  influence  of  the  solutions 
obtained  before  a  switching  point  on  the 
solutions  immediately  after  a  switching 
process.  For  this  purpose  the  trapezoid 
foraula  is  coabined  with  the  A-stable 
iaplicit  Euler  foraula. 

for  event  inclusion  asthnds  based  on  polynoa 
interpolations  have  been  tasted  with  a 
view  to  convergence  acceleration.  The 
inverse  Heraitian  interpolation  is  coabined 
with  the  Regula  falsi  to  fulfil  the 
conditions  made  with  regard  to  the  derivation 
values  of  the  polynoa.  As  the  event  includes 
an  increnent  corresponding  to  the  ainiaua 
peroissible  steplength,  it  is  necessary 
to  change  froa  the  interpolation  aethod  to 
the  robust  interval  halving  aethod. 
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The  results  obtained  froa  the  analysis 
of -the  motor  speed. control  for  a  DC  drive 


ere  shown  in  fig.  2.  As-can'beseen  the 
armature  circuit'of. the.  DC : drive  is  fed 
by  e  three-phase  foil-wave  bridge. 


Fig.  2:  Results  obtained  from  the  analysis 
of  the  motor  speed  control  for 
a  OC  drive 

Besides  tha  design  of  refined  valve  models 
future  developments  should  aia  at  using 
nuaerical  optimization  aethods  for  design 
purposes.  The  coaputation  of  the  gradient  of 
objective  functions  is  possible  by  aeans  of 
a  sensitivity  analysis  which  is  applicable 
to  the  network  model.  The  10  decoaposition 
of  the  Jacobi  matrix  during  the  Newton 
iteration  is  the  prerequisite  for  the 
efficiency  of  this  analysis  which  is  aade 
siaultaneously  with  the  transient  analysis. 
At  each  discretization  point  a  linear 
equation  systen  has  to  be  solved  per  design 
parameter. 
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Numerical-  solution  of  partial  differential  equations  with  a  time-dependent 
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1-  Introduction 


confounds  diffuse  with  the  flux 


The  production  of  semiconductor  structures  on 
a  silicon  wafer  will  be  simulated  by  process 
simulation.  The  redistribution  of  impurities 
in  the  silicon  waferj  called  dopant  diffu¬ 
sion,  and  changes  in  the  structure  of  various 
layers,  called  structuring  processes,  are 
cost  important  during  production. 

In  this  case,  a  silicon  wafer  is  considered, 
with  epitaxy  being  regarded  as  a  special 
step.  This  production  step  is  done  in  a  reac¬ 
tor,  m  which  the  silicon  wafer  is  placed  and 
a  gas  stream  transports  both,  silicon  com¬ 
pounds  and  dopants.  In  this  process  cnstal- 
line  silicon  is  deposited,  so  that  the  sub- 
strat  region  for"  dopant  diffusion  changes  in 
time.  There  is_also  a  impurity  redistnbuti on 
in  the  wafer  by  diffusion  due  to  the  high 
temperature,  which  is  coupled  with  a  dopant 
flow  through  the  silicon/gas  surface  because 
of  the  dopants  in  the  reactor  gas.  The  pre¬ 
sent  paper  discusses  a  programme  system  simu¬ 
lating  epitaxy  in  a  two-dimensional  case  (as 
well  as  other  technological ~ steps)  in  the 
context  of  changing  layers  and  diffusion, 
including  modelling  of  tho  boundary  condi¬ 
tions. 

2.  TS  ae-dc-pendent  variation  .  of  regions  in 
epitaxy 

The  2D  process  simulation  is  effected  in 
arbitrarily  polygonal-bounded  and  oinglo- 
conncctcd  regions.  The  discretisation  of  the 
boundaries  by  polygons  corresponds  to  a  given 
accuracy  <in  general  in  a  not-cquidistant 
way),  which  enables  an  effecient  description 
of  complex  regions. 

In  epitaxy  the  time-dependent  variation  of 
the  region  is  done  by' a  string  algorithm. 
Discretisation  of  tho  boundary  is  refined  and 
a  velocity  is  computed' for  each  discretiza¬ 
tion  point.  For  epitaxy,  the  direction  of 
velocity  is  given  by  the  outward  normal  and 
the  value  of  Velocity  depends  on  the  process 
parameters.  In  consideration  of  a  time  step- 
size  a  displacement  vector  is  given  in  each 
discretization  point  and  thus  provisional  new 
boundaries.  This  is  followed  by  an  algorithm 
for  loop  elimination.  Also  the  discretization 
of  the  new  boundaries  Is  adopted  to  tho  given 
accuracy  (s.  picture  1). 

3.  Diffusion  equation  for  impurities  in  epi¬ 
taxy 


g  * 

FZ  **  km*<pD  “  PD >  "ex  - 

Before  inserted  in^tho  silicon  lattice,  the 
dopants  are  stored  in  the. adsorbed  layer  on 
the  silicon  surface.  For  the  stream  F3  of 
impurities  fromthegas  layer  built'  in  .the 
adsorbed  layer  a  stationary  flow  is  assumed, 
namely 


FsCD.-n^  Fs(r>  •  kf  •  (pJ  -  C0F<i-i/kp>  . 

The  mass, balance  for  the  stagnant  gas  layer 
has  obtained  a  steady  state;  this  yields 


y  F  «*  F  (r)  dr  . 

2  \  S 

OF 

In  addition  to  /l/  a  locally  varied  insertion 
flux  into  the  adsorbed  layer  is  assumed. 

Now,  these  equations  can  be  used  to  obtain 


F  (r  > 
S  0 


with 


OF 


yall/b-km-Kf  ■ 
Vallum  +  ^  * 


This  is  tho  now  boundary  condition,  which  is 
used  in  the  algorithm. 


4.  Algorithms  for  the  solution  of  the  impuri¬ 
ty  diffusion  problem 

For  the  simulation  of  epitaxy,  the  algorithm 
in  figure  3  is  used.  The  variation  of  the 
rugion  is  done  according  to  paragraph  2.  In 
the  next  3  steps  of  this  algorithm  (as  well 
as  in  other  process  steps  in  DUPSIM  /2/) ,  for 
space  discretization  tho  cell  method  and  for 
time  discretization  the  implicit  Culer  method 
are  used. 

For  epitaxy,  the  time-dependent  region  has  to 
be  taken  into  consideration.  The  concentra¬ 
tion  of  dopants  from  one  time  step  to  ano¬ 
ther  is  computed  in  assuming  the  condition 
that  no  dopands  are  lost.  The  formula  is 


For  tho  technological  stop  of  opitaxy  tho 
diffusion  equation  has  to  be  solved  os  a 
parabolic  differential  equation.  Hero  only 
the  boundary  condition  of  the  silicon/gas 
surface  in  opitaxy  are  considered  <s.  .figure 
2). 

The  main  gas  stream  of  an  epitaxial  reactor 
transports  silicon  and  dopant  compounds. 
Between  main  gas  stream  and  the  wafer  there 
ifc  a  stagnant  gas  layer  in  which  the  above 


f  c5  .  <  n  vj*1  > 

c  r1  - - - 

where  Ci>  is  the  concentration,  Vk  tho  volume 
of  the  cel  1 »,  N  the  number  of  ail  discretiza¬ 
tion  points  and  n  and  (n+l)  are  two  time- 
steps.  For  discretization  points  according  to 
silicon  surface,  it  is  corrected  by 
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c?’1  :=  cj+1vj.(lfvlad/,v;)/(v;*;*vlad) 

because  of  storage  of  dopands  in  the  absorp¬ 
tion  layer.  In  comparison* to-  diffusion  in 
time-dependent- regions,  the  volume  increased 
by*- vi ad  is  us®d  *or  the'Cgll. Vj-' 

5.  Example  for  epitaxy  simulation 

The  application-of  the  epitaxy- model -will  be 
shown  by  means  of  the  example  of  a  low  resi¬ 
stance-buried  region,  with  the  simple  silicon 
wafer  being  the  basis.  It  is  structured  by 
lithography  in  such,  a  way  that  the  silicon  is 
etched  and  antimony.is  implanted  on  the 
right  hand  side  (figure  *f).  Then,  all  layers 
on* silicon  are  removed  and  an  inert  diffusion 
is  effected. 

The  silicon  will  be  deposited  in  the  epitaxy 
Step  within  time  S.Omin,  of  a  velocity^ 
0.4pm/mln  and  a, temperature  1150*C.  Phos¬ 
phorus  from  the  gas  stream  is  built  in.  Figu¬ 
re  S  shows- the  result  of  this  step.  The  anti¬ 
mony  profileon  the  left-hand  side  is  caused 
by  lateral  autodoping.  The  vertical  autodo-- 
ping  results  in  d  greater  slope  of  profile  in 
the  epitaxy  layer  than  in- the  substrat. 
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Figure  2:  transports  of  dopants  In  epitaxy 
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Figure  3:  algorithm  for  epitaxy  simulation 


Figure  4:  structure  alter  implantation 


Jf-MhSt  pff* 

Figure  5:  structure  after  epitaxy 
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Abstract.  When  one  studies  the  influence  of  interconnections 
on  the  delay  time  of-MOS  integrated  circuits,  a  system  of 
parabolic  equations  coupled  by.boundary,  conditions  appears. 
Lower  and  upper  bounds  for  the  solution  are  inferred  and  ver* 
ified  bv  numerical  integration. 


2.-RESULTS 

Lei. us  denote  for- each  *  =  1,2,..  ,n,  by  a,  the  solution  in 
(0,r/2)  of  the  equation 


1. 'Introduction' 

Let  us  consider  a  ”mixed  type’’  circuit  (see  Fig.  1). 


Fig.  1.  The  network  under  study 


It  consists  of  n  retransmission  lines  connected  to  a  2n- 
resistivc  port.  The  lines  are  described  by  one-dimensional  tele¬ 
graph  equations  and  the  multiport  is  modelled  by  the  matrix 
G.  The  sources  are  supposed  to  be  switched  off  at  the  moment 
i  —  0.  These  lead  to  the  following  mathematical  model,  (see 
(3-5]); 

-  a  system  of  linear  parabolic  equations  (u,  is  the  voltage  along 
the  t-th  line): 


du,  d*ui 
dt  ~a‘dx*  ~ 
1  >0,  i€(0,<f.) 


b,m , 


(1) 


-  a  system  of  boundary  conditions  coupling  the  above  cqua- 


r  «t(o) 

-A.&M.) 

-G 

ut(d|) 

«»(0) 

Lu.(</„)J 

the  initial  conditions* 


a,*  tan  a,  max{((?2i-i.?«-i  +  621-1,21);  (Gri.ri  +  ^2i,ti-i)] 
Wc  set 


d,(r)  —  cos  for  x  6  and  A,-  =  4^^-  +  6. . 

Also,  for  every,  i  =  l,2,...,n,  denote  by  0,  the  solution  in 
(0,;r/2)  of  the  equation  0ttan0,  =  ~~  min(]Tj|” ,  C?2,-i,ii 
£jk»»62i,t)  and  kt  P  =  i  =  1,2, ...  ,n)  and  7  — 

min{4~p  +  b{%  i  =  1,2,  Also,  fl\(x)  =  cos-1— —0. 
Now  wc  are  ready  to  formulate  the  result. 


'llEOREM  1. 
ct  us  assume 


that  the  following  hold. 


1.  a,  >0,  6,  >  0,  h,  >  0  for  all  1  =  1,2,  ...,n  , 

2.  G%)  <0  for  all  tj  *s  l,2,...,2n, 

2  n 

^Gt)>0  for  all  i  “  1,2, ...,2n  , 

3.  u, o(x)  >  0  forx.Q  (0,di|  and  i «  l,2,...,n  . 

Then,  if  it  =  (ui,«..,u„)  is  the  solution  of  (l)+(2)+(3)  with 
u,  €  CflO.T)  X  (0,</.))  and  ift,  e  C«0,T)  x  (0, <!,)),  we 
have  for  each  i,  and  x  6  (0,d,): 


u,(^)>o-«x)4n,n^g 


and 


ii,(0, x)  «  «,,(*),  x  €  {0,d,J,  i  =  1,2,..., n  (3) 

This  kind  of  problem  appear*  when  we  study  the  influence 
of  interconnection  wires  to  the  performances  of  a  MOS  inte- 
*hip,  especially  from  "the  delay-time'*  point  of  view 
jl,2j  Till  now  we  have  obtained  results  concerning  the  exis¬ 
tent  and  uniqueness  of  this  problem  (3,4 ,7j,  as  well  as  evalua¬ 
tions  of  the  decay  of  the  solution  (5,G)  even  for  certain  nonlinear 

^Hrre  wc  arc  interested  again  in  asymptotic  behaviour  pre- 
dicting,  giving  upper  bounds  of  each  component  of  the  solution 
(not  a  global  bound  as  till  now)  but  also  lower  uounds  of  ex* 
jjxmefttial  type. 

♦  On  leave  from  taeuliy  <>rtlfctrol«hnicai, 

lVJ)t<\fmk»I  fnililut*  of  Huch*r«i,  77206  Bu<h*r«t,  Romani* 


3.  Sketch  of  moor 

The  proof  is  based  on  (lie  maximum  principle  for  parabolic 
operators,  the  first  step  being  to  prove  that  ti,(t,x)  >  Oforalli, 
t,  x  Let  us  denote  in  =  inf{u,(f,x);  t  €  (0,T|,  x  €  (0,d,],  i  = 
1,2,  ,n)  =  ur(fm,xm)  and  suppose  m  <  0  lfx„,=0,  from 
Olid  rows  of  (2)  wc  easily  obtain  ,  0)  <  0  contradicting 

a  known  result  (8,  Cli  3,Th.4j.  The  same  happens  if  we  suppose 
xm  -  dr.  Then  the  maximum  principle  give  m  =  u,(0,xm) 
from  where  we  obtain  the  desired  positivity. 
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To  obtain  the  positive  lower  bound  we  change  the  lunctions 
u,(f,x)  =  i,,(f,x)c~A‘l  &{x),  and  for  the  system  with  v,-  as 
unknowns  apply  again  the  above  reasoning,  finding  r,(f,x)  > 
mihoocrf;  v,(0,x)  from  where  we  have  the  lower  bound.  With 
the  change  u,(f,x)  =  »;,(<, x)c~,,  ^(x); we  obtain  by  a. similar 
method  v,(t.z)  <  max0<r<4.  «,(0,x)  from  where  we  derive  the 
ikiZ* 

upper  bound 


4.  Numerical  example 

In  order  to  verify  how  far  are  these  bounds  totthc  exact  so* 
lution  wc  have  computed  numerically  many  typical  examples 
from  MOS  interconnection^  Wc  have  used  a- finite  element 
discretisation  in  space  combined  to  a  ready-made  subroutine 
from  NAG  library  (residual  evaluations  method)  for  time  inte* 
gration. 

Let  us  consider  the  tree-type  network  from  Fig.  2 


Figure  2.  An  example 

The  values  of  parameters  arc:  «i  as  1/10,  bi  =  1/100,  a j  =  l, 
^  ss  1/10,  A)  as  hi  at  1,  dj  =a  <f2  =s  1,  for  the  lines  and 
R,  =  1,  JI2  a  10  for  the  lumped  part.  From  here  wc  derive 
the  G  matrix:  Gw  —  Gw  **  1»  G22  -  32/120,  Gu  =  23/120, 
Git  =s  <3*42  -  -1/12,  the  remaining  elements  being  zero. 

The  numerically  computed  components  of  the  solution  are 
shown  in  Figs,  3  and  4. 


The  Figs,  5  and  6  present  the  solutions  at  the  right  hand 
end  of  the  lines  together  with  their  bounds. 


Line-1  point-  100000(21/21) 


MARKERS 
o»  solution 
9  a  lower  bound 
At  upper  bound 


a  «  0.65327 
ft  *  023066 
X*  0.19071 

7 »  aisaiiMO'1 

I-  0.71776 
U»  086266 


Line  * 2  Space  point  *  1XXXJ00  (21/21) 


MARKERS: 

O  =i  solution 
v  » lower  bound 
a  ■  upper  bound 


a**  065327 
0-  023066 
160705 

7—  3.12811*10“ 
I*  076480 
u  0.86266 


5.  Concluding  remarks 

If  in  assumption >2.  we  take  >  0  for  albt  = 

l,2.«.,2n,  then  all  the  components  are  globally  exponential 
asymptotically  stable  even  for  the  case  b,  —  O.  If  G,}  =  0 
for  at  least  one  i  —  1,2, ..  ,2n,  and  for  the  same  *  we  have  6,  =  0, 
then  -y  =  0  and  wc  obtain  only  constant  upper  bounds  of  all 
components.  Prom'this  point  of  view,  the  low’er  bounds  seem  to 
be  better.  Indeed,  if  for  a  certain  i  we  have  max((<?2i-i,2i-i  + 
G’2i-i,2i)l(G,2i,7»  -F  £21,71-1 )}  =  0  and  b,  =  0  then  we  obtain  a 
time-constant  lower  bound  for  u,,  while  the  other  components 
may  have  exponential  lower  bounds.  The  most  numerical  ex* 
amples  seem  to  show  also  that  the  lower  bound  is  tighter,  but 
other  examples  have  given,  an  opposite  result.  Anyway,  the 
bounds  arc  sufficiently  tight  to  be  uscfull  for  designers  if  wc 
take  into  account  their  "global”  feature  and  their  easy  compu¬ 
tation. 
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'A 'fixed- domain  approach  in, an  optimal 
shape  design  problem 


D,  Tiba 
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Abstract.  A  fixed  domain, approach  is  pre- 
sented-for  solving  optimal  shape  design  prob¬ 
lems.  In  the  proposed  method  the  original  op¬ 
timal  shape  design,  problem  is  converted  to  a 
control  problem  settled  in  a  fixed  domain.  The 
method  is  demonstrated  in  solving  an  optimal 
shape  design  problem  arising  from  transmission 
problems.  -Results  of  numerical  tests  are  presen¬ 
ted. 

Keywords.  Optimal  shape  design,  control  ap¬ 
proach. 

Subject  classification:  49E99, 35J67 
1.  Introduction 

The  standard  way  to  solve  optimal  shape  de¬ 
sign  problems  numerically  is  the  boundary  vari¬ 
ation  method.  In  that  method  the  unknown 
boundary  is  parametrized  using  a  set  of  design 
parameters  [4],  [5], 

Here,  we  discuss  another  approach  suggested 
by  recent  controllability-type  results  for  elliptic 
systems  (l],  [6],  (7).  It  may  be -mainly  com¬ 
pared  with  the  method  of  mapping  or  with. the 
fictitious  domain  method  (3).  In  the  method  of 
mapping  the  problem  is  converted  to  a  control 
problem  in  a  fixed  domain  and  control  in  coeffi¬ 
cients  whereas  in  fictitious  domain  method  the 
control  is  on  the  right  hand  side. 

We  shall  convert  the  problem  to  a  control 
problem  in  a  fixed  domain,  with  control  in  the 
coefficient.  However,  unlike  in  the  method  of 
mapping  or  fictitious  domain  method,  the  topol¬ 
ogy  of  the  variable  domains  is  not  given  a  priori. 

Our  basic  idea  is  a  simple  one:  if  ft  is  a  reg¬ 
ular  subdomain  of  a  fixed  domain  D,  then  it  is 
possible  to  find  some  mapping  p  :  D  -+  R  (by 
an  exact  controllability-type  argument  [7])  such 
that 
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p  >  0,in  ft,  p  =  0  on  3ft  amd  p  ^  0  in  D  \  ft. 
Then  the  Heaviside. mapping  H  :  R  ->  R 

*(p)  =  {J’  o'  (u) 

10,  p<  0, 

is  the  characteristic  function  of  IT  in  D. 

In  the  sequel,  we  shall  apply  this  approach  to 
a  model  problem  discussed  by,  Cea  (2)  and  Piron- 
neau  [5,  Ch.  8].,  In  section  2,  we  perform  a  brief 
theoretical  analysis  of  the  proposed  method  and 
some  numerical  results  are  given  in  section  3. 

2.  The  main  results 
We  study  the  Mowing  optimization  problem 

minimize  /  |t/p  -  Vdf  dx  (P) 
DCD  J  E 

subject  to  the  transmission  problem 

-aIAy1  +  a0yi  =  /  in  ft, 

-a2Ay2  +  ao!/2  =  /  in  D  \TT, 

0x~  =  02^-  in  3ft\(3ft  fi  dD),  ^ 
yi  =  t/2  in  3ft\(3ftn3D), 

ai—  =  o  in  9D>  i  =  i  2  . 
on 


Above  ao,  oj,  a2  are  positive  constants,  de- 

on 

notes  the  exterior  normal  derivative  to  ft  or  D, 
W6£*(J5),/€£*(ft), 

E  C  D  is  a  fixed  measurable  subset  and  yp  g 
//'(ft)  is  given  by 


,  ,  /  yi(*)  in  ft, 
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If  x  is  the.  characteristic, function  of  ft  in  D, 
then  the  variational  formulation  of<  the  problem, 
(T)  is  given’by 

jjjfi lx  +  o2(  1  -  ■  Vturh  ^  ^ 

aoyftw  -  fw)  dx  =  0  V  w'&Hl(D) 
and,  in  [5],  it  is>analysed'the  case  when 


X  €  {</ :  D  ->  R  |  g(x)  =  0  or  g(x)  =  1  Vs  6  D} 

(2.3) 

is  the  control  parameter.  As  the  form  of  the 
constraint  (2.3)  makes  the  problem  difficult  to 
handle  we  replace  (2.2)  by 

f  ([aiH(p)  +  a2(l- H(p))]VyaVw 
JD 

+  oo yntn  -  fw)  dx  =  0,  V  w  s  HX(D). 

(2.4) 

We  approximate  (2.4)  by 

/d((« A(p)  +  «t(l  -  tf<(p))]VyVm 

+  a0yw  -  fw)  dx  =  0,  Vw  6  ff'iP) , 

where  f/t  is  the  Yosida  approximation  of  the 
maximal  monotone  extension  of  /f  in  R  x  R. 

THEOREM  2.2.  let  yc  e  Hl(D)  be  the  unique 
solution  of  (2.5).  Then  yc  -*  ya  strongly  in 
Hl(D),  when  e  -*  0. 

We  approximate  the  problem  (P),  (T)  by  the 
following  one: 


minimize 


dx, 


(P«) 


subject  to  any  measurable  p  and  y  €  H1( D ) 
given  by  (2.5). 

Remark  2.3.  Generally,  in  the  absence  of  some 
compactness  assumption  on  the  class  of  subdo¬ 
mains  ft  (for  instance  £-cone  property,  Piron- 
neau  (5,  Ch.3]),  one  may  not  obtain  the  exis¬ 
tence  of  a  solution  for  the  problem  (P),  (T).  The 
same  is  valid  for  the  problem  (Pc)  since  there  are 
no  coercivity  conditions  on  the  control  param¬ 
eter  p.  Obviously,  one  may  ask  a  boundedness 


condition  oh  p,  |p(i)|  <  1,  due  to  the  relation¬ 
ship  between  p  and  ft.  But- this  does  not  imply 
existence  since  the -weak  limit  in  £°?(jD)  of  a 
sequence  of  characteristic  functions  is  hot  nec¬ 
essarily  a  characteristicfunction. 

We.  denote  [y’-,pc]  an  ^-optimal  pair  of  the 
problem  (Pe),  5  >  0,.that  is: 

■?.(*,«)<  «rf(Pi)+-«,  (2-6) 

where 

Jc(ye,Pc)=  I  \yc-yA2  dx 
Je 

and 

/•  ([mRe(Pc)  +  a2(l  -Hc(pc))}VtfVW 

JD 

+  aoycw  -  fw)  dx  =  0,  Vu>  6  HX(D)  • 

(2.7) 

Proposition  2.4.  For  e  -*  0,  we  have  yc  —>  y 
strongly  in  L2(D)  and  weakly  in  HX(D)  on  a 
subsequence,  such  that 

f  \y-  rf  dx  <  inf(P)  +  5  (2.8) 

JE 

In  order  to  solve  the  problem  (P£)  by  a  gradient- 
type  method,  we  discuss  the  adjoint  system.  Let 
9e  :  LM(D)  d?(D)  be  the  approximate  state 
mapping  p  -»  y  defined  by  (2.5)  (we  restrict  p 
to  be  in  L°°(D)  here). 

Proposition  2.6.  0C  is  a.Gateaux  differentiable 
mapping  and  VBc(p)v  =  r.  satisfies 


‘J  [(ai  -  a2)H'c(p)vVy  ■  Vu>+ 


(aI//«(p)  +  n2(l-//e(p)))Vr.Vtn 
+  aortuj  dx  =  0  V  to  S  JIX{D), 


(2.9) 


where  y  =  9e(p)  €  1I1(D)  and  v  is  arbitrary 
fixed  in  L°°(D).  Moreover  r  6  //’(£>). 


3.  Numerical. examples 
In  this  section  we  choose  D  =  E  =  (0, 1)  x 
(0, 1),  at  =  10,  o2  =  ao  =  1  and  /  =  x\  +  x2. 
The  Heaviside  mapping  is  approximated  by 


Up) 


-  h~r/c, 

p/r. 


p>o, 

p  <  0  . 


(3.1) 
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The  regularized  state  problem  (2.5)  is  discretized 
by  the  finite  element' method..  The, control. pa¬ 
rameter  p  in  (P£);is  taken  to  be  piecewise  com 
stant. 

In  the  numerical  solution- we  have  used  a  gra- 
.dient  algorithm  from  the  NAG-subroutine  Li¬ 
brary.  In  computations  the' cost  function  was 
scaled  with  a  factor  0.5' xlO5  and  the  regular¬ 
ization  parameter  g  =1/10.  was  used.  As  the 
discrete  control  parameter  p  is  piecewise  con¬ 
stant,  nodal  averaging  of  it  was  done  for  plotting 
purposes. 

Example  3.1.  Let  yd  be  the  solution  of  the 
transmission  problem  in  the  geometry  shown  in 
Figure  3.1.  Two  runs  with  100  and  900  finite 
elements  were  done.  In -both  cases  initial  guess 
p  =  0  was  used.  The  results  are  shown  in  Table 
3.1  and  Figures  3.2-3.3  (contour  p  =  0).  In  both 
cases  the  global  optimums  were  clearly  found. 


Elem. 

In.  cost 

CPU-s. 

100 

23.1 

iaaa«Mii 

29 

900 

24.9 

395 

Table  3.1 
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Abstract- In  this  paper  we  show  how  a  simplified  model 
of  CMOS  structures  switching  conditions  can  be  used  to 
obtain  explicit  delay  formulations,  with  clear  evidence  of 
relevant  parameters.  Applications  to  a.  data  path 
evaluation  and  to  the  definition  of  a  local  strategy  for 
transistor  sizing  are  given  as  necessary  facilities  for  a  cell 
synthclizcr. 


I.  INTRODUCTION 

If  top  down  techniques  allow  to  control  the  complexity 
found  in  digital  circuit  designs,  physical  issues,  such  as 
layout  style,  transistor  size,  wiring  and  timing  arc  too 
much  important  for  tlicqualily  of  the  resulting  design  to  be 
ignored  until  the  last  implementation  step.  As  part  of  the 
design  automation  project  conducted  in  the 
Microelectronics  Department  (LAMM)  of  Montpellier, 
we  arc  developing  a  technology  independent  cell 
synthclizcr  with  automatic  performance  evaluation 
facilities  (1). 

Efficient  structural  synthesis  demands  the  use  of  prior 
structural  criteria  as  an  effective  initial  solution  to  layout 
implementation  and  to  evaluate  propagation  delays  across 
data  paths,  whilst  taking  into  account  device  sizes, 
process  parameters,  layout  parasitics,  input  waveform 
from  driving ccllsandactivcandwiringoulputloading.  In 
order  to  do  this,  simplified  delay  models  with  clear 
evidence  of  structural,  technological  and  environmental 
parameters  must  be  defined  at  the  transistor  level. 

The  aim  of  this  paper  is  to  present  an  accurate  analytical 
model  for  the  propagation  delay  of  CMOS  gates  and  its 
use  in  the  cell  synthclizcr  for  speed  performance 
evaluation  and  transistor  sizing. 


IL  Delay  Modelling 


Great  efforts  have  been  made  to  develop  a  fast  and 
accurate  riming  analysis  [2.5J.  Simplified  RC  models  [6] 
have,  until  now.  been  limited  to  unidirectional  structures 
with  a  low  degree  of  accuracy  in  treating  transistor  serial 
arrays.  Common  alternative  has  been  to  use  delay  table 
representation  [3)  and  polynomial  decomposition  (71 
which,  as  technology  dependent  solutions,  arc  limited  to 
standard  unit  evaluation  allowing  the  propagation  or 
worst  or  best  case  delays.  Moreover,  without  explicit 
evidence  of  the  physical  parameters,  these  modcls.can 
only  be  used  to  develop  tools  for  the  verification  of  delay 
specifications  and  donot  allow  thedefinition  of  sizing  and 
optimizarion  criteria. 


By  dividing  the  data  path  into  unidirectional  element 
(including  transmission  gates  associated  to  their  power 
source),  it  appeared  possible  [8,9],  from  the  physical 
modeling  of  the  individual  transistor’s  switching 
operations,  to  obtain  a  real  delay  evaluation  of  ANDORI 
■from  a  linear  combination  of  the  driving  (i-1)  and  the 
controlled  (t)  structure’s  step  responses,  as  follows: 


ha.  (  i  ) 


4  ■  tuh  (a  —  I  )  •*■  httj  (  i  ) 


lra.it. 


•llLtU) 


(1) 


lui  (  i  ) 


B  .  htu  (  r- 1  )  +  tut,  (  r  ) 


1+a  .  B 


'nun) 

•Uh  (!) 


where:  -  A,  B  arc  linearization  coefficients, 

-  a  is  an  input  slope  correcting  factor  [10], 

-  tHLsriuisarcthcfall  and  rise  step  responses 
of  general  ANDORI,  evaluated  for  an  evolution  of  lire 
output  voltage  from  the  static  level  to  Vcc/2  [9J 


These  step  responses  can  be  obtained  directly  from  the 
mean  charge  transfer,  evaluated  across  the  node  under 
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consideration  and.produced  by  the  imbalance  current' 
developed  in  the  cell  under  evaluation,  as  follows; 


tjtt  { 1 )  —tjr. 


CL 

2cK(.n 


'"“->=-2: -7^77 


(2) 


for  an  inverter  made  up  ofN(C>r)  and  P(CP)  transistors 
loaded  by  a  capacitance  Cl.  where : 


III.  Application  to  CMOS  Datapath  Evaluation 

The  use  of  these  expressions  to  evaluate  delays  across 
vanousdata  paths  has  resulted  in  values  which  are  very 
dose  to  those  obtained  by  electrical  simulations 
(discrepancies  of  less  than  10  %)  (9,7,10).  These 
equations  have  been  integrated  into  a  timing  predictor 
(PATH  RUNNER  [11]),  allowing  a  fast  and  accurate  post 
layout  evaluation  of  the  speed  performances  of 
synlhetized  cells.  Figure  la  and  lb  summarize  thespeed 
arid  accuracy  performances  obtained  by  characterizing  a 
2)im  CMOS  library. 


2 Cox  WN  Lmin 1  8 Vcc  (Pec-Pi)  .. 

tsT~  jis  Cox  \VK  (Vcc-Vi)' IVcc*  +  4PrJ  -  \2VccVt  W 

is  the  clementa-y  fall  time  characteristic  of  the  technology 
and : 

Reff  Co  l Co  n’cA 

<**+  »+— J  w 

represents  the  step  response  of  an  inverter  loaded  by  a 
serial  array  of  transistors  (TG). 

This  expression  has  been  obtained  while  considering  that 
during  theswi telling  process  thcTG  arc  working  in  linear 
mode  and  can  be  modeled  by  a  resistance,  Rjir  and  a 
capacitance  Co.  As  given  in  cqu.2.  teev  represents  the  step 
responses  of  the  driving  inverter  loaded  by  the  complete 
array  equivalent  load,  Q+nCo.  The  second  term  of  cqu.  4 
represents  the  propagation  delay  on  the  array,  of  the 
charge  variation  imposed  by  the  sourcing  device. 

Equ.l  and  4  can  easily  be  used  to  express  the  real 
responses  of  general  ANDORI  as  the  product  of  3  terms : 

-  a  temporal  scaling  factor  Tu,  characteristic 
of  the  technology, 

-  a  structural  factor  (  Cm,  CPl  n  )charactcrislic 
of  the  switching  network, 

-  an  environmental  factor  including  the 
loading  factor  ( generalized  fanout  term )  specifying  the 
total  active  and  parasitic  ( structural  and  interconnection ) 
load,  and  the  controlling  device. 


Fipure  1  :  Illustration  of  the  characteristics  of  the 
PATH-RUNNER  timing  evaluator : 

a)  speed  performances  arc  evaluated  on  SUN  3/60  (1.5  MIPS), 

b)  delay  values  evaluated  on  different  data  paths  arc  compared 
to  values  obtained  from  SPICE  simulations  (2i>  i, 
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IV.  Application  to transistor  Sizing 

Written  in  reduced  form  with  respect  to  a  capacitance 
and  used  as  a  reference,  equation  4  gives  direct 
information  about  the  relative  delay  cost  of  structural  and 
loading  choices,  thus  allowing  the  definition  of  explicit 
sizing  laws. 

For  example,  the  sizing  of  an  irregular  inverter  array  is 
directly  obtained  from  the  solution  of: 

fl  i  if)  +  0+Ff)‘  0 + 1) 

1)'  X«(i) 

where  Xjj,  Yi  arc  the  values  of.N  transistor  active 
capacitance  and  parasitic  load  normalized  with  respect  to 
the  reference  capacitance. 

This  represents  a  system  of  dependent  equations 
(XK(i)2,  Xk(i+1)  .  XN(i-l) )  with  a  complexity  equal  to 
the  number  of  elements  to  be  sized.  A  global  solution  can 
be  obtained  through  successive  iterations  of  the  initial 
solution,  but  major  difficulties  arc  met  in  solving 
divergence  branches. 

As  an  effective  speed  area  trade  off,  we  have  proposed  a 
local  solution  allowing  the  backward  processing  of  the 
array  from  thcouiputload  to  the  input. This  local  solution 
is  defined,  sizing  each  cell  with  respect  to  a  reference  cell, 
which  will  define  the  area  delay  trade  off.  For  example, 
using  a  minimum  size  inverter  as  a  reference  ( N  transistor 
reduced  width :  Xn(0)=  1),  it  can  be  shown  [12]  that  the 
local  sizing  law,  deduced  from  cqu.S : 

(0  =  ~r==j-/l'-.X*(0)  (6) 

V  *  *rr 

allows  the  sizing  of  an  array  of  inverters  with  a  smaller 
area  (.  50%  less ),  at  the  expense  of  10%  extra  delay,  with 
respect  to  the  global  sizing  that  we  can  obtain  using  a 
mathematical  minimization  algorithm  (13). 

These  results  can  easily  be  generalized  to  ANDORI, 
allowing  a  fast  initial  estimation  of  transistor  sizes 
corresponding  to  a  minimum  area  implementation  with 
nearly  optimal  delay. 


V.  Conclusion  and  Future  Developments 

We  have  presented  a  physical  modelization  of  delays  on 
CMOS  structures;  allowing  the  fast  and  accurate 
evaluation  of  delays  and  the  definition  of  efficient  sizing 
laws.  These  results  have  been  applied  to  a  cell  synthetizer 
'for  automatic  transistor  sizing  and-  evaluation  of 
performances.  The  parametrization  of  a  layout  induced 
parasitic  associated  to  structural  choices  is  under 
development- as  an  efficient  way  to  drive  structural 
synthesis. 
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V  Introduction 

Linear  EC  circuits, particularly.RC  trees,  Have  been  very 
popular  for  modeling  the  signal  propogation  aud  delay 
in  digital  integrated  circuits.  A  linear  RC  tree  is  a  rea¬ 
sonable  model  of  the  physicaLinterconnect  of  an  inte¬ 
grated  circuit.  More  importantly,  the  RC  tree. response 
waveforms  are  guaranteed  monotone  and  therefore  can 
'be  analyzed  with  extreme  efficiency  in’ terms  of  the  first 
moment  of  the  impulse  response,  also  known  as.  the  El¬ 
more  delay  [4)  One  can  compute  the 'Elmore  delay  for 
an  RC  tree  with  path  tracing,  in  linear  time.  This  effi¬ 
ciently  provides  a  dominant  time  constant,  or  first-order 
waveform  approximation  for  the  RC  tree' response.  Fur¬ 
thermore,;  best/worst  case  bounds  can  be  obtained  with 
equal  efficiency  (12).  Recently,  with  multilayer,  submi¬ 
cron  interconnects,  more  generalized  RC  circuits  have 
been  used  to  model  oh-chip  signal  propogation.  The 
effects  of  dominant  zeros  and  non-monotone  waveshape 
behavior  require  the  use  of  higher-order  approximations 

13.  9). 

Asymptotic  Waveform  Evaluation  (AWE)  (9),  an  nth- 
order  extension  of  the  El  more  approximation,  for  RC 
trees,  is  one  such  higher-order  approximation  for  RiC 
circuits.  AWE  performs  model-order  reduction  by  a  mo¬ 
ment  matching  technique  recognized  to  be  a  form  of  par¬ 
tial  Pade’  approximation  (6).  To  obtain  a  qth  order,  i.e.,  a 
q  time  constant;  approximation,  2q  moments  drawn  from 
the  actual  circuit  and  the  model  are  matched.  Moment¬ 
matching  methods,  however,  are  prone  to  yielding  unsta¬ 
ble  models  of  stable  systems  (2).  To  avoid  unstable  ap¬ 
proximations,  it  is  suggested  that  higher-order  approxi¬ 
mations  be  tried  [2).  Unfortunately,  due  to  finite  machine 
precision,  positive  time  constants  are  even  more  likely  at 
higher  orders.  This  paper  proposes  a  constrained  opti¬ 
mization  for  mapping  moments  to  stable  dominant  time 
constants  in  AWE  for  RC  interconnect  models. 

Section  2  briefly  reviews  the  development  of  the  equa¬ 
tions  in  AWE  that  form  the  objective  functions  in  this 
work.  The  proposed  optimization  scheme  is  described 
in  section  3,  followed  by  some  results  in  section  4  and 
concluding  remarks  in  section  5. 

*  This  work  wu  supported  in  part  by  the  National  Science  Foun¬ 
dation  M  IP-90079 17  and  the  Semiconductor  Research  Corporation 
under  contract  #90-DP-142. 


>2  ^Background: 

A  brief.  description  of  AWE; is  provided  below.  For  a 
complete  discussion,  the  reader  is. referred  to. (8,  9). 

The  mathematical  development  of  AWE  is  based  on 
the  differential  state  equations  for  a  lumped,  linear,  time- 
invariant  circuit: 

x  =  'A *u,  (1) 

where  z  is  the  n-dimensional  state  vector  and  t?  is  the  m- 
dimensiohal  excitation  vector.  For  the  investigation  of 
delay  and' rise-time  effects,  a  step  excitation  is  adequate 
and  assumed.  For  this  excitation,  (1)  has  the  homoge¬ 
neous  solution 

*a  =  A  •  zs  (2) 

with  the  initial  condition 

xa(0)  =  io+A-'-B-ffo  (3) 

where  x0  is  the  initial  state  at  time  zero  Taking  the 
Laplace  transform  of  (2)  yields 

Xa(s)  =  (sI-A)-‘xa(0).  (4) 

The  Maclaurin  series  expansion  of  (A)  is 

Xft(s)  =  -  A*'(I  +  A*‘s  +  A*  V . .  .)r„(0).  (5) 

From  (5),  the  initial  conditions  and  first  2q-l  time  mo¬ 
ments  of  the  tih  component  of  X/,(s)  are  characterized 
as 

=  (-xa(0))j 

(mo)i  =  (-A-1  •Xft(O)),- 

(mz,.z),  =  (-A-J‘+‘ •«*<<>)).  (6) 

where  the  initial  conditions  are  represented  as  the  nega¬ 
tive  first  moment.  For  RC  interconnects,  these  moments 
m  are  computed  efficiently  using  a  path  tracing  technique 
described  in  [11],  of  which  this  work  is  a  part 
The  reduced-order  model  has  the  form 

=  <7> 
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where  ti  and  hi  represent  the  dominant  time  constants 
and  corresponding  residues  respectively,  and  q  represents 
the.prder  of  the  reduced  model.  It  now  remains  to  solve 
for  these  time  constants  and  residues  by  matching  the 
moments  from  (6)  to  .those  of  (7)^ 

Expanding  each  of  the  terms  in  (7)  into  a  senes  about 
the  origin,  and  upon, inclusion  of  initial  conditions,  the 
following  set  of  nonlinear  simultaneous  equations  for  the' 
t*\  state  variable  is  obtained. 

+  +  =  (m-i). 

-(tlT,.+  tjii  +  •  •  ■  +  k,T,)  =  (mo), 

-(*|T r^"')  =  (”‘2«-2)i  (8) 


the  time  constants.  However,  rather  than  obtain  an  ex¬ 
act  solution,  the  gradient  information  is  used  to  descend 
to  ah  extremal  point,  in. this  case,  a  minima.  This  was 
'encouraged  by  the  avai lability  of  a  good  initial  guess  ob¬ 
tained  .by  the  direct  mapping  described  in  ;(8).  When 
the  approach  in  (8]  yields  positive  time  constants,  the  re¬ 
maining  time  constants  have  been  seen  to  lie  very  close  to 
the  correct  solution.  A  reasonably  good  initial  estimate 
is  thus  obtained  from  the  stable  time  constants  and  the 
negative  of  the  errant,  time  constants). 

’Gradient  information *im the  form  of  the  Jacobian 
matrix  can^be ‘simply  computed ^asjllustraled  below. 
‘Rewriting  the  system  of  equations' to  be  minimized  (10) 

f(x)  =  V  •  A"*  ■  V”1  *  mj  —  tTif,  (12) 


These  equations  can  be  expressed  in  matrix  form  as 

(9) 

V  •  •  V-*  —  (10) 

where  mi  represents  the  low-order  moments  (-1,0,  ...» 
q-2),  m/i  represents  the  high-order  moments  (q-1,  q,  . , 
2q-2),  A“*  is  a  diagonal  matrix  of  the  time  constants, 
and  V  is  the  well-known  Vandermonde  matrix. 

Attempts  to  directly  solve  the  nonlinear  equations 
(9)  and  (10)  to  obtain  the  requir'd  time  constants  and 
residues  as  described  m.(8]  or  by  using  unconstrained 
methods  such  as  Newton-Raphson  iteration  may  lead  to 
the  problem  mentioned  earlier,  i.e.,  unstable models  of 
stable  systems.  In  this  work,  a  constrained,  optimal  so¬ 
lution  is  sought  that  restricts  the  time  constants  obtained 
to  the  left  half  plane. 

3  Development 

A  two  step  optimization  scheme  is  proposed  to  fit  the 
moments  in  (7)  to  those  in  (6):  (i)  a  constrained  opti¬ 
mization  to  obtain  the  stable  time  constants  that  best 
satisfy  (10),  and  (ii)  a  least-squares  fit  of  the  overdetcr- 
mined  system  in  (9)  using  the  time  constants  obtained 
earlier.  (This  would  take  into  account  the  presence  of 
round-off  errors  in  the  moment  values  and  find  a  best 
fit.) 

As  derived  in  AWE,  (10)  is  a  nonlinear  system  of  the 
time  constants  only,  and  can  be  solved  independently 
of  (9).  To  ensure  that  the  reduced-order  model  (7)  is  sta¬ 
ble  for  a  given  stable  system,  in  this  case,  an  RC  tree,  it. 
would  require  that  the  time  constants  be  negative.  This 
constraint  can  be  incorporated  in  a  simple  manner,  with¬ 
out  significant  complication  of  the  equations  in  (10),  by 
a  transformation  of  variables  (1),  that  tranform  the  con¬ 
strained  system  into  an  unconstrained  one. 

*  =  -«**.  (11) 

The  resulting  equations  in  xi  can  now  be  optimized  using 
more  powerful  unconstrained  schemes. 

In  this  work,  a  gradient  descent  scheme,  based  on  the 
nonlinear  Newton-Raphson  iteration,  is  used  to  obtain 


an  expression  for  the  Jacobian  matrix  is  derived  as. 


gf®  Y3(v-A-«),  jt  ,av\ 
dx,  V  dzt) 


(13) 


where  z  ==  V,"1  •  A~*.  This  requires  the  evaluation  of  the 
partial  derivatives  of  only  the  Vandermonde  and  the  di¬ 
agonal  maHx,  both  of  which  yield  sparse  and  extremely 
predictable  derivatives. 

With  an  optimal  set  of  time  constants  in  the  left-half 
plane  that  minimize  (12),  it  now  remains  to  determine 
the  corresponding  residues  in  the  model  (7).  While  (9) 
provides  a  simple  and  straightforward  solution,  it  uses 
only  the  lower-order  moments.  To  further  decrease  the 
errors  incurred  in  constraining  the  time  constants,  the 
over  determined  system  (8)  that  matches  all  the  2  q  mo¬ 
ments  is  used  to  obtain  a  best  fit  estimate  using  a  least- 
squares  approach. 

Rewriting  (8)  in  matrix  form  yields 

L.fc  =  m  (14) 

where  L  is  the  (qxn)  matrix 

1  1  ...  1 
r\  T\  T\ 
rl  T1 

rf1"1  ...  rf2»“l 

From  (5),  the  required  least-squares  estimate  is 

j  =  -(LT.L)-‘-Lr.m  (15) 


The  error  in  the  model,  defined  as  the  amount  of  mis¬ 
match  in  (8)  is  now  defined  as  the  norm  of  the  vector 


c  =  m  ~  L  ■  £  (16) 

This  completes  the  constrained  mapping  of  the  moments 
to  a  pole-residue  representation. 
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'A  further  enhancement  would  be  to  use  a  weighted 
least-squares  scheme  in  the  second  stage  of  the  optimiza¬ 
tion  to  ensure  that  the  sum  of  the  residues,  m, i,  hence 
the  steady-state  response,  is  most  accurate.  -In  this  case, 
the  estimate  is  given- by 

£=  -(LT  ■  W  •  L)- 1  ■  Lt  -  V/  •  m  (17) 

where  W  is  a  suitable  weighting  matrix. 

Conditioning  of  matrices  is  handled  by  employing  fre¬ 
quency  scaling  (8)  and  use  of  the  singular  value. decom¬ 
position i  (SVD)  to. compute  the , least-squares  fit  (7j.  In 
this  case, 

*  as  V  -  {dm5(l/ty/))  '  UT  •  b  (18) 

where' L  as  U  •  (diajr(tyy))  -  VT  represents  the  SVD  of  L. 


4  Implementation  and  Results 
As  mentioned  earlier,  calculation  of  circuit  moments  for 
RC  interconnects  is  done  very  efficiently  using  a  path¬ 
tracing  technique  m  (11).  An  initial  estimation  of  pole(s) 
and  residue(s)  is  accomplished  using  the  approach  de¬ 
scribed  in  (9).  The  optimization  routines  are  invoked 
upon  the  occurance  of  a  positive  pole(s). 

For  a  4000  node  RO  circuit,  an  unconstrained  third' 
order  approximation  of  the  unit  step  response  at  node 
1000  yielded  the  unstable  model  (normalized  to  nanosec¬ 
onds): 


*M«i  + 


7.95 

I  +  16.2s 


+ 


0.84  -0.008 

1  +  l.C4s  *1-  1.82s 


(19) 


Constraining  the  mapping  yielded  the  stable  model; 


?W  =  1  + 


7.93 

1  +  16.2s ' 


0.17  -0.7 

1  +  1.51$  I  +  1.78s* 


(20) 


A  graph  of  the  response  (20),  when  plotted  versus  the 
output  of  a  circuit  simulator  (PSFICE)  (10),  shows  highly 
improved  results  as  compared  to  the  first-order  Elmore 
approximation,  (lt  is  noted  that  the  response  from 
PSPICE  and  (20)  are  nearly  identical.)  Further,  for 
a  third-order  approximation,  the  algorithm  described 
herein  required  1.93  CPU  seconds  and  708  kilobytes  of 
memory  on  a  Sparcstation  1  while  PSPICE  required 
328.63  CPU  seconds  and  2368.36  kilobytes  on  the  same 
machine. 


5  Conclusion 

A  promising  approach  for  overcoming  the  potential  for 
instability  associated  with  the  moment-matching  tech¬ 
nique  in  AWE  as  applied  to  RC  interconnects,  has  been 
presented.  This,  approach  can  further  be  extended  to 
general  linear(ized)  RLC  models,  When  used  in  combi¬ 
nation  with  patli  tracing,  a  robust  and  efficient  algorithm 
is  obtained  for  the  problem  of  accurate  RC  interconnect 
delay  evaluation. 


time  (ns) 

Figure  1.  Step  response  at  node  1000  of  a  4000  node  RC 
interconnect  model 
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Abstract  -  The  waveforra  relaxation  .algorithm 
for  the  electrical  simulation  of  digital 
circuits  is  generalized  to  efficiently  handle 
entire  waveforra  intervals  at  one  time,  thus 
enabling  detailed  circuit  verification  that  is 
impractical  with  non-interval  simulators: 


I.  Introduction 

The  complexity  of  digital  VLSI  circuits 
makes  verification  of  their  detailed  operation 
an  extremely  difficult  problem.  In  practice, 
one  can  simulate  only  a  few  subcircuits  and 
situations  using  current  electrical  simulators 
such  as  SPICE  (1).  Even  if  "exact"  simulation 
could  be  sped  up  by  orders  of  magnitude  using 
specialized  numerical  methods  or  hardware  12], 
full  verification  would  still  be  infeasible 
due  to  the  large  number  of  cases  that  must  be 
considered.  As  a  result,  for  efficient 
verification  we  need  a  simulator  that  can 
handle  entire  sets  of  waveforms,  e.g.  waveform 
intervals/  instead  of  precise  waveform 
estimates.  This  .paper  outlines  how  such 
intervals  can  be  computed  and  how  they  can  be 
used  for  verification. 


Interval  arithmetic  is  used  extensively  for 
verifying  digital  circuits  at  a  igh  level, 
e.g.  in  logic  simulation  signals  are  grouped 
into  true  and  false  signal  classes.  Timing 
verifiers  use  estimates  of  "worst-case"  delays 
to  cover  a  large  number  of  cases  with  little 
computation  (3) .  The  use  of  intervals  for 
verification  in  switch-level  simulation  has 
also  recently  been  introduced  (4).  By  their 
very  nature,  digital  circuits  have  monotonic 
properties  at  a  high  level  that  make  interval 
arithmetic  very  practical. 

Interval  arithmetic  has  rarely  been  applied 
to  electrical  simulation  because  it  is 
difficult  in  general  to  apply  it  to  the 
incremental-time  solution  of  ODEs  (ordinary 
differential  equations)  {5}.  Since  a  small 
amount  of  uncertainty  can  be  added  at  each  of 
the  many  steps  in  the  computation,  acceptable 
accuracy  is  often  achieved  only  for  small 
periods  of  time.  In  fact,  if  SPICE  were  run 
on  an  interval  computer,  it  would  likely  not 
even  be  able  to  distinguish  between  true  and 
false  digital  signals  by  the  end  of  a  typical 
clock  period.  Fortunately,  however,  there  is 
an  alternate  iterative  method  for  solving  ODEs 
called  Waveform  Relaxation  (WR)  which  performs 
well  on  most  digital  circuits  16]  and  can  be 
much  more  efficiently  merged  with  interval 
arithmetic  (7} .  WR  can  lead  to  efficient 
interval  analysis  because  it  treats  signals  as 
entire  waveforms,  a  natural  generalization  of 
what  is  done  in  logic  and  timing  simulators, 
and  partitions  the  circuit  into  small  logic 
blocks.  Thus  the  fundamental  high-level 
monotonic  relationships  that  are  built  into 
small  digital  subcircuits  can  be  exploited. 


II.  The  Waveform  Relaxation  Algorithm 

This  section  outlines  the  basic  WR  method 
18] .  All  WR  algorithms  start  with  a  guess  for 
the  circuit  behavior  x°C»),  consisting  of 
waveforms  for  at  least  a  complete  set  of  state 
variables  over  some  time  period  [0,  T) .  By 
simulating  subcircuits  (e.g.,  logic  blocks) 
independently  over  A0,  T),  this  guess  is 
updated  and  called’  x 1 .  The  analysis  of  each 
subcircuit  uses  the  assumption  that  connected 
subcircuits  are  behaving  according  to  earlier 
guesses.  Continued  iteration  produces  a 
sequence  of  behaviors  which  converges  to  the 
actual  solution,  denoted  x ,  for  a  large  and 
useful  class  of  circuit  models  16,7] .  A 
function  F  can  be  defined  that  maps,  .one 
behavior  guess  to  the  next,  i.e.,  xi+1  * 
F(x*),  and  WR  convergence  proofs  generally 
rely  on  showing  that  F  As  contractive  in  an 
appropriate  norm.  The  solution  x  can  be 
thought  of  as  a  unique  "fixed  point”  of  the 
relaxation  mapping  function  F. 

Because  signals  generally  flow  primarily  in 
one  direction  in  digital  circuits,  without  any 
direct  feedback  ocher  than  that  delayed  by 
latches,  WR  can  converge  very  quickly  for 
these  circuits  when  appropriate  partitions  and 
schedules  are  chosen.  Since  evaluation  of  f 
involves  incremental-time  simulation  of  each 
logic  block  separately,  computation  time  tends 
to  grow  only  linearly  with  circuit  size.  The 
advantages  (efficiency)  and  disadvantages 
(complexity  and  lack  of  generality)  of  using 
WR  for  "exact”  simulation  are  still  the 
subject  of  some  debate.  For  verification 
using  waveform  intervals,  however,  it  appears 
to  be  much  easier  to  build  on  the  WR  approach. 


III.  An  Interval  Extension  of  WR 

This  section  illustrates  how  an  interval 
algorithm  for  electrical  circuit  simulation 
can  be  derived  from  a  WR  algorithm.  A  set  of 
circuit  behaviors,  denoted  X,  is  defined  by  a 
waveform  interval  associated  with  each 
variable  (or  union  of  intervals) .  Variables 
in  a  circuit  behavior  could  consist  of  node 
voltages,  branch  currents,  and  voltage 
derivatives.  An  interval  mapping  function  G, 
whose  domain  and  range  both  consist  of  all 
possible  behavior  sets  X,  is  defined  in  terms 
of  the  WR  mapping  function  F  as  follows: 

Def.  1:  For  all  x  and  any  xcx,  F(x)  €  G(X) . 

There  are  many  possible  G  functions,  as  G 
is  any  bound  on  one  of  many  proposed  specific 
relaxation  functions.  It  is  possible  to  bound 
F  fairly  tightly  because  any  F  is  simply  a 
series  of  solutions  for  simple  logic  blocks. 
Basic  logic  blocks  have  many  provably 
monotonic  relationships  between  inputs  and 
responses,  so  only  endpoints  need  be 
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evaluated.  In  addition,  device  models  can 
often  be  rigorously  simplified  in  a  Banner 
that  drastically  reduces  computation  while 
only  slightly  loosening  the  bounds.  G 
functions  have  the  following  properties  (7) : 

Theorem  1:  If  G{X)c.  Xf  x*€  G(X) ; 

Theorem  2:  If  x’e  X,  x‘e'G(X)  . 

Thesertwo  (simplified)  theorems- can  be  used 
to  bound  x*  based  only  on  the  computation  of 
one  or  more  G  functions,  -possibly  repeatedly. 
The  first  theorem  can  be  used  to  verify  that  a 
tentative  bound  on  x*  is  indeed: a  bound.  If  G 
provides  a  fairly  tight  bound  on  f*,  it  should 
maintain  its  contractive  property  and  enable 
the  shrinking  of  behavior  sets.  The  second 
theorem  can  be  used  to  iteratively  improve 
bounds  on  x* .  Any  slack  in  the  function  G 
will  limit  the  tightness  of  the  final  bounds 
on  x  ,  as  will  any  correlations  that  are 
ignored  at  the  block  boundaries.  The  slack 
arising  from  such  correlations  can  be  managed, 
however,  because  the  feedback  among  adjacent 
blocks  is  relatively  small. 


IV.  Verification  Using  Interval  Algorithms 

waveform  intervals  can  be  used  in  a  number 
of  ways  to  reduce  computation  in  circuit 
verification.  These  uses  fall  into  two  main 
classes,  accuracy  management  and  combinatorial 
compression.  These  applications  are  discussed 
here  based  on  the  assumption  that  a  suitable 
interval  waveform  simulation  algorithm  is 
available. 

Accuracy  management  takes  advantage  of  the 
fact  that  uniformly  tight  bounds  for  every 
variable  in  x  are  not  always  necessary  to 
verify  satisfactory  circuit  operation  (9).  In 
portions  of  a  combinational  logic  circuit  that 
surpass  their  timing  specifications,  such  as 
logic  gates  not  in  critical  paths,  significant 
slack  can  often  be  tolerated  during 
transitions.  Since  interval  arithmetic  allows 
accuracy  to  be  measured,  optimal  assignments 
of  accuracy  levels  can  be  approached 
throughout  the  circuit.  Based  on  monotonic 
properties  of  most  simple  logic  blocks,  device 
models  can  be  locally  simplified  to  trade 
accuracy  for  computational  speed,  e.g.,  using 
bounding  piecewise  linear  transfer  curves. 

An  "exact"  simulator  cannot  be  used  for 
complete  verification  due  to  the  large  number 
of  different  cases  that  must  be  considered, 
arising  from  different  combinations  of  input 
signals  and  device  parameters.  Combinatorial 
compression  involves  grouping  together  classes 
of  inputs  into  intervals  so  that  computation 
grows  only  linearly  with  circuit  size.  The 
use  of  worst-case  delays  for  input-independent 
analysis  in  timing  verifiers  provides  a  good 
example  of  the  basic  approach.  Waveform 
interval  analysis,  though,  can  achieve  similar 
gains  without  sacrificing  detailed  information 
about  waveform  shapes.  Waveforms  can  be 
grouped  into  classes  such  as  rising,  falling, 
constant-high,  and  constant-low.  Due  to  the 
basic  roonotonicity  of  logic  circuits  at  the 
waveform  and  logic-block  level,  the  extra 
conservatism  that  arises  from  such  groupings 
is  generally  small. 


The  potential  power  of  combinatorial 
compression !  can  be  seen  by  generalizing  the 
switch-level  memory  verification  proposed  m 
C4]  to  include  timing  information.  For 
example,  to  verify  the  read  operation  of  a 
single  bit  in' a  large  memory,  all  other  memory 
cells  can  be  initialized  to  the  union  of  two 
intervals,  representing  true  and  false 
respectively.  All  the  output  buses  except  the 
one  connected  to  the  bit  under  test  will  have 
unknown  responses,  indicated  by  very  loose 
bounds  that  include  true  and  false  values.  If 
the  bit  is  still  read  correctly,  however,  with 
a  tight  interval  that  guarantees  correctness, 
the  circuit  has  been  verified  for  all  possible 
patterns  in  the -rest  of  the  memory  using  only 
a  single  simulation. 


V.  Conclusion 

Although- a  full  waveform  interval  simulator 
has  yet  to  be  demonstrated,  and  many 
challenges  remain  to  achieve  high  efficiency, 
the  required  basic  theory  has  been  developed 
and  experiments  have  been  undertaken  that 
indicate  feasibility.  Furthermore,  such  a. 
simulator  would  provide  a  powerful  new  tool 
for  detailed  circuit  verification,  based  on 
its  ability  to  enable  accuracy  management  and 
combinatorial  compression. 
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Settling-Time  Bounds  for  Switched-Capacitor  Networks 
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Abstract:  The.Penfidd.Rubinstdn  bounds  (1)  are  extended  to 

cover  many  switched-capacitor  networks.  The  original  formulation  in 
terms  of  differential  equations  [2]  is  modified  to  include”  the  relevant 
class  of  discrete-time  difference  equations. 


1  Introduction 

Many  algorithmsin  machine  (robot)  vlsior.  processing  can  be  formu¬ 
lated  as  minimization  problems.  Consequently,  these  algorithms  can 
be  implemented  using  appropriately  designed  reciprocal  resistive  net¬ 
works  that  relax  very  quickly  to  the  required  solution  (3)  The  appeal¬ 
ing  features  of  these  networks,  namely  local  computation  and  commu¬ 
nication,  make  VLSI  implementation  both  feasible  and  attractive. 

Most  image  processing  tasks  are  accomplished  with  a  large  amount 
of  input  data  (e  g ,  128x128  intensity  values).  Power  considerations, 
therefore,  require  each  unit  cell  to  contain  relatively  large- valued  re¬ 
sistors.  Since  these  large  values  are  difficult  to  achieve  consistently 
and  uniformly  in  standard  VLSI,  alternatives  are  sought  One  such 
alternative  is  the  use  of  switched  capacitor  (SC)  resistor  equivalents. 
These  equivalents  are  attractive  not  only  for  their  low  power,  but  also 
because  of  their  uniformity,  potentially  small  unit  cell  size,  and  depen¬ 
dence  on  switching  frequency. 

One  drawback  of  these  SC  networks  is  their  settling  time.  Since 
charge  is  shared  among  neighboring  capacitors  in  discrete  steps,  rather 
than  instantaneously  as  with  continuous- time  resistors,  the  equilib¬ 
rium  charge  (and  thus  voltage)  distribution  is  approached  only  after 
some  finite  number  of  timesteps.  In  seeking  bounds  and  estimates  for 
this  settling-time  waveform,  we  have  extended  the  Rubinstein-Pcnficld 
bounds  (1)  to  the  discrete-time  case. 


2  An  Example 

There  is  a  large  class  of  SC  networks  for  which  bounding  expressions 
can  be  formed^  As  an  example,  consider  the  SC  line  shown  in  Fig¬ 
ure  1.  There  are  twenty  grounded  capacitors,  each  connected  to  its 
two  neighbors  by  either  a  phase  I  or  a  phase  2  switch;  the  two  phases 
alternate  in  controlling  the  switches  along  the  line,  A  single  voltage 
source  feeds  the  line,  and,  in  Figuro  1,  nods  k  is  assumed  to  be  odd. 
Just  for  fun,  let  the  capacitors  in  the  line  take  on  the  values  of  the 
digits  of  x.  That  is,  let  C\  =3 ,  Cj  «  1,  Ca  -  >1,  etc.1  Also,  let  the 
voltage  on  the  sixteenth  capacitor  be  the  signal  of  interest. 

For  the  general  node,  the  voltages  are  for  phase  1  (with  k  odd) 


njn+j]  =  * 

<’*[’>  +  !]  =  W-ljnt-iJ 

<M  ["  i  5]  =  o»ln| 


(1) 

(n) -f  CtVt(n)  . 
C*-,+C*  K> 


(3) 


and  for  phase  2  (with  k  also  odd) 

«!»H]  =  <**.("  +  11  — 1 — crt;--J - Ml) 

These  equations  can  be  expressed  in  matrix  form,  then  combined,  to 
yield 


v  (n  +  1)  s=  Av  (n)  +  bt<o  (5) 


and 
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Figure  1.  Two-phase  SC  line  of  twenty  capacitors  and  switches 


Defining  A  (n)  =  v(n  +  lj  -  v(n]  and  M  =  I  -  A,  the  system  can 
be  written  as 


A  (n)  -Mv  (n)  -f  b  uj  (6) 

In  this  example,  M  is  a  nonsingular  M-matrix  (4]  by  construction  Its 
inverse,  P,  exists  and  has  nonnegative  entries,  and  an  analysis  similar 
to  that  found  in  [2]  can  be  performed.  In  our  example,  the  three 
important  parameters  dependent  upon  the  entries  of  P  are 


=  315.7 

Td,  18 

=  377.3 

(8) 

TP 

=  408  3 

(9) 

The  bounds,  derived  below,  are  shown  in  Figure  2  The  middle  signal 
is  the  exact  response  of  as  a  function  of  timesteps,  assuming  that 
all  capacitor  voltages  are  initially  at  zero  volts.  The  upper  and  lower 
waveforms  axe  the  results  of  the  bounding  expressions.  These  bounds 
are  derived  from  a  reduced  order  model  of  the  system,  and  represent 
true  bounds.  Their  advantage  is  that  they  are  much  easier  to  calculate 
than  the  exact  expression  for  large  systems. 


3  Determining  the  Bounds 
The  following  derivation  is  similar  to  that  in  (2),  altered  here  to  cover 
the  discrete-time  case.  In  general,  consider  the  linear  discrete-time 
system  as  in  equation  5  such  that  I  -  A  is  an  M-matrix,  where  b  >  0 
(that  is,  every  dement  is  nonnegative  and  b  0)  and  t,  denotes  the 
distinguished  component  of  v  that  serves  as  the  system  output.  Let 
to  be  a  positive  step  input.  Define  A(n},  M,  and  P  as  before,  along 
with  the  equilibrium  (final)  state  value 


‘lb  alttlee*  jJkw,  t  m  5  H1S925SJ58979J2M5. 
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=  Pbto 


(10) 


The  system  can  bow  be  rewritten  as 

PA[nJ=v„-v[n)  (11) 

It  can  be  shown  that,  the, zero-state  step  response  of  the  distin¬ 
guished  component  v,  of  the  system  state  is  a  monotoaic  nondecreasing 
function  of  timesteps  with  final  value  e«,.  The  function  nmin(e,*)  is 
defined  as  the  minimum  possible  number  of  steps  necessary  to  achieve 
p,[n]  >  r',  while  11^(9*)  is  defined  as  the  maximum  possible  number 
of  steps  before  the  system  output  r,(n]  is  forced  past  the  value  r*.  An 
output  level  r,*  will  be  reached  In  no  less  than  n„;n  steps  and  crossed 
in  no  more  than  nntz  steps. 

We  construct  a  reduced  order  model  with  two  states  defined  by 

$.!”]  =  (12) 

m  =  («) 

tn 


that  obeys 


(MOUm  =  (T-0,,1)  (14) 

/.I»+l|  =  /.W-J.W  (15) 

•5,(11 +1)  <  J,[n]  (16) 

5,(n]rK,'  <  AM  <  S,(n)7>  (17) 

t'.(n)  =  <Ve,(I  -  J,H)  (18) 


where  the  three  system  parameters,  7)y,  To;,  and  7>,  are  defined  in 

(2),  The  equalities  in  equations  14-18  follow  from  the  definitions,  and 
the  inequalities  follow  from  the  fact  that  I  -  A  is  an  M-matrix. 

The  reduced  order  model’s  equations  can  be  graphically  interpreted 
as  shown  in  Figure  3. 

*  »<■*»«*  I,"  ’.I, 


Figure  3.  Reduced  order  model  statespace  geometry. 
(Dotted  line.  nmi»  path.  Dashed  line.  nmtr  path.) 


At  every  timestep,  the  state  jumps  to  a  new  point  as  dictated  by 
the  reduced  order  model  (the  next  value  of  /,  is  specified,  and  the  next 
value  of  gt  is  bounded);  the  starting  point  is  at  (Tbi*  l)«  The  goal  is 
either  to  reach  the  target  line  (gt  ~  g*)  as  quickly  as  possible  (nm,-n),  or 
to  delay  going  past  it  as  long  as  possible  (nmtx).  The  lines  /,  a  Tjn,g; 
and  /,  »  Tpg,  are  the  upper  and  lower  boundary  lines,  respectively. 

The  procedure  for  obtaining  the  minimum  number  of  timesteps 
from  the  initial  point  to  a  point  on  or  below  the  target  line  is  to 
follow  the  “highest*  path  through  (/,,?, )*space.  This  path  consists 
of  traversing  the  line  g{  =  $,(0),  then  traversing  the  upper  boundary, 
always  in  the  direction  of  lower  U  values.  This  path  is  the  fastest 
way  to  move  towards  lower  /,  values,  since  each  /,  step  is  as  large  as 
possible.  Of  course,  the  final  jump  to  or  below  the  target  line  is  taken 
as  early  as  possible.  By  following  this  methodology,  the  nmi*  value 
arrived  at  is  the  minimum  possible  number  of  timesteps  necessary  to 
move  on  or  below  the  target  line. 

The  procedure  for  obtaining  the  maximum  number  of  timesteps 
from  the  initial  point  to  a  point  below  the  target  line  is  to  follow  the 
“lowest”  path  through  (/„0,)-$pace.  This  path  insists  of  traversing 
the  line  /,  =  /,(0),  then  traversing  the  lower  boundary  line,  and  finally 
traversing  the  target  line,  always  in  the  direction  of  lower  /,  values. 


This  path  is  the  slowest  way  to  move  towards  lower  ♦/;  values,  since 
each  /,  step  is  as  small  as  possible.  Of  course,  the  final  jump  below  the 
target  line  is  taken  as  late  as  possible.  By.following  this  methodology, 
the  iw  value  arrived  at  is  the  maximum  possible  number  of  timesteps 
such  that  its  state  and  every  subsequent  state  is  forced  below  the  target 
line. 

The  discrete- time  bounds  have  the  following' form; 


. »«;.«)  j 

v’  range 

fmax(] 

«a  + 

0 

,td,-tp9;)} 

u;  =  o 

*v«f  Tr. 

Tj»-Tn,+t»*  ;  *’  ,  ■ 

•*<■-*)  1 

If  ^  ^  1 

range 

2+  l^-lk] 

o<£<&^±! 

2  +  ns-f 

where 


n*  =  1  +  (Tdi  -  Tr,  j 
nj  =  1  + 


Mi-ij 

The  symbols  (xj  and  fx]  denote  the  floor  and  ceiling  functions. 


(19) 

(20) 

(21) 


4  Conclusions 

The  bounds  are  known  to  apply  to  a  large  class  of-  SC  networks, 
namely  those  containing  an  arbitrary  number  of  grounded  capacitors 
and  grounded  independent  voltage  sources,, along  with  an  arbitrary 
number  of  switches  connecting  any  of  the  elements.  The  class  of  net* 
works  is  sure  to  be  more  extensive,  but  the  enlarged  boundaries  have 
not  been  fully  investigated,  Tightening  the  bounds  is  also  an  impor- 
tant  topic,  as  is  studying  under  what  conditions  the  P  matrix  may 
be  stuffed  by  inspection  (to  avoid  calculating  a  matrix  inverse).  It  is 
known  that  the  P  matrix  can  be  stuffed  by  inspection  for  a  certain 
class  of  SC  trees. 
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Abstract.  In  this  paper  overlapping  domain  decomposi¬ 
tion  methods  are  applied  to  the  numerical  solution  of  Pois¬ 
son  equation  with' Dirichlet-Signorini  boundary  condition. 

T.  INTRODUCTION' 


In  ,  this  paper  we  describe  the  numerical  solution  by 
overlapping  domain  decomposition  methods  of  grid  vari¬ 
ational  problems  arising  from  finite  element  approxima¬ 
tions  of  the  simplified  ‘  Dirichlet-Signorini  >  problem  •  for 
Poisson  equation.  Probably,  the  paper  (1]  was  the  first, 
where  overlapping  domain.decomposition . methods  were 
applied  to  the  solving  ,of  variational  inequalities.  The 
convergence  proof  of  our  methods  is  based  . on  the  finite 
dimensional  approach  proposed,  and  developed  in  (3)  for, 
linear  variational  problems. 

We  consider  the  problem 


-Au  =  /  in  R2  , 

u  -  0  on  To  (?£  0), 
f) 

u>0,  -r-«>0,  u-— «  =  0 

on  dn 

a.e.  on  =  OCl  \  I'o  , 


(1.1) 


where  ^  denotes  the  outer  normal  derivation  to  #ft. 
It  is  well  known  that  .(1.1)  can  be  formulated  as  a  mini¬ 
mization  problem: 


Find  u  €  K  :  J(u)  =  nun  J(v)  = 

\JnVvVvdx~J  fvdx  =  \a(v'v)-(f’v)< 

(1.2) 

where  K  —  {u  6  /T*(ft)  \  V  =  0  on  To  and  V  > 
OonPi}.  The  equivalent  formulation  of  the  problem 
(1.1),  (1.2)  in  terms  of  variational  inequality  reads 


Find  u  €  K  :  a(u,  v  -  u)  >  (/,  v  -  u)  Vv  6  K. 

(1.3) 
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2:  Finite  element  approximation 

Let  7Ji  =  {e* }  be  a  triangular  covering  of  ft  under 
the  typical  assumptions’ like  that  the  set  Tonr*  belongs 
to  the  set  of  vertices  of  triangle  elements  ’ We  denote 
a  standart  piecewise  linear  finite  element  subspace  of  V 
related  with  7^  by  V\  and  consider  the  finite  element 
approximation  of  the  Dirichlet-SignorinL  problem  (1.1) 
in  the  form  of  minimization  task': 

Find  tth  €  I(h  :  J(uh)  =  min.J(v)  (2.1) 
v£Kk 

where  Kh  =  V\  fl  K. 

The  equivalent  formulation  of  (2.1)  in  the  terms  of 
variational  inequalities  is  defined  by: 

Find  U\  6  Kh  ■  a(tth,v-Uh)  >  (/,«-«*)  Vti  6  Kh- 

(2.2) 

3.  Overlapping  domain 

DECOMPOSITION  METHOD 
Gh  is  said  lo  be  a  grid  subdomain  ol  7/,  (ol  57),  it 
Gh  is  a  union  of  triangle  elements  e(*.  The  interior 
0 

of  Gh  is  denoted  by  Gh  and  its  boundary  is  denoted 
by  dGh-  The  boundary  of  Gh  consists  of  two  subsets: 

denotes  the  closure  of  the  intersection  of  dGh  with 
ft  U  To,  i.e.  rlG)  =  IRh n(ftuT^),  and  T{G)  de¬ 
notes  the  remaing  part  of  OGh »  be.  is  the  interior 

of  6Gh  n  Pi* 

Let  a  grid  subdomain  Gh  C  Th  and  a  finite  cle¬ 
ment  function  Vh  €  Kh  be  given.  If  dGh  =  we 
shall  associate  with  the  pair  {Gh]  Vh)  the  only  subset 
0 

Kh(G\Vh)  C  Khf  which  is  defined  by; 

Kh(G\  Vh)  =  {wh  €  Kh  |  to  J _  0  =  »*}.  (3.1) 

yQ\Gk 

ifr<c)  ^0  ,  we  shall  associate  with  the  pair  {Gh]  Vh) 

0  1 
either  the  subset  Kh(G]  Vh)  of  Kh  or  the  subset  Kh{G]  1>a) 
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of /(ft  which  is  defined  by: 


=  {«*  6  Kh  |  wh\_^r[a))=-vh}. 

(3,2) 

0  1 

Note  that  both  subsets  Kh(G]  Vh)  and  Kh(G]  Vfi)  are 
closed .  and  i convex.  Thus,  vwe  associate  with  the  pair 
(Ght  vh)  the  subset  Kh(G J  t>ft)  C  Kh,  which  is  equal 

,0  ...  7  ,1„ 

to  either  Kh(G\  Vh)or Kh(G\  Vh). 

Let  us  consider  the  following  variational  subdomain, 
problem:  Find  Wh  €.Kh(G]  Vh)  such  that 

J(wh)=  cmin~  .J(<fih),  (3.3). 

V>hGKh(G',VK) 

which  is  equivalent  .to  the  variational  ^inequality:  Find 
Wh  €  Kh(G }  Vh)  such  that 

a (u>a,V>a  -  wh)>  {fi<ph-WK)V<Ph  6  Kh{G\vh). 

(3.4) 

It  is  obviously,  that  the  latter  problem  has  a  unique  so¬ 
lution. 

The  problems  (3.3),  (3.4), result 

J(\lh)  =  J{wh  +  («A  -■«>/,)) 

=  J{ wa)  +  0(»A  “  Wa.WA  ~  Wh)+ 

2  {a(w A,  vh-wh)~  (/,  «A  -  «>/>)} 

>  +  a(vh  ~  wh,  Vh—  Wh) , 

which  leads  to  the  following  conclusion. 

Lemma  3.1. 


J(wh)  <.J(vh) 

for  any  Vh  G  Kh  and  J{vih)  =  J(Vh)iftWh  =  Vh . 

Let  us  assume  that  a  grid  subdomain  Gh  and  *  func¬ 
tion  /  are  fixed  as  well  as  a  triangular  covering  7ft.  We 
define  the  operator  Tq,  :  Kh  Kh  such  that  for 
any  Vft  €  Kh  the  solution  function  Wh  of  (3.3),  (3.4)  is 
given  by  formula 


LEMMA  3.2.  The  operator  Tq  :  Kh  Kh  is 

continuous. 

Let  m  be  a  positive s integer*  and .  Jet  7ft -be.  decom¬ 
posed  into  771  grid' subdomains  Gj  =  G^,,i.e.  7 ft  = 
VjL^Gj.  We  associate  with  every. subdomain  Gj  asub- 

(i)  0.,  - 

set  of  JTft,  which  is  equal  to  either  Kh(G)  or 

1  0.  V 

Kh(G)  where  JFft(6r) 'and  Jfft(G)  are  defined  as  iii 
(3.1)  and1  (3.2).  The  latter  case  means  that  with  every 
index  j  we  associate  the  pair  (Gj]  j,  j'ss  1“  TO. 

We  shall  require  that  the  following  two  conditions  are 
satisfied: 

(Al)  every,  grid  node  £;  €  ftft  belongs  to  at' least  one 

J0  ' 

Gy 

(A2)  every. grid, node  Fffc  belongs  to  at  leas  one 

r|;)  s.lfH 

The  assumption  (Al)  means  that  every  grid  node  from 
lift  belongs  to  the  interior  of  at  least  one  subdomain  G j. 
In  assumption  (A2)  we  require  that  every  grid  node  from 
l?i,ft  belongsUo  the  interior  of  at  least  one  F^,  i.e.  in 

particular  for  this  index  j  we  have  =  Kh(G j)  . 

According  to  above  definitions  and  assumptions  we 
can  formulate  the  following  iterative  domain  decomposi¬ 
tion  procedure. 


Algorithm.  Let  «°  6  I<h  be  given. 


Solve  sue- 
,m  the 


ccssively  for  «  =  0, 1, ...  and  for  j  =  1, 
following  subproblems:  Find  «a+"  G  I(h (Gy 
u?‘*r)  such  that 


a(«r^V>A-«r")>(/,n-t<A+") 

G  I(h(Gyvn*-) 

(3.5) 

THEOREM  3.1.  Under  the  assumptions  (Al)  and  ( A2 ) 

the  sequence  {ti^  m  convergence  for  every  j  (1  < 
j  <  m)  to  ihe  solution  function  Wft  of  the  problems 
(2.1),  (2.2)  for  any  6  Kh- 

Proof:  See  (2). 


4.  Examples 

EXAMPLE  4.1:  Let  be  a  rectangle  and  r  be  a  pos¬ 
itive  integer.  To  this  end  we  set  G\  =  fii  and  asso- 
0 

date  if  (ft)  with  G\  and  then  choose  Ga  =  G\  and 


=  Ta(vh) . 


(3.5) 


r 


t 

] 

I 

I 


=  (*2»*  =  l,2,...,ra$shown  in  Flg.fl’.  We  as¬ 
sociate^ K(Gj)  with  Gj  for vjjss  2,  ...,2r-f  1>  Because 
under  the  assumptions  made  the  conditions  of  Theorem- 
3. Tare  satisfied  the  corresponding  iterative :method  (3.5) 
is  convergent. 


-  Gii  v 

' 

Gi 

S.,  1 

.  . 

.  jrf'1 
}'r?> 

|rSM 


Fig.  1.  Decomposition  of  a  rectangle  ft  into 
subdomains  G\  =  12  and  (7j,  j  ==  2, 2r  +  T 

Numerical  experiments.  Let  ft  =  (0,1)  X  (Ojl) 
be  the  unit  square,  Tj  =  {(*i,®2)  |  Xj  as  1,  X2  € 
(0, 1)}  be  a  side  of  this  square  and  let  f  =  f(x  1,3*2) 
be  a  given  piecewise  constant  function  such  that 


The:results  of ‘Table  1  and  2  show;  that  the  iterative 
method  convergences  quite:rapidly  and  the  rate  of  con¬ 
vergence  becomes  better  for  bigger  values  of  r.  At  the 
same  time  the  rate  of  convergence  is  practically  indepen¬ 
dent  of  S.  Probably,  this  is  a  partial  property  of  the  con- 
cret  problem  under  consideration.  -  For  further  numerical 
tests  see  (2). 

References 


(  fi  =  const  <0,  X2  6  (0,  ?), 
\  fi  •=  const  >  0,  X2€(j,1). 


Partition  ft  into  subdomains  Gj,  j  =  l,2r+  1* which 
are  constructed  as  shown  in  Fig.  1,  where  r  is  a  positive 
integer.  It  follows,  that  we  set  G\  =  ft  and  Gj  are  mul- 
ticonnccted  domains  consisting  of  /  +  1  equal  rectangles 
for  even  values  of  j  and  t  equal  rectangles  for  odd  values 
of  j ,  where  <is  a  positive  integer,  j  =  2,2r  +  1.  For 
numerical  experiment  the  separate  rectangles  of  subdo¬ 
mains  Gj  were  chosen  with  sides  equal  to  (3  + 1  )h  and 
2/l  for  £1  and  X2  axes  respectively,  where  S  is  a  positive 
integer. 

Let  ke  be  a  number  of  steps  of  the  iterative  method 
(3.5)  under  the  assumptions  made,  which  are  required 
to  get  the  finite  element  solution  Uf,  on  T\  with  a  pre¬ 
scribed  accuracy  £.  It  is  obvious,  that  ke  depends  on 
values  of  S  and  m.  In  Tables  1  and  2  we  present  the 
dependence  of  ke  of  $  and  m,  which  was  established  by 
numerical  experiments  for  £  =  10-*5  and  suitable  cri¬ 
teria  for-  the  stopping  on  the  iterative  method.  We  set 
h  —  ^  in  Table  1  and  h  —  ^  in  Table  2. 
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{2]  Yu. A.  Kuznetsov  and  P.  Neittaanmaki,  Overlap¬ 
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J.L.  Lions  and  G.I.  Marchouk  (eds.),  Dunod,  Paris 
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Abstract:  HELENA,  -ay  new  -  software  »f or  tfie  modelling  of 
pseudomorphic  HEMT-' and  more  generally,' for  any- kind  of 
HEMTs  is  presented  Thissoftware  provides  the  DC,  AC' and 
noise  properties  in  die  cm  and  mm  wave  range.” It  is  very  fast,  easy 
to  use  and  needs  only  personal  computer. 

MNTRODUCTION. 

For  the  design  of  MMICs,  and: more  generally  for  the 
optimization  of  circuits  using  HEMTs  ,  the  knowledge  of  the 
high  frequency,  parameters  including  the  noise,  parameters 
constitutes  a  key  point.  Since  measurements  on  test  device  are 
difficult,  (especially  for  the,  noise  parameters),  a  theoretical 
modelling  is  very, well  suited, if  it  is  in  good  agreement  with 
experiments.  For  this,  purpose  a  new  HEMT-software  called 
HELENA  for  Hemt  ELEctrical  properties  and  Noise  Analysis 
has  been  developped. 

The  flow  chart  of  HELENA  is  given  in  figure  1 
It  is  divided  in  three  blocks : 

-The  layer  analysis 

-The  device  performance  modeling 


-The  results  display 


Fig.l:HELENA  flowshart 


II-THE  LAYER  ANALYSIS 

The  purpose  of  this  routine  is  to  obtain  the  charge  control 
law  by  a  schottky  barrier  of  any  HEMT  active  layer .  The  input 
parameters  are  the  thickness,  doping  (bulk  or  planar)  and 
composition  (%In,  %A1)  of  each  layer  constituting  the  active 
layer. 

This  modeling  is  based  on  a  simplified  selfconsistent 
resolution  of  Schrodinger  and  Poisson’s  equations  taking  the 


-strain,  effects -into  account  .  'As  the  matter  of  fact,  J. 
ALAMKAN  (1)  has  shown  that  the  two  first  energy  subbands 
are  related  to  the  sheet  carrier  density  in  2DEG  by  the  relation 
Ei=Ai  +  Bi,Ns  11] 

where' Aj  and  Bi  are  two  constants  and  Ns  is  the  sheet  carrier 
density.  This  result  allows  us  to,  write  a  simplified  charge 
control  law  for  electrons  in  the  2DEG  .  The  electron  supplying 
layer  for  which  quantum  effects  are  negligible  is  classically 
described  using. Fermi  statistics.  Layer  analysis  provides  for 
each  layer  the  sheet, carrier  density, (ionized  donnor  and  free 
carrier  density),  and  associated  capacitances  as  a  function  of 
the  gate  voltage.  Figure  2  shows  the  charge  control  law  in  terms 
of  capacitances  in  the  2DEG  and  in  the  electron  supplying 
layer  for  the  layer  shown  fig.  2. 


fig  2:  Charge  control  law 


112-THE  DEVICE  MODELING 

The  device  modeling  is  performed  using  a  quasi  2D 
approach  (2),(3).  In  this  approach  the  charge  control  law  ( in 
terms  of  sheet  carrier  density )  is  associated  with  an  electron 
transport  law  to  give  the  DC,  AC  and  Noise  properties,  The 
input  parameters  of  this  routine  are  the  transistor  topology  , 
the  charge  control  law  given  by  the  layer  analysis  previously 
described,  the  electron  transport  law  and  the  extrinsic 
parameters  of  the  small  signal  equivalent  circuit.  For  the 
electron  transport  law,  both  simple  v(E)  relationship  (suitable 
for  fitting  experimental  data)  or  hydrodynamic  equation 
(suitable  for  device  performances  predictions)  can  be  used.  A 
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^realistic  structure  including  g2te- recess  2$  well  as  suffice 
potential  and  carrier  injection  into  the  buffer  is  taken  into 
account. 

ilia  -.DC  AND  AC  CHARACTERISTICS 
For  given  \p  arid  !&,  electron  transport  2nd  Poisson’s 
equation  are  combined  to  give  a  quadratic  equation  (4). 
Solving  this  equation  at  each  step  from  source  to  drain  gives 
the  quantities  of  physical  interest  in  the  channel:  Electric  field, 
electron  velocity,  electron  energy- Drain  to  source  voltage  is 
obtained  by  integrating  the  electric  field  from  source  to  drain. 
The  DC  characteristics  feCVp ,  are  then  computed  (  see 
Kg- 3). 


fig  3:  DC  characteristics 

At  each  DC  point,  the  AC  performance  (  S-parameters  or 
small  signal  equivalent  circuit)  is  computed  using  the  active 
line  approach  (5). .  A  feature  of  this  approach  is  to  provide  not 
only  the  main  small  signal  parameters  (Gra,  Gj,  Cg*  Cgd)  but 
also  Rj,  Rgd,  r,  Cds  which  are  very  important  for  the 
determination  of  higfy frequency  device  behaviour  (see  Fig3). 


fig4  Transconductance  versus  gate-to-sourcc  voltage 


ISM*  NOISE  MODELING 

For  each  DC  poaa.  the  four  noise  jorzaetca  F* j*»  It*  C* 
r opt  as  well  as  associated  pin  (Gass)  and  nariam  available 
gain  (MAG)  are  competed  using  tbe  impcdance  field  method. 
Tbe  impedance  field  is  deduced  from  the  active  line  following 
the  method  described  in  (5).  Tbe  intrinsic  device  is  first 
considered  and  after  that  tbe  extrinsic  elements  (inductance, 
capacitance,  resistance)  are  introduced 
Figure  5  shows  the  frequency  variation  of  Fmxn  for  a  03  micron 
gate  length  pseodooorphic  HEMT. 


fig  5  Minimum  noise  figure  versus  frequency 
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Abstract  -  We  describe  a  two  dimensional  drift-diffusion  mtxM 
oed  to  nrnshtc  I InGaAs  JFETs  and  Barrier  Enhanced  MESFETs. 
We  also  describe  a  ooe  dimensional  model  used  to  investigate  the 
odti-Uyer  structure  of  BE- MESFETs. 

L  INTRODUCTION 

The  electron  transport  properties  place  InGaAs- based  field  effect  - 
transistors  (FETs)  amongst  the  most  promising  derices  m  the  design 
of  opto-electrock  integrated  circuits  for  high  speed  optical  fibre  com¬ 
munications  systems  (lj.  Standard  MESFET  technology  cannot  be 
csed  in  the  fabrication  of  these  FETs  as  the  Schottky  barrier  height  on 
n -doped  InGaAs  is  too  low  (~  02eV)  (23-  Hence  some  other  method 
of  achieving  transistor  action  b  necessary.  For  a  simple  n-chamiel  de¬ 
rice  two  different  structures  art; 

(1)  Junction  FET  (JPET)  -  a  p- layer  placed  tinder  an  ohmic  contact 
gate,  achieving  modulation  through  gate  voltage  control  of  the  deple¬ 
tion  width  of  the  p-n  junction  formed  under  the  gate. 

(2)  Barrier  Enhanced  MESFET  (BE-MESFET)  -  a  Schottky  contact 
to  an  intermediate  layer  of  higher  barrier  height  and  bandgap.  This 
higher  Schottky  barrier  and  conduction  band  discontinuity  between 
the  gate  layer  and  the  active  layer  giTcs  rise  to  an  effect  ire  barrier  on 
the  channel.  Modulation  is  then  achieved  in  a  similar  manner  to  the 
“classical"  MESFET. 

Numerical  simulation  of  each  of  these  options  is  necessary  in  the 
effort  to  derelop  and  optimise  this  technology.  Simulation  over  the 
entire  device  is  performed  using  a  2D  drift  diffusion  model  described 
bekw.  The  details  of  the  layer  structure  in  BE- MESFETs  are  more 
accurately  investigated  using  a  ID  model  This  involves  solving  the 
Poisson  equation  over  a  cross  section  iron  the  gate  contact  into  the 
bulk,  yielding  the  band  edge  profiles  and  electron  distributions  across 
the  layers. 

II.  THE  MODELS 

Drift-Diffusion  Model 

The  two  dimensional  model  is  based  on  a  previously  reported  MES¬ 
FET  simulator  (3].  It  uses  the  semi-classical  steady  state  description 
of  current  flow,  solving  for  the  Poisson  and  current  continuity  equa¬ 
tions 


V(«V^)  =  ,(n  -  p  +  JV.  -  WD) 

(0 

V(n^Vi-V(D.n)|  =  G 

p) 

V|r«,V<S  +  V(D,p)]=-C 

(3) 

-here  G  is  the  geneTStion-tecomhioAtioit  (G-R)  rate. 

G-R  is  modelled 

as  Schockley-Read-HaQ  type 


**’VB(p+nI)  +  ry(n  +  ni)  1  ' 

though  other  generation  models  can  easily  be  included. 

The  model  uses  the  finite  element  method  allowing  recessed  gate 
itnictures  and  many  other  non  rectangular  geometries  to  be  simu¬ 
lated. 

The  dependence  of  electron  mobility  in  InGaAs  with  electric  field 
is  modelled  by 


with  fa  —  50 00<m7 f  Vs  and  v, ut  =  1 X  I07cvn/s.  For  holes  a  constant 
mobility  /jy  =  200cm5/ Vj  is  used.'  Thu  ignores  overshoot  effects  to 
bring  the  simulations  into  line  with  measured  results  on  real  derices 
[4].  The  Einstein  relation  between  diffusirity  and  mobility  is  assumed 
for  both  carrier  species 

=  (6) 

In  simulating  JFETs  the  full  two  carrier  model  u  needed.  It  is 
possible  to  consider  only  electron  conduction  with  the  inclusion  of 
stationary  hole  charge  to  define  the  p-n  junction.  However  this  is  inad¬ 
equate  to  investigate  gate  leakage  current,  an  important  phenomenon 
in  these  devices. 

For  BE- MESFETs  we  need  consider  electron  conduction  only.  The 
effect  of  the  conduction  band  offsets  between  the  different  layers  in 
these  devices  is  included  by  solving  for  an  effective  potential  in  the 
current  continuity  equation 

^=^+x+y'"W  P) 

where  x-i*  the  electron  affinity  of  the  different  layers  and  1VC  the 
conduction  band  density  of  states.  This  approach  ignores  tunnelling, 
thermionic  and  quantum  effects,  but  is  valid  for  BE-MESFET  simu¬ 
lation  where  such  effects  are  negligible.  A  more  appropriate  model  u 
needed  where  transport  between  layers  may  be  a  predominant  effect 
(e-g.HEMTs). 

QflcJgiaTOigitfLMgdd 

Solution  of  the  Poisson  equation  over  the  ID  crow  section  requires 
an  accurate  description  of  the  dependence  of  electron  concentration 
on  potential  across  the  layers.  In  many  multi-layer  FET  structures 
the  electron  “gas”  is  degenerate  and  its  energies  often  quantised.  We 
ignore  quantisation  for  BE-MESFET*  but  do  account  for  degeneracy 
through  the  use  of  Fermi- Dirac  statistics.-This  gives 

"(0  =  -  E,(x))ltT)  (8) 

as  the  relation  between  conduction  band  edge  £«(z)  and  electron  con¬ 
centration.  Fj/2  is  the  Fermi-Dirac  integral  of  order  |  for  which  Ra¬ 
tional  Chebysher  approximations  are  known  (5).  The  fermi  level  Ep 
is  the  same  across  the  different  layers,  assuming  equilibrium  between 
the  layers,  thus  acting  as  a  natural  zero  level  for  energy.  Et{t)  is 
related  to  the  potential  £  by 

£<(*)  =  “#*)  +  A£c(*)  (9) 

where  A Et{z)  describes  the  band  offset  between  different  layers  in  the 
program  with  reference  to  the  gate  layer. 

Discretization  and  solution  of  the  resulting  non-linear  Poisson  equa¬ 
tion  are  easily  performed  over  the  ID  mesh.  Simulation  yields  the 
electron  concentration  and  band  edge  profile. 

III.  NUMERICAL  RESULTS 

Simulation  of  the  InGaAs  JFET  shown  in  fig.l  was  performed 
using  the  full  two  carrier  solver.  This  was  to  investigate  reverse  bias 
gate  leakage  current  Igs-Vgs  for  different  values  of  Vds  (fig.2).  The 
recombination  times  r„  =  r,  =  5  X  10“  4s  were  used  in  (4). 
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Figure  1:  InGaAs  JFET  crow  section. 
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Figure  3:  Band  Edge  profile 
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Figure  4'  Ids/Vds  curves  for  different -Vgs 


Figure  2:  Igs/Vgs  for  Vd*  =  0  5, 1.0, 1.5  and  2.0  V 

A  BB-MESFET  with  InAlAs  gate  and  buffer  layers  was  alio  stud* 
ied  using  both  ID  and  2D  simulators.  The  modulation  of  the  device  is 
dearly  seen  from  the  band  edge  profiles  in  fig  3.  These  also  show  the 
extent  of  the  barrier  to  reverse  bias  gate  leakage  current,  justifying  the 
use  of  (7).  The  Ids-Vds  DC  characteristics  are  shown  in  fig.4.  The 
gate  bias  dependence  of  both  the  sheet  charge  m  the  channel,  obtained 
using  the  ID  solver,  and  the  current  Ida  are  shown  in  fig.5.'Tt  can 
be  dearly  seen  that  gate  voltage  control  over  the  sheet  charge  in  the 
channel  is  the  dominant  effect  in  determining  the  transconductance 
of  the  device. 


\  ^.Vgs--2.0V 
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\y 

Y  ,Vgs-0.0V 

-X 

k\ 


0  05  05  15  1 6  2  2.4  2.0  35  3.6 

-Vgs  (Volts) 

Figure  5:  Ids  and  Sheet  diarge  dependence  with  Vgs 
IV.  CONCLUSIONS 

We  hare  described  a  full  2D  drift- diffusion  model  and  demon¬ 
strated  how  it  is  implemented  to  simulate  InGaAs  based  JFETs  and 
BE-MESFETs.  We  have  also  shown  how  a  more  accurate  ID  model  is 
used  to  describe  the  band  edge  structure  and  the  charge  distribution 
In  BB-MESFETs. 
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Abstract  The  optimal  thermal  design  of 
microwave  power  devices  is  complicated 
because  the  electrical  design  dictates  that 
the  layout-  of  the  structure  should  be  ,  as 
small  as  possible  and  the  thermal 
considerations  require  that  the  device  should 
be  large^for  minimum  thermal  resistance.  If 
such'  a  device  ^is  'to  be  used4  under  transient 
conditions  with  some  knovm  .duty  cycle  the 
requirement  for  the  device's  physical  size 
will  be  to,  some  extent  be -reduced^  but  the 
optimal  design-  f problem  -becomes  .more 
complicated.  This  presentation  outlines  some 
of  the  numerical- simulation  problems  which 
exist  for  the  transient  thermal  design  of 
heterojunction  bipolar  power  transistors 
(HBJT's)  and  a  method  of  simplifying  the 
design  process  is  suggested. 

1UOTRQPUCII.Q1U 

The  question  of  physical  size  in  the  design 
of  microwave  devices  is  always  extremely 
important  because  of  the  minimisation  of 
parasitic  components  in  the  electronic 
equivalent  circuit  and  matching  its  terminal 
impedance.  Generally  the  thermal  design 
dictates  that  the  device  should  be  as  largo 
as  possible  in  area. and  as. thin  .as  possible 
to  reduce  its  thermal*  resistance,  the 
benefits  being  greater  reliability  and  more 
predictable  electrical  characteristics.  The 
electrical  characteristic  will  bo  a  function 
of  temperature  whether  the  device  is  being 
used  under  CW  steady  state  conditions  or 
pulsed  according  to  some  duty  cycle. t  One 
problem  that  exists  with  pulsed  operation  is 
the  change  in  the  electrical  characteristics 
during  the  time  that  the-  devices  is 
operating.  The  thermal  design  of  a  microwave 
device  to  be  used  under  transient ; or  pulsed 
conditions  is  an  interesting  one  as  it  should 
be  possible  to  make  the. device  smaller  then  a 
device  designed  for  CW  operation  giving 
improved  bandwidth,  efficiency  and  power 
output.  Current  designs  using  HBJT's  consist 
of  a  variety  of  materials  with  temperature 
dependant  properties  requiring  3D  simulation. 
The  computer  time  required  to  perform  these 
transient  simulations  can  become 

embarrassingly  long,  and  this  paper  seeks  to 
outline  the  problem  and  suggest  a  convenient 
solution. 


The  vertical  geometry  of  the  device  to  be 
discussed. here  is  shown  in  Fig.l  and  shows  a 
simplified  cross  section  of  one  half  of  one 
emitter,  each  of  which  is  100  microns  wide. 
The  heat  source  dimension  is  estimated  from 
the  doping  concentrations  of  the  materials 
and  represents  the  volume  associated  with  the 
collector  depletion  layer.  Starting  with 
this  basic  emitter  structure  many  designs 
would  be  possible  based  on  varying  tho 
emitter  spacing  or  grouping  a  number  of 
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emitters  to,  form  cells-  from  which  larger 
power  devices  could  be  constructed. 


Fig.l.  Cross  section  through  the  structure 
showing  metallisation  and- dissipation  region. 


To  simplify  matters  we  will  concentrate  on 
the  simple  structure  of  Fig.l  and  assume  that 
the  device  has  a  large  number  of  equally 
spaced  emitter  fingers,  in  this  case  lOOum 
wide  spaced  by  48um.  Use  is  made  in  the 
simulations  of  the  planes  of  symmetry  which 
exist  in  the  structure 
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Three  dimensional  thermal  simulation  work 
has  been  undertaken  using  the  finite  element 
software  ANSYS,  and  two  in  house  simulators, 
one  based, on  a  finite  difference  technique 
and  the  other  using  the  Transmission  line 
matrix  method  (TLM) .  All  these  systems 
except  the  TLM  method  were  capable  of  solving 
both  steady  state  and  transient  thermal 
problems  and  take  into  consideration  the 
variation  of  thermal  conductivity  with 
temperature.  The  TLM  method  proved  to  be 
very,  useful  for  simulating  systems  where 
lengthy  time  periods  were  involved  and  was 
used  to  obtain  the  curve  shown  in  Fig. 2. 


Time  into  transient  IN  MlCROSECONOS 


Fig. 2.  Peak  Temperature  at  the  end  of 
repetitive  power  pulses.  Duty  Cycle»20% 
Pulse  Length  -  5  Microseconds. 

Fig. 2  shows  the  simulation  result  for  a  20% 
duty  cycle  and  5  microsecond  'on'  period. 
The  temperature  recorded  is  the  peak  value  at 
the  end  of  each  power  pulse.  Notice  that  the 
peak  temperature  has  not  reached  a  maximum 
and  the  process  is  likely  to  take  about  1 
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millisecond.  A  method  which  could'  obtain 
this,  result  with'  less  computation  effort 
.would,  be  valuable  and  , is  the  subject  of  the 
remainder, of  this; paper/' 


4.  AN  ALTERNATIVE  APPROACH-TO  THE  PROBLEM. 


Fig .“3  shows  the  increase  in  the  maximum 
temperature  of  the  structure  when  a  power 
pulse  is  applied.  The  peak  temperature  which 
would’  be  reached  is  324'  degrees  Centigrade 
neglecting  any  mounting  layers  (ie /solder) 
which  would  typically,  raises the  temperature  a 
further  20' degrees .  If  the  duty  cycle  of  the 
device  was  very,  small  this  curve  would 
accurately  predict  the  maximum  temperature 
for  a  required  length  of  power  input  pulse. 


i :Fig.3.  Temperature  Rise  starting  from  a. 
Uniform  Temperature  of  30  degrees  Centigrade. 


It  would  be  simple  to  produce  such  a  curve 
for  other  duty ^cycles  if  the  temperature 
distribution  at  the  beginning  of  the  power 
pulse  was  known.  It  seems  that -the  only,  way 
to  find  ,  this  distribution  would,  be  to 
simulate  the  repetitive  power  cycle,  the 
process  wo  seek  to  avoid.  The  alternate 
approach  is  to  first  use  a  steady  state 
simulator  with  an  input  power  reduced  by  a 
percentage  dictated  by  the  duty  cycle  and  use 
this  distribution  as  an  initial  boundary 
condition  for  the  start  of  the  transient 
simulation..  This  power  input- is  the  average 
of  that  for  each  cycle.  The  temperature  at 
some  point  in  the  bulk  of  the  device  will  be 
oscillating  about  a  mean  value  and  the 
further  away  the  point  from  the  power  source 
so  this  oscillation  magnitude  will,  decrease. 
In  the  case  being  studied  an  initial  power 
pulse  of  6.5  microseconds,  only  produces  a 
noticeable  change  in  temperature  within 
distances'  of  about  30  microns'  of  the  source. 
At  the  point  of  maximum  temperature  the 
temperature  simulated  using  this  reduced 
input, power  will  always  be  greater  than  that 
at  the  end  of  a  repeating  duty  cycle,  and 
therefore  if  this  steady  state  distribution 
is  used  as  an  initial  boundary  condition  for 
the  commencement  of  a  transient  simulation, 
the  resulting  temperature  will  always  bo 
overestimated. 


The  method  described  in  the  previous  section 
was  used  to  obtain  the  curves  shown  in  Fig. 4. 
They  show  the  relationship  between  peak 
temperature,  duty  cycle,  and  length  of  input 
power  pulse,  for  a  given  input  power,  base 
temperature . and  goometrical  layout.  Because 
of  the  simulation  procedure  used  the  results 
will  always  bo  pessimistic.  To  establish  the 
accuracy  of  the  curves  a  number  of  repetitive 
simulations  were  undertaken  for  a  variety  of 


duty  cycles  and. pulse 'lengths.  Some' typical 
results  are, shown  in  Fig. 5  where  the  first 
second  and: fifth  -  transient .cycles  are  shown. 


Fig. 5.  Temperature  Rise  during  * second •, third 
and  fifth  cycles. 


The  temperature  rises  tested  were  always 
within  5%  of  the  accurately  simulated  value, 
and  the  gains  in  computation  effort  were 
considerable. 


6.  DISCUSSION, 

The  method  of  -using-  a  steady  state 
simulation  and  corresponding  transient 
simulation  has  the  potential  to  reduce  the 
computation  effort  considerably.  An  estimate 
of  the  time  saved  to  produce  the  curves  shown 
in  Fig. 5,  assuming  10  points/decade  and  that 
the  transient  simulation  will  give  a 
reasonably  accurate  peak  temperature  within 
10  cycles,  is  a  factor  of  about  500.  A  5% 
error  in  the  predicted  temperature  rise  is  in 
reality  quite  satisfactory  as  problems  of 
reproducibility  in  bonding  layers  can  be 
comparatively  significant.  The  results  of 
the  simulation  using  all  the  simulators 
indicated  in  section  3  gave  very  similar 
results.  The  TLM  method  was  by  far  the  most 
useful  in  the  simulation  of  transients 
involving  large  time  periods,  but  it  was  not 
possible  to  couple  it  to  a  steady  state 
simulator  and  use  the  method  developed  here. 
Also  the  TLM  method  requires  relatively  large 
amounts  of  data  space,  about  8  times  the 
equivalent  finite  difference  approach. 

7...CQNCWaMi 

The  method  outlined  in  this  paper  shows  that 
the  time  required  to  compare  and  assess  the 
thermal  design  of  proposed  HBJT  structures 
for  use  in  transient  applications  may  be 
reduced  considerably  with  a  modest  compromise 
in  the  accuracy  obtained. 

The  authors  acknowledge  the  help  of  Plessey 
Research  Caswell  in  supplying  typical 
geometrical  and  process  design  data. 
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Abstract  -  A  general-purpose  2D  device  simulator  (SUDS)  has 
been  'enhanced  >  to*  investigate  the  .  specific  •  mechanisms  -  which 
cause".  DC  .gain  ‘  degradation;* in  -  AIGaAs/GaAs  heterojunction 
bipolar  transistors  (HBTs).  Access'  to  the  distributions  of 
potential  and  electron '  *  and  hole .  concentrations . .  from  > ,  the . 
numerical-  simulator  greatly  aid ' the  analysis  -  which  shows ,  that 
surface  recombination  asv  opposed  a  to  .surface-  Fermi  ,  level 
pinning  is  responsible  for  the  degradation  of  gain: 

I  INTRODUCTION 

AIGaAs/GaAs  heteroj  unction  bipolar  transistors  are 
becoming  increasingly  important  for  applications  in  microwave 
Integrated  circuits  due- to  their  very  high  switching  speeds.  In 
the  fabrication  of  these  devices  either  '  an  etch:  down  to  the 
AlGaAs  layer  or  an  Implant  Is  used  to  isolate  the  emitter  and 
base  contact  regions.  It  is  generally  .understood  that  surface 
effects,  namely  surface  Fermi,  level  pinning  and  surface 
recombination  are  the  causes  of  gain  degradation  In  both 'of 
these 'structures  [1). 

The  modelling  of » such  highly  non-linear  two-dimensional 
effects  is  best  achieved  by  a  numerical  device  simulator  whkh 
solves  the  basic  equations  of  Poisson  and  current  continuity,, 
A  general  .purpose  device  simulator  ,  (SUDS),  capable  of 
handling  arbitrary  doping  and  geometry,  has  been-/ developed 
and  recently  enhanced  to  handle  heterojunctions  and, surface 
effects.  In  this  paper  we  have  studied  the  effects  of  different 
configurations  surface  charge,  and  recombination  centres  and 
have  isolated  the  mechanisms  whkh  significantly  control  the 
basic  current  causing  the  degradation  in  gain. 

n  THE  NUMERICAL  MODEL 

The  device  modelling  package 1  used  In  this  work  solves 
the  semiconductor  heterojunction  device  equations  on  a 
arbitrarily  shaped  two  dimensional  domain.  The. physics  of 
the  semiconductor  are  modelled  by  a  set  three  partial 
differential  equations,  namely,  Poisson's  equation 


Tfie.  method  involves  the  integration  .of  each  of  the 
^equations  over’ the  control  (Vororioi)  .area  .associated  with .  each 
mesh  node.  ,  By _  application  of  the  divergence,  theorem,  the 
resulting '.surface ..  integrals"  can.  be  converted  to  line  integrals 
around  the  perimeter  .of  the  control  area  'So  that  for  a  node 
'i’Ja  the  mesh  the  discrctised  ‘equations  become 

f#«)  -j|]pijdij  -  pi*i  -  o  <7> 

FVn(*;n)  -I'JnlJ-llJ  (8) 

*yp(-rp)  +  ^UjA,  -  0  (9) 

where  •  Mj .  is ,  the.  number,  of  nodes  connected  to  node  I.  The 
constitutive  relations  (4)  -  (6)  are  then  discrctised  using  the 
standard  finite  difference  method  In  the  case  of  B  the  electric 
field 'displacement  and  by -the  Scharfetter-Gummel  (4)  method 
In  the  case  of  electron  and  hole  currents  along  edge  ij.  The 
resulting  set  of  non-Hnear  equations  (F^(\f),  F^fop) 
^v’n(v’n)),  solved  using  the  Newto'n-Raphson  method  for 
the  solution  variables  ^  and  v>p.  using  either  a  Gummel 
decoupled  scheme  or  a  coupled  scheme. 

The  large '  linear  equation  set  formed  at  each  Newton 
iteration-  is  solved  using  the  1CCG  method,  for  symmetric 
matrices  and  the  CGS'  method  for  non-symmetric.  Both  of 
which- use  an  extremely  compact  sparse  storage  scheme  whkh 
minimises  the  memory  requirements. 

m  THE  PHYSICAL  MODELS 

The  physical  models  described  here  are  for  the 
AIGaAs/GaAs  material  system,  however,  SUDS  has  built-in 
models  for  most  other  common’  semiconductors.  The 
mathematical  models  describing"  physical  processes  require  a 
number  of  empirical  parameters  to  fit  them  to  the  specific 
material. 


9  .  d  -  p  (I) 

and  two  continuity  equations  one  for  the  electron  distribution 


and  one  for  the  hole  distribution. 

V  ,  |n  -  qU 

<2> 

7  .  2p  -  -qU 

0> 

These  fundamental  equations  require  constitutive  relationships 
for  the  vector  flux  terms.  The  electric  displacement  term  £>. 
is  expressed  as 

D  -  «E 

W 

and  the  current  densities  are  expressed  In  terms  of  the 
modified  drift-diffusion  relationships 

in  *  "MW*  7  (t  *  *n>  +  9Dn7n 

(5) 

Ip  -  -WpP  7  (#  +  *p)  -  q»p7p 

<«> 

where  the  ^  and  parameters  described  the  compositional 
variation  of  the  semiconductor  (2).  The  basic  set  of  equations 
(1)  -  (3)  are  discrctised  using  the  control  region  method  (3J, 
on  a  triangular  mesh. 


At— MqVMy 

Since  the  -AIGaAs/GaAs  material  system  is  a  ternary 
compound,  the  mobility  model  must  take  account  of  the  extra 
scattering  caused  by  the  alloy,  in  addition  to  the  standard 
phonon  and  Impurity  scattering  mechanisms  of  the 
homogeneous  material.  Experimentally  this  processes  manifests 
itself  as  a  reduction  In  the  iow-field  mobility  which  decreases 
as  alloy  mole  fraction  (x)  increases.  The  mobility  model 
Implemented  in  the  current  work  is  thus  a  function  of 
temperature,  net  doping,  alloy  composition  and"  electric 
field  (5)[6J.  The  standard  expression  is  used  for  lattice 
scattering 


PL  "  Po 


T  ya 

30oJ 


(10) 


where  T  is  the  lattice  temperature,  yx>  is  the  low  field 
mobility  In  GaAs  and  a  is  an  experimental  fitting  term. 
Impurity  scattering  is  modelled  by 


P 


where  P  H  the  net  doping  and  the  fitting  parameters  p,  Nref 
and  a  are  determined  from  experimental  data  (51. 
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The  high  field '  mobility  of.  carriers  In  small geometry,/ devices 
Is  of  critical  importance.  The  model  u$ed  here  for- electrons 
is  the  well  known  model  for  IH-V  materials  [6] .  which 
includes -the  intervally  scattering  of  electrons  •  causing  negative 
differential ,  resistance  in  lightly  doped /structures. 


«*<E» 


(12) 


where  V$  is  the  saturation .velocity^ind.^^Is  the  critical' field. 
Velocity  overshoot  effects  are  not  included  here'- since  they- are 
localised  to  the  collector  depletion  region/’  andso'playno  role 
in  the  phenomena  currently  under  consideration , 


B.  Recombination 

Bulk.  .  recombination  is  modelled  by-  the 
Schockley-Hall-Read  (S HR)  process  and  are  ■  characterised  by 
two  minority  carrier  lifetimes,  and; 
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(13) 


The  lifetimes  are  dependent  on  the  local  net  doping.  Auger 
and  Impact  ionisation  are  also  options  but  are  not  selected  for 
this  device. 

Surface  effects  are  modelled  by  the  incorporation  of 
surface  charge  and  a  separate  surface  recombination 
mechanism.  Surface  charge  is  defined  as  a  fixed  donor  -  or 
acceptor  charge,  which  Is  an  additive  term  to  p.  In  Poisson’s 
equation  (1).  The  increased'  recombination  at  the  surface  Is 
assumed  to  arise  from  discrete  energy  ■  levels  which  can  be 
defined  to  be  donor  of  acceptor  levels.  For  the  trap  at  each 
energy  level  a  separate  SHR  model  Is  established. 
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where  rM  and  jpJ  arc  the  minority  carrier  lifetimes  associated 
with  the  trap,  and  n-p  and  p-p  are  the  electron  and  hole 
concentrations  that  would  exist  In  equilibrium  If  the  Fermi 
level  sat  at  the  energy  of  the  trapping  centre. 
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Fig.l.  Typical  structure  of  AlGaAs  HBT 


Fig.2.  I-V  characteristics  for  structure  In  Fig.l. 


IV  RESULTS 

Fig.  1  shows  a  typical  AlGaAs/GaAs  HUT  used  in 
microwave  Integrated  circuits.  Flg.2  shows  a  comparison  of 
the  IV  characteristics  of  the  structure  with  and  without  surface 
effects,  and  clearly  demonstrates  the  significant  Increase  In  the 
base  current  below  approximately  Vgg  ■  1«1V,  Fig.  2  also 
shows  the  position  of  the  traps  and  the  pn  base-emitter 
junction  in  the  AlGaAs  layer.  The  surface  cb»/ge  was  set  to 
2*6  x  10l,cnT2  and  the  trap  depth  was  set  to  the  Intrinsic 
Fermi  level  Epj.  With  surface  recombination  switched  out  of 
the  simulator,  the  base  current  reverted  back  to  Its  former 
level  demonstrating  that  the  surface  charge  alone  had  little 
influence  on  the  terminal  base  current. 


V  CONCLUSIONS 

The  highly  non  linear  models  associated-  with  surface  effects 
have  been  incorporated  into  SUDS  to  demonstrate  their  effects 
on  the  characteristics  of  the  AlGaAs/GaAs  HBT.  Surface 
recombination  U  shown  to  be  the  primary  cause  of  gain 
degradation  at  low  bias  levels. 
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Abstract 

A  two-dimensional  simulation  of  the  heterojunction  bipolar  transis¬ 
tor  is  described  which  has  been  used  to  understand  the  processes 
taking  place-’  in,  real-  devices,  jn  this.,  paper  the  effects  of 
electron/hole'recombination  in  implanted  regions  is  discussed  and  it 
is  also  shown  that  misalignment  between  the  doping  and  alloy  junc¬ 
tions  at  the  emiuer/base  interface  can  significantly  degrade  the  dev¬ 
ice  performance. 


INTRODUCTION 
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Mathematical  models  have  long  been  used  to  understand  and  hence 
improve  the  operation  of  semiconductor  devices.  These  models,  at 
their  simplest,  can  give  a  closed  from  analytical  solution  which  in 
the  past  has  been  instrumental  in  the  development  of  improved 
structures.  However  for  modem  devices,  with  their  reduced  dimen¬ 
sions  and  high  internal  electric  fields,  the  approximations  necessary 
for  an  analytical  solution  arc  no  longer  valid.  It  has  therefore  been 
necessary  to  develop  numerical  models  where  more  accurate 
descriptions  of  the  device  physics  can  be  included  albeit  at  the 
expense  of  solution  time. 

This  paper  will  describe  how'  a  two  dimensional  model  has  been 
used  to  understand  and  aid  the  design  of  hetcrojunciion  bipolar 
transistors  (HBTs). 

THE  MODEL 

The  device  modelled  in  this  work  is  the  GaAs/AlGaAs  HBT  where 
the  wide  band  gap  emitter  is  the  AlGaAs  alloy  with  the  mole  frac¬ 
tion  of  aluminium  in  the  region  of  0.2  to  0.3  and  the  base  is  GaAs. 
The  emitter/basc  junction  can  be  abrupt  or  graded.  Being  a  bipolar 
device  the  recombination  of  electrons  and  holes  ir«  an  HBT  is  an 
important  factor  which  in  this  model  is  described  by  the  Shockley- 
Hall-Read  mechanism  via  recombination  states  at  mid  gap.  The 
model  allows  for  a  wide  variation  of  recombination  rates  in  different 
regions  of  the  device  to  account  for  the  effects  of  surfaces,  material 
interfaces  and  various  implants.  Figure  l  shows  the  basic  geometrv 
of  the  devices  modelled.  In  practice,  owing  to  the  symmetry  of  (he 
structure  only  one  half  of  the  device  needs  to  be  simulated  with  a 
consequent  saving  in  computer  run  time. 

A  drift-diffusion  description  of  the  HBT  has  been  implemented  tak¬ 
ing  into  account  the  dependence  of  earner  velocity  on  electnc  field 
and  the  variation  of  the  semiconductor  properties  across  the  junc¬ 
tion.  The  device  equations  are  solved  by  finite  difference  tech¬ 
niques  on  a  two-dimensional  gnd  which  is  automatically  refined  to  a 
smaller  mesh  size  in  regions  where  the  electneal  potential  is  varying 
rapidly. 


Fig.  1.  Heterojunction  Bipolar  "IYansistor  structure 
modelled  m  this  work.  (NB  ‘A  of  device  shown) 

A  number  of  effects  can  be  studied  with  the  aid  of  this  model  but 
this  paper  will  concentrate  on  two  areas  where  it  has  been  used  to 
interpret  the  measured  performance  of  actual  devices  fabricated  at 
the  GEC/Plessey  Research  Centre,  Caswell.  The  two  areas  are 
firstly,  the  effect  of  an  implanted  oxygen  isolation  layer  between  the 
emitter  and  base  and  secondly,  the  consequences  of  a  misalignment 
between  the  doping  and  alloying  junctions  at  the  base  emitter  inter¬ 
face.  The  misalignment  problem  has  also  been  studied  using  a  one 
dimensional  model  of  a  p-n  heterojunction  because  it  is  much  faster 
to  run  than  the  full  two  dimensional  case. 

THE  EMITTER/BASE  ISOLATION  IMPLANT 
The  HBT  shown  in  Figure  1  is  a  planar  structure  which  has  a 
number  of  advantages  during  fabrication.  A  less  desirable  feature 
however  is  that  the  base  is  tn  contact  with  the  emitter  across  a  verti¬ 
cal  plane  extending  down  from  the  surface.  This  effectively  means 
that  electrons  will  be  injected  into  a  region  where  the  base  is  much 
wider  than  the  optimum  (as  defined  by  the  region  under  the 
emitter).  The  injected  electrons  will  therefore  take  a  longer  time  to 
traverse  the  base  and  there  will  also  be  a  higher  probability  of  their 
recombining  on  the  way.  These  effects  will  lower  both  the  fre¬ 
quency  response  and  the  current  gam  of  the  device  -  both  undesir¬ 
able  effects.  One  solution  to  this  problem  is  to  introduce  the  isola¬ 
tion  layer  shown  in  Figure  1  between  the  emitter  and  base  by  means 
of  an  oxygen  implant  which  effectively  creates  high  resistivity 
material  in  the  required  region. 

The  oxygen  implant  effectively  creates  a  region  of  heavy  crystalline 
damage  which  means  that  there  will  be  a  high  density  of  localised 
energy  levels  in  the  forbidden  gap  which  will  in  turn  make  the 
electron/hole  recombination  process  much  faster  This  implant  is 
therefore  modelled  as  a  region  having  a  much  shorter  recombination 
time  than  the  rest  of  the  device. 
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Figure  2  shows  die  effect  of  this,  recombination  on  the  current  gain.. 
In  this  Figure,  device  A  has  a  recombination  time  of;5Xl079s  in  • 
the  GaAs  base  and  2xlO_10s  in. the  implanted  AIGaAs  base  con¬ 
tact,  Device  B  is  the  same  but  with  the  addition  of  a  recombination 
time  of  4X10*ns  in,. the  isolation; implant.  The  current -gain  of 
about  30  for, device  B, is,  typical  of  what. is  observed  in  practice. 
The  model  was  also  used  to  investigate  the  effect  of  recombination 
at  the  top  surface  and, the  emitter/base  interface,.  It  was  found  that 
recombination. at  the  top  surface;had< little. or^ no  effect. while  a 
recombination  centre  density  of  greater  than  l053cm"3  at  the 
emitter/base,  interface  was  -  needed  to  account  for  the  observed 
reduction  in  current  gam.  This  density  is  much  higher  than  would 
be  expected  for  modem  technology. 
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Fig.  2.  The  effect  of  recombination  in  the  isolation 
region  on  the  current  gain. 

By  using  this  model  it  was  therefore  shown  that  out  of  a  number  of 
possible  causes,  excessive  recombination  in  the  isolation  implant 
region  was  the  most  likely  reason  for  the  poorer  than  expected  per¬ 
formance  of  these  devices.  It  would  have  been  difficult  to  isolate 
the  effects  of  recombination  in  the  different  regions  without  the  aid 
of  a  suitable  model, 

ALLOY/DOPING  MISALIGNMENT 
Simple  hcterojunction  theory  assumes  that  the  boundary  between  the 
n  and  p  regions  corresponds  exactly  with  the  change  from  AIGaAs 
to  GaAs.  In  practice,  owing' to  the  limitations  of  the  technology  a 
misalignment  of  these  junctions  can  easily  occur  In  the  devices 
studied  here  the  most  likely  reason  for  this  misalignment  is  the 


Fig.  3.  Schematic  representation  of  a  misalignment 
between  the  doping  and  alloy  junctions. 


diffusion  of  acceptors  from  the  highly  doped  GaAs  base  into  the 
AIGaAs  emitter.  This  effect  is  illustrated  in  Figure  3  which  shows 
the -movement  of  the  p-type  doping  (shaded  black)  into  the  n-type 
emitter  (hatched)  which,  for  the  purposes  of  this  paper,  is  defined  as 
a  positive  shift. 

A  one  dimensional  p-n  heterojunction  model  has  been  used  to  study 
the  effect  of  misalignment  on  the  emitter  injection  efficiency  The 
emitter  injection  efficiency,  defined  as  the  percentage  of. the  total 
emitter/base  current  carried  by  the  emitter  majority  carriers  (in  this 
case  electrons),  is  the  most  important  quantity  affecting  the  gain  of 
the  device.  The  valence  band  discontinuity  in  an  HBT  virtually 
eliminates  the  hole  current  giving  an  efficiency  of  close  to  100% 
and  hence  a  very  high'  gain.  Figure  4  shows  the  reduction  in 
emitter  injection  vefficiency  caused  by  various,  positive  shifts  up  to 
40  nm.'.  (Negative  shifts  give  little  or  no  effect.)  Shifts  of  a  few 
nanometres,-  corresponding  to  movements  of  the  junction  through 
about  10  atomic  layers,  give  significant  reductions  in  the  injection 
efficiency,  sufficient  to  have  a  significant  effect  on  the  current  gain 
of  an  HBT.  (factors  of  10  are  typical.) 
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Fig.  4.  The  effect  of  misalignment  on  the  emitter  injection 
efficiency. 

The  reduction  m  the  injection  efficiency  levels  off  for  shifts  greater 
than  about  40  nm  because  at  thu  point,  for  the  doping  levels  con¬ 
sidered,  the  depletion  regions  associated  with  the  pm  junction  are 
all  with.n  the  emitter  and  the  alloy  junction  plays  no  further  part  in 
the  transistor  action.  It  is  also  expected  that  graded  alloy  junctions 
will  be  less  susceptible  to  the  effects  of  misalignment. 

CONCLUSION 

This  paper  has  described  just  two  technological  problems  that  can 
have  a  bearing  on  the  performance  of  an  HBT.  By  using  the  model 
individual  features  can  be  studied  in  isolation,  a  process  which  is 
difficult  m  real  devices  because  of  the  uncertainties  of  the 
fabrication  process  and  characterisation  techniques.  This  in  turn 
can  be  fed  back  to  the  device  engineer  to  give  insight  into  the 
important  parameters  which  must  be  controlled  during  fabrication. 
Without  such  a  model,  it  is  very  difficult  to  identify  these  parame¬ 
ters. 


MONTE  CARLO  SIMULATIONS  OF  TRANSPORT  jN  P-TYPE  HETEROSTRUCTURES 
R.  W.'Kclsall  and  R.  A. 'Abram 
Applied  Physics  Group,  School  of  Engineering  and  Applied  Science, 

University  of  Durham,  South  :Road,  Durham,  DH1  3LE,  UK. 


Abgtrai;^ 

A  Monte  Carlo  transport  simulation  is  described, which  in¬ 
corporates  a  realistic  description  of  the  valence  bandstructure'  in 
semiconductor  quantum  wells,  including;  heavy  -  light  hole  mix¬ 
ing  and ‘strain  effects.  The"  simulation  is  used  to  analyse' hole 
transport  in  lattice'  matched  GaAs/AlAs  and  peeudomorphlc  In; 
GaAs/GaAs  structures.  The,77K  phonon-limited' mobility  in 
the  pseudomorphic  well  is' nearly’ six 'times  that 'In  the  lattice 
matched  "well,  but  much  of  this  enhancement  is  removed  by 'alloy 
scattering. 

Introduction 

For  several  years  there  have. been  aspirations,  to  develop  a 
complementary  logic  process  technology  in' III-V  semiconduc- 
'tors..  Such  a  logic  system  ought,  in  principle;  to  combine'the 
extremely  low  stand-by  power  dissipation  of  .CMOS  with  t  the 
high  speed  of  the  HI-V  devices.  However, ,no  practical  system 
has  yet  been  realised; 'a  major  setback  being  the  limited  speed 
achieved  in  the  p-type  devices.  The  incorporation  of  strained 
InGaAs  layers  in  modulation  doped  FETs  has  opened  up, a  new 
route  to  a  potentially  fast  p-type  device*  It  is  found  that  the 
axial  strain  present  in  a  pseudomorphic  heterostructure  shifts 
the  energy  levels  of  the  heavy  hole  subbands  relative  to, the  light 
hole  subbands.  This  can  lead  to  a  reduced  effective  mas*  in  the 
highest  valence  subband,  which  would  be  expected  to  give  rise 
to  enhanced  hole'  mobilities  and  saturation  drift  velocities  |l]. 
However,  the  mobilities  and  drift  velocities  measured  to  date  in 
p-type  pseudomorphic  InGaAs  heterostructures  have  been  disap¬ 
pointingly  low  (2-4],  showing  little  improvement  over  the  values 
obtained  in  lattice  matched  systems.  The  reasons  for  this  dis¬ 
crepancy  between  expected  andobeerved  result*  have  not  been 
identified,  and  no  theoretical  analysis  has  been  undertaken. 

Any  such  analysis  of  p-type  heterostructures  must  take  into 
account  the  detailed  form  of  the' valence  subband  structure, 
which  can  be  severely  distorted  from  the  usual  near-parabolic 
form  by  qu&ntum-sixe-induced  mixing  of  the  heavy  and  light  hole 
states.  This  mixing  also  affects  the  hole  wavefunctions,  which 
axe  important  in  determining  the  transition  rates  for  the  vari¬ 
ous  scattering  processes  active  in  the  device.  We  have  developed 
a  Monte  Carlo  simulation  scheme  which  satisfies  all  the  above 
requirements  The  hole  subband  energy  dispersions  and  wave- 
functions  are  calculated  using  the  k.p  method,  and  are  then  used 
to  calculate  the  scattering  rate#  as  a  function  of  hole  wavevector. 
Hole  transport  is  then  simulated  in  the  plane  of  the  heterostruc¬ 
ture  active  layers  taking  into  account  the  subband  energies  and 
group  velocities,  and  the  angular  dependences  of  the  heterostruc¬ 
ture  scattering  rates.  We  have  investigated  two  cases:  i)  a  lattice 
matched  GaAs/AlAs  single  quantum  well  and  ii)  a  pseudomor¬ 
phic  InG&As/GaAs  single  quantum  well.  In  each  case  we  have 
taken  the  lattice  temperature  to  be  77K,  in  order  to  facilitate 
comparison  with  a  range  of  experimental  results.  These' simula¬ 
tions  represent,  to  the  best  of  our  knowledge,  the  first  detailed 
models  of  hole  transport  in  p*type  heterostructures,  allowing  us 
to  investigate  the  origins  of  the  speed  limitations  of  both  lattice 
matched  and  pseudomorphic  devices. 

Calculations 

Figures  la  and  b  show  the  in-planc  energy  dispersion  of  va¬ 
lence  subbands  in  the  GaAs  lattice-matched  and  InGaAs  pseu¬ 
domorphic  quantum  wells  respectively.  The  GaAs  quantum  well 
width  is  100A,  whilst  the  InGaAs  well  width  u  90A  and  the  In 
concentration  is  x  =  0.18.  The  bandstructures  show  relatively 
httle  anisotropy  in  the  quantum  well  plane,  and  we  have  assumed 
them  to  be  isotropic  in  the  following  calculations.  Strong  heavy  - 
light  hole  mixing  is  clearly  evident  in  the  GaAs  well,  with  severe 
distortion  of  the  subbands  near  regions  of  repulsion  (‘avoided 


crossings’)  between  adj'acent  subbands.  .The  second  and  fourth 
highest  subbands  have’ energy  minima  which  are' displaced  from 
the  zone  centre,- leading, to. regions  of  extremely. high  densities' 
of  states.  The  effective  mass  at  the  zone  centre  of  the  highest 
subband  is  0 156,‘  but  rises  to  a  maximum  value  of  0  901  due  to 
'  interaction  with  the  next  highest  subband.  By  comparison  the 
valence  bandstructure  in  the  InGaAs  well  is  much  less  affected 
by  mixing,  and  the  effective  mass  in  the  highest  subband  varies 
from  0  126  at  the'  zone  centre  to  0.434  for  an  m-plane  wavevector 
fc,  ofo.iAr*.  ,  ' 
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Figure  1  In-plane  Yalence  subband  dispersion  for  (a)  a 
GaAs/AlAs  lattice  matched  and  (b)  an  InGaAs/GaAs  pseu¬ 
domorphic  quantum  well. 


The  hole-phonon  scattering  rates  are  dependent  on  the  sub- 
band  energies,  tho  quantum-confined  hole  wavefunctions  and  the 
densities  of  states.  We  have  obtained  this  information  from  the 
k.p  calculations  for  each  required  value  of  klt  which  enabled 
us  to  calculate  the  integrated  hole-phonon  scattering  rates  out 
of  any  initial  kt  state  into  any  allowed  final  state,  via  intra- 
and  inter-subband  transitions.  We  have  included  optical  phonon 
scattering  via  the  FVohlich  (polar)  and  deformation  potential  in¬ 
teractions,  and  acoustic  deformation  potential  scattering,  Rates 
for  piexolectric  scattering  were  calculated  for  the  GaAs  quantum 
well,  but  found  to  be  negligible,  and  the  calculations  were  not 
repeated  for  the  InGaAs  case. 

In  figure  2  we  have  shown  the  total  hole-phonon  scattering 
rate  in  the  highest  valence  subband  of  the  GaAs  and  InGaAs 
quantum  wells  vs.  initial  energy.  The  features  labelled  on  the 
diagram  correspond  to  thresholds  for  the  principal  transitions. 
Polar  optical  scattering  is  the  dominant  process  in  both  quan¬ 
tum  wells;  however,  all  processes  are  considerably  stronger  in 
the  GaAs  well.  This  is  due  to  the  enhanced  density  of  states 
in  the  GaAs  valence  subbands,  the  spike-like  structures  at  the 
thresholds  for  1-2  and  1-4  scattering  correspond  to  the  peaks  in 
the  density  of  states  at  the  off-zone-centre  energy  minima  in  the 
second  and  fourth  subbands. 

For  the  InG&As  quantum  well,  alloy  scattering  must  also  be 
considered.  We  calculated  rates  for  intrasubband  alloy  scatter¬ 
ing  using  the  2D  density  of  states,  but  taking  the  bulk  form  of 
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the  scattering  matrix  element  [5].  *  We  are  currently  working  on 
calculations  of  the,  matrix  element  using  the  quantum  confined 
k.p  wavefunctions  in.  the  manner  described 'above  for  phonon 
scattering.  There  appears  to  be  no  mforination  available  on  the 
alloy  scattering  potential  A  B  for  holes  in  InGaAs  One  possi¬ 
ble  prescription  is  to  take  the  difference  between  the  band  edge* 
energies  of  the  constituent  binary  compounds,  measured  relative 
to  the  vacuum' level.  This  gives  a  value' of  A  Ei=  0.267eV.'In‘ 
our  simulations  we  have  also  used  the  value  'of  A B  ~  0  42eV 


Figure  2  Ibtal  77K  hole-phonon  scattering  rate  in  the  high¬ 
est  valence  ribband:  —  GaAs/AlAs,  ■  —  •  InGaAs/GaAs 
quantum  wells,  The  labels  mark  the  thresholds  for:  Op  1- 
lem,  l-2em,  l-4em;  optical  phonon  emission  scattering  into 
the  highest,  second  highest  and  fourth  highest  subbands  re¬ 
spectively,  and  Ac  1-2,  acoustic  phonon  scattering  into  the 
second  subband. 


Rtwin  and  PlKuntoa 

Figure  3  shows  the  77K  velocity-field  characteristics  for 
GaAs  lattice  matched  and  InGaAs/GaAs  pseudomorphic  het¬ 
erostructures,  For  .the  InGaAs  system  we  have  shown  results 
calculated  without  alloy  "scattering,  and  with  alloy  scattering  po¬ 
tentials  of  0  267  and  0.42eV,  The  phonon-limited  drift  velocities 
in  the  InGaAs  quantum  well  are,  as  expected,  considerably  larger 
than  the  measured  and  simulated  values  in  GaAs  heterostruc¬ 
tures.  The  simulated  phonon-limited  low  field  mobility  is  ap¬ 
proximately  38, 000cm5 which  is  nearly  six  times  larger 
than  our  simulated  value  of  6,400cm3  V*1*”1  for  the  GaAs  well. 
Alloy  scattering  gives  rise  to  significant  reductions  in  the  drift 
velocity  and  mobility;  we  obtained  mobilities  of  approximately 
15,000  and  1 0,000cm3  V”1*”1  withalloy  .scattering  potentials  of 
0.267  and  0.42eV  respectively.  Even  this  latter  result  is  some 
33%  larger  than  the  highest  experimental  results  obtained  (2], 
and  the  drift  velocities  at  higher  fields  are  well  above  the  ex¬ 
perimental  values  shown  in  figure  3,  In.contrast,  the  simulated 
characteristic  for  the  GaAs  quantum  well  agrees  well  with  the 
two  experimental  curves  shown,  both  measured  on  single  GaAs 
heterojunctions.  The  simulated  low  field  mobility  also  agrees 
v/ell  with  experimental  data  [8,0]. 

The  low  mobilities  observed  experimentally  in  the  InGaAs 
system  could  be  due  to  impurity  scattering,  which  was  not  in¬ 
cluded  in  our  simulations.  However,  there  u  no  clear  reason 
why  this  process  should  be  any  more  important  in  the  InGaAs 
than  in  the  GaAs  quantum  well.  One  process  which  may  well 


be  more' prevalent  in  the  InGaAs  system  is  interface  roughness 
scattering,  since  the  InGaAs/GaAs  growth  technology  u  ^ot  as 
well  established  as  that  for  GaAs/AlGaAs  and  GaAs/AlAs  inter¬ 
faces.  More  work  on  alloy  scattering  is  also  needed  to  determine 
whether  a  higher  upper  limit  than  that  used  hem  is  appropriate 


Figure  3  77K  hole  drift  velocity^#,  in-plane  electric  field. 
—  GaAs  simulation;  *,  b,  -f-,  InGaAs  simulations'; 

1  GaAs  experiment  [7];  •  •  ■  InGaAs  experiment  (4). 


Conclusions 

The  Monte  Carlo-k.p  model  described  above  represents  a 
valuable  tool  for  analysing  hole  transport  in  p-type  heterostruc¬ 
tures.  Our  simulations  predict  that  the  77K  phonon-limited 
drift  velocities  and  low  field  mobility  in  a  pseudomorphic  In¬ 
GaAs/GaAs  quantum  well  are  significantly  larger  than  in  a  lat¬ 
tice  matched  GaAs/AlAs  well.  Alloy  scattering  is  important  in 
reducing  the  mobility  in  the  InGaAs  system,  but  we  suspect  that 
other,  hitherto  unidentified  scattering  processes  arc  also  respon¬ 
sible  for  the  low  values  obtained  experimentally. 

We  wish  to  acknowledge  E.  P  O’Reilly,  W.  Batty  and 
A:  C.  G.  Wood  for  their  helpful  advice  and  the  provision  of  band- 
structure  data. 
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Semiconductor  Device  Modeling; 
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Introduction 

High  numerical  accuracy,  is  essential  to  achieve  conver¬ 
gence  and  reliable  results  -for  numerical  models,  of  ad¬ 
vanced  semiconductor  devices.  Physical  effects  such  as 
impact  ionization  and  local  carrier  heating  sensitively  de¬ 
pend  upon  the  carrier  current  density,-  which! can, be  of 
insufficient  numerical  accuracy  in  classical  numerical  mod¬ 
els.  While  these  accuracy  problems  can  limit  the  quality 
of  the  results,  possible  convergence  problems  are  of  even 
more  practical  significance. 

A  way  to  improved  accuracy  of  the  solution  is. the. use  of 
higher-order  methods.  Spectral  methods  offering  infinite- 
order  accuracy  0(c“N)  (N  is  the  number  of  degrees  of 
freedom)  are  particularly  attractive.  Recently  developed 
algorithms  demonstrated  the  applicability  of  a  spectral 
Fourier-series  discretization  to  strongly  nonlinear  semicon¬ 
ductor  device  problems  (1,  2).  Limitations  of  the  spectral 
approach  with  respect  to  possible  geometries  and  bound¬ 
ary  conditions  have  been  addressed  and  efficient  numerical 
algorithms  for  the  solution  of  the  spectral  equations  devel¬ 
oped. 


Localized  Sampling 

In  order  to  achieve  high  accuracy,  locally  fast  variation  of 
the  state  variables  has  to  be  sufficiently  resolved  by  the 
underlying  discretization  mesh.  Reasonable  solution  time 
can,  however,  only  be  maintained  by  local  refinement  of 
the  mesh.  A  general  approach  is  the  use  of  coordinate 
transformations  and  equidistant  mesh  spacing  in  the  new 
coordinate  system.  A  special  case  is  a  separable  (tensor- 
product  type)  coordinate  transformation,  in  2D: 


Figure  1  Spectral  element  discretization  based  on  Cheby- 
shev  polynomials  (left)  and  Fourier  (cosine)  series  (right) 


their  series  representations  to  exponential  convergence  re¬ 
quirements). 

To  alleviate  the  restrictions  imposed  by  the  rectangu¬ 
lar  tensor-product  mesh,  spectral  element  discretization 
is  under  consideration.  The  standard  method  is  to  use 
nonequidistant  Chebyshev  polynomial  (Fig.  1,  left).  The 
associated  matrices  are,  however,  not  optimal  for  a  nu¬ 
merical  solution.  The  space  domain  matrix  exhibits  an 
algebraic  singularity  at  the  boundaries  (3],  whereas  the 
spectral  domain  matrix  is  ill-conditioned  (4). 

An  approach  using  overlapping  high  order  elements 
using  cosine  basis  functions  and  coupling  the  elements 
through  source  terms  controlled  by  tHe  function  differ¬ 
ence  between  both  elements  (Fig.  1,  right)  appears  to  be 
promising.  The  associated  spatial  domain  matrices  possess 
advantageous  properties.  Preliminary  one-dimensional  re¬ 
sults  show  the  feasibility  of  this  approach. 


Spectral  accuracy  is  assured  by  using  infinitely  differ¬ 
entiable  mapping  functions  t}  6  Cw  (i.e.  by  subjecting 

•j>re*enl/jr  «f.  Technology  Modeling  Associate#,  Palo  Alto,  Cali¬ 
fornia,  USA 


Solving  the  Equations 

The  algorithm  used  for  the  solution  of  the  discrete  equa¬ 
tions  is  of  central  importance  for  the  cpu-time  and  memory 
requirements  of  the  method.  In  general,  direct  methods 
(Gaussian  elimination)  are  not  feasible,  since  the  system 
matrix  of  the  spectral  equations  is  not  sparse. 
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Figure  2:  Net  doping  concentration  in  the  JFET. 

.Iterative  techniques  such  as  incomplete- LU-decompo- 
sition  can  be  applied  to  the  spectral-domain  equations 
taking  advantage  of  their  diagonal  dominance  and  well- 
conditioning  (2).  Alternatively,  iterative  algorithms  based 
on  space-domain  preconditioning  of  the  spectral  equations 
by  a  finite-difference  operator  [5]  can  be  used.  This  hybrid 
spatially  preconditioned  spectral  solution  is  attractive  es^ 
pecially  for  complex  multidimensional  device  models. 

Application 

An  application  example  is  provided  by  a  p-channel  JFET, 
Lcatc  =  1.5pm.  A  drain-voltage  of  -0.1V  and  a  gate- 
voltage  of  +0.1V  is. applied.  Fig.  2. shows  the  doping 
profile (p-arca:  lxl0lscm_3,n-arca:  3xl015cm_3),  Fig.  3 
shows  the  space  charge  density  and  Fig.  4  the  hole  current 
density  (jmar  -  3.3A/cm”3).  Contacts  were  defined  in 
the  two  corners  of  the  device  (0,0)  and  (0,£,)  and  at 
(Lt,Ly/2)  (i.c.  source,  drain  and  gate  respectively)  using 
the  distributed  voltage-controlled  current  source  model  (2). 
The  simulation  was  performed  on  a  32  x  64  point  non- 
equidistant  grid  covering  an  area  of  2pm  X  4pm. 


F-'gure  4:  Hole  current  density. 


Summary 

Basic  functionality  of  the  method  has  been  demonstrated 
in  previous  work.  Current  efforts  address  localized  mesh¬ 
ing  via  high-order  element  patching,  effective  iterative  so¬ 
lution  of  the  spectral  equations  and  refinement  of  the  phys¬ 
ical  models  (energy  balance  equations)  taking  advantage  of 
the  method’s  high  numerical  accuracy.  Comparison  of  the 
method’s  properties  to  those  of  classical  low-order  space- 
domain  methods  is  being  carried  out. 
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GENERALISATIONS  OF  THE  BOX  METHOD  WITH 
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In  this  paper  w’ef present  the  box 'method •«  »  mixed  finite  element 
method  with  a  suitable  quadrature.  This  interpretation  provides' a 
way  of  defining  a  continuous  current  density  for  the  box  method.  Also, 
in  this  way,  higher  order  box  methods  can  be  derived  which  are  suit¬ 
able  for  application  to  the  discretisation  of  the  semiconductor  device 
problem. 


V.J  =  fl(u).  (1) 

J  =o7«.  ’  (2) 

on  the  region  ft  C  jR1,  with  suitable' boundary  conditions  Remark 
that  Poisson’s  equation  as  well  as  the  continuity  equat:ons  for  holes 
and  electrons  can  be  .written  in  the  above  form.. 

The  mixed*  wiationaf  formulation  of  the  problem  (I)*(2)  is;  find 
(u,  J)  €  L*(n)  X  H(div,  ft)  such  that 


1.  INTRODUCTION 


The  box  method, is  the  most  widely t used  discretisation  method  for 
the  semiconductor  device  problem.  The  main  reason  for  this  is  that  it 
yields  discrete  current  conservation  and  utilizes  ’upwinding’  by  using 
the  Scharfetter-Gummel  expressions  for  the  current  densities.  Another 
reason  for  its  use  is  that  solutions  of  the  resulting  disc'rete  systems  sat¬ 
isfy  the  maximum  principle  (positive  carrier  concentrations  !)  if  the 
underlying  triangular  mesh  is  of  Delauneyrtype.  iA  drawback  of  the 
method  is  that  it  does  not  yield  a  continuous  expression  for  the  electric 
field  and  the  current  densities,  only  components  along  the  edges  of  the 
meshes  are  given.  Because  of  the  latter,  one  could  be  tempted  to  think 
about  the  application  of  finite  element  methods  to  the  discretisation 
of  the  semiconductor  problem.  Unfortunately,  it  is  wellknown  that 
ordinary  finite  element  methods,  although  frequently  used  in  other 
disciplines,  do  not  possess  any  of  the  properties  listed  above  (although 
’upwinding*  can  be  achieved,' for  example*,  by  using  the  streamline  up¬ 
wind  methods  proposed  by  Hughes  and  Brooks,  cf.  (I)). 

Recently,  mixed  finite  element  methods  have  been  proposed  for  the  dis¬ 
cretisation  of  the  semiconductor  device  problem  (cf.  [2]),  These  meth¬ 
ods  differ  from  ordinary  finite  element  methods  in  several  respects  bat, 
most  importantly,  they  yield  Scharfetter-Gummel  type  expressions  for 
the  current  densities  in  a  natural  way  and  resulting  solutions  satisfy 
discrete  current  conservation.  Unfortunately,  the  mixed  finite  element 
methodj  proposed  have  been  shown  to  be  rather  unstable  (cf.  (3)). 
For  example*  the  solution  of  Poisson’s  equation  will  lead  to  un physical 
oscillations  in  the  electric  potential  even  for  triangulations  without 
any  obtuse  angle.  The  same  holds  for  the  solutions  of  the  continuity 
equations  whenever  a  non- zero  recombination  term  is  present.  Fur¬ 
thermore,  if  obtuse  angles  occur,  the  method  will  always  be  unstable, 
even  when  the  underlying  mesh  is  cf  Delauney-type.  Although  at¬ 
tempts  have  been  undertaken  to  remedy  this  situation  (cf.  (3]  where 
quadrature  rules'  are  suggested;  and  (4)  where  new  elements  are  intro¬ 
duced),  the  latter  problem  has  not  been  resolved  so  far. 

In  the  following  sections  we  present  a  class  of  finite  element 

methods  which  can  be  used  on  Delauney-type  tnau^ular  meshes.  The 
box  method  is  shown  to  be  equivalent  to  the  lowest-order  element  in 
class.  Because  of  this,  a  recipe  can  be  given  for  producing  a  contin¬ 
uous  expression  for  the  electric  field  and  the  current  densities  from  the 
solutions  obtained  with  the  box  discretisation.  Furthermore,  higher 
order  extensions  of  the  box  method  are  immediate  consequences  of 
this  new  class  of  mixed  finite  element  methods. 


E  SEMIC9JSDJI010R.PR0BLSM 


We  consider  the  following  system  of  equations. 


/  In  *  .  V^iJ(n)  (3) 

J  fcf'j'tSbB-rfftV-tto  Vr  £  H(d!vtn)  (4) 

n)  =  {.  e  (t*(n))5|v  • .  e  i!(n)} 


It  is  wellknown  that  the  problem  (3)-(4)  has  a  unique  solution  (u,’J)  e 
L5(ft)  x  H(dtv,Cl)t  which  is  the  weak  solution  of  (1)*(2). 

The  mixed  discretisation  now  consists  of  vhoosing  suitable  finite  di¬ 
mensional  subspaces  V\  c  L*(0)  and  W*  c  ff(div,ft)  and  to  restrict 
the  problem  (3)-(4)  to  these  subspaces;  find  (u*,  J*)  €  V*  x  W*  such 
that 

J  f  W-MO  =  {  fahR{uk)iO  v*  6  V,  (5), 

f  faa-ili,  ■  n, do  =  -  f  jf  u,V  ■  r hdO  Vr.  £  IV,  (6) 

The  remaining  problem  is  to  construct  suitable  subspaces  V*  and  W\. 
The  Brezzi-Babuska  conditions  (cf.  (5])  give  sufficient  (but  not  neces¬ 
sary)  conditions  on  these  subspaces  m  order  to  guarantee  uniqueness 
of  the  solution  of  (5)-(6).  For  triangular  meshes,  a  class  of  finite  el¬ 
ement  spaces  have  been  proposed  which  satisfy  these  conditions,  the  ■ 
so-called  Ranatt-Tfiomat  elements  (cf.  [6]).  The  latter  are  v  idely 
used  in  the  context  of  mixed  finite  element  discretisations. 


3  EQUIVALENCE  OF  THE  BOX  METHOD 

AHP_A,LQW,QRPBR  MIXED-EEM 


In  view  of  the  foregoing  we  now  assume  that  the  region  ft  has  been 
triangulated  and  that  the  mesh  is  of  Delauney-type  Because  of  the 
latter  assumption  we  can  associate,  with  each  mesh  point,  a  so-called 
box  which  is  constructed  by  intersecting  the  midperpendiculars  of  the 
edges  of  the  triangles  These  boxes  are  sometimes  termed  the  Voronot 
polygon.  Using  these1  we  construct  a  new  triarigulation  IT*  of  the 
domain  0.  divide  each  of  the.Voronoi-polygons  into  triangles  m  such 
a  way  that  the  vertices  of  these  triangles  are  the  centre  of  the  polygon 
and  two  neighbouring  extremal  points  on  the  boundary  of  the  polygon 
Thus,  two  of  the  three  vertices  of  these  new  triangles  are  points  of 
intersection  of  midperpendiculars. 

We  now  consider  the  mixed  finite  element  method  which  makes  use  of 
the  lowest-order  RT-elements  for  the  approximation  of  the  fields  and 
current  densities  on  the  triangulation  Vl\3  and  of  the  piecewise  constant 
approximations  of  the  potentials  on  a  box  In  order  to  analyse  this 
method,  we  introduce  the  following  notation.  For  each  mesh  point 
(*<»  V») we  *et  be  the  box  around  it  and  T*,  k  =  l,...,n<  the  triangles 
of  0*  which  are  contained  in  Bt  Thus,  we  have  that  II*  =  U< uJ^j  T*k 
We  define 
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Vk  =  {«j.  a  coast  «at  co  each  box} 

Thb  space  b  spanned  by  tie  basb  fiactfcos  wfckh  are  1  oo  B,  and 
0  cm  «H  other  boxes.  Remark  that  these  febsriocs  are  not  coctbmoos 
orer  tbe  box  edges.  Furthermore,  we  let  W*  be  the  space  of  fewest- 
order  Raviart-Thomas  ekmesU  oa  tbe  triangles  of  II*,  Le.  W*  b  tie 
space  spanned  by  tbe  rector  basis  fonctioes  iy,  which  hare  a  constant 
normal  compooeut  equal  to  1  afeegedge  j  of  *  triangle  in  IT*  and  sero 
normal  compooent  along  each  of  tie  other  edges.  Thus,  the  support 
of  Tj  extends  crrez  two  triangles.  On  each  of  these  triangles,  r,  b  of  the 
form 

T  [x  y)  =  —=!i —  [  x"  S*V)  } 

J*  ,y'  2arca[T)  ^  jr-fcffl  ) 

in  which  (ar§yjf  y,^)  is  the  vertex  of  the  triangle  T  opposite  the  edge  jf 
and  lj  tbe  length  of  edge  j. 

We  will  now  show  that  the  box  method  b  equivalent  to  this  mixed 
finite  element  method  when  a  suitable  quadrature  b  chosen.  To  this 
end,  set  up  the  mixed  FEM  equation  of  the  form  (5)  for  the  box  Bf. 


Using  Green’s  theorem,  the  fact  that  u*  =  u,  on  B(,  and  the  property 
of  the  lowest-order  RT-elements  that  their  normal  components  along 
an  arbitrary  line  are  constant,  we  obtain: 

ill 

where  1*  is  the  length  of  the  edge  of  T*  which  coincides  with  the  outer 
edge  of  the  box  B,  and  J*''"4  b  the  normal  component  of  J*  along 
that  edge. 

Next  we  use  equation  (6)  for  the  t}  which  correspond  to  edges  which 
coincide  with  th*  outer  edges  of  B,.  For  each  triangle T*  €  B,  there  b 
(except  at  the  boundary)  another  triangle  T?  €  B.-  which,  together, 
form  the  support  of  t}.  Then  we  bare: 

The  latter  integral  can  be  shown  to  be  equal  to 

fi  (.*-*») 

3  A.V.--V.  ) 

Since  T*  and  are  congruent,  it  then  follows  that 

The  treatment  of  the  right  hand  side  of  (6)  b  slightly  more  compli¬ 
cated.  Since  the  Tj  have  constant  divergence  on  each  triangle,  we  may 
replace  the  right  hand  side  of  the  weak  formulation: 

where  i*  and  are  the  centres  of  gravity  of  T*  and  T*f,  respectively. 

Using  interpol'  "ion  we  obtain: 

/  bw uV  ■ ,,d0  ~  ‘i  (“i1* R)  ■  w» 

Thus,  the  right  hand  side  of  (6)  can  be  approximated  by 
s 

Combining  the  obtained  approximations,  we  finally  get. 

jkjid  _  ty  —  ... 

‘ 


Equation  (8)  and  (9)  yidd  the  bcx  scheme  dbcrecbarioo  I  Thus, 
we  hare  shown  that  the  latter  can  be  obCamrJ  by  apprcxhnatrtg  the 
integral:  sa  the  lowest  order  ETortbod  described  above. 


The  equivalence  of  tbe  box  method  and  k  low-oedex  mixed  Shite  el- 
emeat  metLsd  opens  powhEto  for  obtaining  cxpcesriocs  for  the 
fields  and  current  densities  inside  the  eJeoents  (remember  that  the 
box  method  only  prorides  coccpooeats  of  these  oa  the  edges).  We  will 
describe,  two  ways  of  doing  this. 

We  can  use  equations  (5)  and  (6)  for  the  test  function?  which  bare  not 
yet  teen  used  in  the  above.  More  spedScally:  for  each  box  B„  we  can 
set  up  a  system  of  n*  equations  for  the  n,-  remaining  unknowns,  namely 
the  norma]  compooents  of  J*  along'  the  inner  edges  of  the  box.  In  tbe 
resulting  set  of  equations,  tbe  values  of  the  components  of  Jvafeng  the 
outer  edges  of  B%  occur,  for  winch  we  can  substitute  expression  (9), 
as  well  as  the  values  u,  which  Have  already  been  determined.  Remark 
that  these  calculations  can  all  be  doe  locally,  be.  thb  can  be  consid¬ 
ered  as  a  postprocessing  exerrise.  Having  obtained  the  values  for  the 
m  remaining  unknowns  for  the  box  B,,  we  can  give  an  expression  for. 
J*  inside  the  box:  thus  we  have  produced  an  3(div-,Q}- function  (with 
continuous  normal  components  over  the  edges  of  the  triangles  7f). 
Thb  b  important  for  adaptive  runs,  or  for  applications  in  which  tbe 
fields  and/or  tbe  current  densities  are  coefficients  in  another  equation 
(e.g.  the  temperature  equation,  or  the  hydrodynamic  equations).  In 
the  lecture  we  will  give  an  example  of  this. 

Tbe  method  just  described  for  obtaining  the  fields  and  current  densi¬ 
ties  inside  the  box  does  guarantee  current  conservation  on  the  boxes, 
but  not  on  the  triangles  of  IT*.  The  latter  can  be  achieved  by  con¬ 
sidering  a  mixed  finite  element  method  on  each  box  Blt  in  which  we 
now  introduce  piecewise  constant  potentials  on  the  triangles  of  IT*. 
This,  there  are  2n,  remaining  unknowns  per  box.  In  order  to  have 
a  well-determined  system  of  equations  for  these,  we  impose  the  extra 
restriction  that  the  average  of  the  newly  introduced  u*  (value  of  po¬ 
tential  on  T*)  b  equal  to  the  value  u,  calculated  in  the  centre  of  the 
box: 

area[B,) 

Thb  second  way  of  interpolating  the  results  obtained  by  the  box 
method  does  guarantee  current  conservation  on  the  triangles  of  II*, 
and  yields  another  approximation  for  the  potentials. 
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Abstract  -  Solring  the  system  the  stationary  semi¬ 
conductor,  device,  eqnations  in.  three-  dimensions  one  is 
strongly  interested  to  rbiuce  the  complexity  of  the  pro-: 
blem.  An  approximate  decoupling  of  the  electrostatic  po¬ 
tential  and  the  carrier  densities,  motivated  by  singular, 
perturbation  theory,  was  introduced  in  (ij-  This  reduces 
the  problem  to  a  scalar  one  and  a  two  times  two  system. 
In  many  (majority  carrier)  situations  the  changes  in  the 
minorities  are  mostly  determined  by  the  changes  cf  the 
majorities  -  so  that  the  problem  is  reduced  to  scalar'pro- 
blems.  "Typical  convergence  rates  are  0.2, .. .,0.02  i u  vases 
where  Gummd’s  method  shows  slow  convergence  (MOS- 
FET'with  current  but  not  .very  strong  recombination). 

I.  APPROXIMATION  FOR  VAN  ROOSBROECK’S 
EQUATIONS 

The  simplest  version  of  van  Roosbrocck’s  equations  still 
fitting  our  purposes  will  be  used  -  with  a  convenient  sca¬ 
ling  one  arrives  at  the  following  system  of  Poisson’s  equa¬ 
tion  and  continuity  equations  for  electrons  and  holes: 

— Au=  /-n  +  p, 

-V  -  Jn  =  —R,  Jn  =  Vn  —  nVu, 

V  •  Jp  =  —R1  Jp  =  — Vp  —  pVu  in  G ,  (1) 

u  =  u°,  7i  ==  7i°,  p  =  p°  on  r0, 

a(u-ul)  =  i/- J,'  =  0,*  =  n,p  on  Tj. 

Here:  G  C  Rdt  1  <  d  <  3,  is  a  Lipschitzian  domain 

with  boundary  r  =  IoUTi,  To,  Ti  arc  disjoint,  To  is 
closed  and  has  a  positive  surface  measure,  v  is  the  outer 
unit  normal  at  any  point  of  T;  the  unknown  functions 
u,n,p  represent  the  electrostatic  potential,  the  densities 
of  electrons  and  holes,  respectively;  Jn  and  Jp  denote 
the  electron  and  hole  current  density;  R  the  recombina¬ 
tion,  generation  ratio;  /  is  a  given  density  of  impurities; 
u°,  n°,  p°  represent  boundary  values  at  ohmic  (Dirich- 
let)  contacts,  u1  ond  a  arc  given  functions  modelling 
gate  contacts.  Einstein  relations,  relating  mobilities  and 
diffusion  coefficients  by  a  constant  factor,  arc  assumed, 
too.  bet 

Au  =  f,  +p-n,  (/i,ft):=(/,/i)  +  j au'hdVi, 

B(u)n  =  C(u)p  =  -R  (2) 

be  the  discrete  .version  of  (1)  with  h  as  test  function  of 
the  discretization  Let  ( U ,  N,  P)  be  a  given  approximate 


solution  of  (2)  and  new  variables  x,  y,  z  be  introduced  .via 
uzzU  +  x,  »  =  Af(l  +  x  —  y)t  p  =  P(lHKr  —  x),  so 

A»  =  /j  -fp-n, 

B(U) n  +  #(*)(«- *0  = 

-  R(NiP)-RnN(x-y)-RrP{z-x)t 

C(U)p  +  Cm{P)(u-V)  = 

-  R{N,  P)  -  RnN(x  -y)-  RpP(z  -  z) 

is  the  corresponding  Newton  linearized  system.  Motivated 
by  [2]  and  the  connected  convergence  problems  in  (lj  an 
algorithm  was  suggested  that  should  be  understood  as 
preconditioner  decoupling  the  electrostatic  potential  and 
the  carrier  densities  (with  (B'„  —  B’u  —  RpPt  B  =  B  + 
R»,  C'u  =  C'u  +  RnN,  C  ~C  +  Rp,  £  =  P1,  £=  Nl): 

(A  +  P  +  N)z-Ny-Pz  =  fy*P-!L-AV 

=  Ft+z-E3 

B'mx  +  BNx -  BNy -f  RpPz  =  -BN.~  R(tf, P), 

C'x  -  CPx  +  CPz  -  Rn Ny  =  -CR  +  R(N,  P). 

Choosing  A"1  as  an  approximate  inverse  of  A  +  N  +  P 
one  has  the  following  iteration  scheme  : 

(B'uA“l  +  B{NA~l  -  E))Nxf+l  + 

((D'.  +  II N)  A-'  +  Rr)Pz‘*'  =  (3) 

-DM.  -  R{N,  P )  -  (K  +  Bl V)/, , 

((Cl-CP)A-'-R„)Dy^'A- 

{C'.A-1  +  C(B  -  PA-'flPz**1  = 

-CE.+  R(N,P)  —  (Cl  —CP)fi,  (4) 

(A  +  P  +  N)  ii+l  = 

(i?i+£-Z£+W+l +  i>2i+1),  i°  =  o.  (5) 

A  is  chosen  as  A  =  a  diag(A)  -f  N+P,  a  damping  param¬ 
eter.  o  =  0  corresponds  to  singular  perturbation  theory, 
for  o  >  1  holds  <  (A  +  N  +  PjJJ1  and  a  -*  oo  is 
a  modified  linearized  Gummel  scheme.  So  in  some  neigh¬ 
bourhood  of  the  thermodynamical  equilibrium  one  has  an 
’interpolation’  between  two  convergent  (in  the  continuous 
case)  methods,  a  is  used  to  approximate  a  characteristic 
eigenvalue  (a  )  of  A  in  those  (small)  parts  of  the  domain 
with  n  -f  p  <  |/i  4*  p  -  n|  -  or  with  other  words  in  parts 
where  the  singular  perturbed  ansatz  is  violated  and  n  +  p 
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•  Figurcl:  Drain  and  bulk'currenV^iifravalanche  genera- 

.  tion,  Iq  =  l.e  —  8A,  Ubnlk  —  Uto%r  “‘QVtUgate  =  2Vt 

t  /i  =  /i(n,p) 

| 

1  should  not  be  compared  with  0  instead  of  a.  (5)  gua- 

)  rantees  the  same  smoothness  for  x  as  for  u  and  should 

.  not  be  simplified, 

j  II.  NUMERICAL  RESULTS 

{  The  system  (3)  to  (5)  has  been  implemented  in  ToSCA  (2d 
I  device  simulation  program  of  the  Karl-WeierstraB-Institut 

j  fur  Malhematik)  in  the  following  manner:  The  remaining 

|  two  times  two  system  was  solved  by  simple  block  forward 
backward  substitution  and  one  iteration  starting  with  the 
majority  carrier  density  and  unmodified  diagonal  blocks 
B'uA-l+B(NA“x-E)t  C'MA-l+C(E-PA‘l):  In  cases 
without  recombination  and' negative  /  positive  definite 
K  /  (valid  for  special  discretizations)  this  blocks  arc 
regular.  Not  any  density  or  field  dependence  of  the  mobi¬ 
lities  was  included  in  the  Jacobian  matrices  and  the  tests 
have  been  performed  for  different  MOSFETs.  Results  for 
an  avalanche  breakdown  are  shown  here.  Figure  1  showes 
the  current  voltage  characteristics.  Figures  2,3  give  an 
impression  of  the  influence  of  the  damping  parameter  a 
at  various  bias  conditions.  Figure  4  illustrates  the  weak 
recombination  coupling  even  at  the  highest  ii\jcction  levels 
!  and  supports  the  diagonalization  of  the  , possibly  two.ite- 
'  ration  processes.  At  lower  current  levels  one  iteration  for 
'  solving  (3)  to  (5)  is  optimal,  in  the  avalanche  region  the 

(  minority  carrier  density  must  be  calculated  correctly  and 

j  the  singular  perturbed  ansatz  is  violated.  Together  with 

!  convergence  acceleration  three  or  four  iterations  should  be 

,  sufficient  to  reach  the  error  reduction  defined  by  the  outer 
'  approximate  Newton  process. 
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Figure2:  Convergence. ratio  IgA,A  =  max(—^U')  of 
the  ele.  potential,  Uo  =  10 ~*UT,  Ub*ik  =  Uto*T;  =  « 

if  drain  =  0.86, 1.9,3.3V’,  Ugtttc  =  2  V,ti  =  /t(»,p),  damp¬ 
ing  parameter  a  =  0.05 


Figurc3:  Convergence  ratio  IgA  of  the  ele.  po¬ 
tential,  Uo  =  10  ~*UT,  Ifyulfc  =  Utour  —  0V, 
U drain  =  0.86, 1.9, 3.3^,  Ugau  =  2V,  p  =  p(n,p ),  damp¬ 
ing  parameter  a  =  100 


Figuic4:  Convergence  ratios:  inner  iteration  process  for 
holes  and  eletrons  (rcc.-it, holes)  and  outer  one  for  the 
ele,  potential  (damp=0.5)),  I/tuik  —  Uiovr  = 

Uir. h.  =  0.86, 1.9,3.3V,  Usate  =  2V,  /*  =  /i(n,p),  dam¬ 
ping  parameter  a  —  0.5 
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Abstract 

Our  device  simulator  MINIMOS  has  been  .used  for  the 
numerical  analysis  of  three-dimensionalnon-planar  sil¬ 
icon  MOSFET  and  GaAs  MESFET  structures.  Here 
we  present  an  extension  of  the  program  for  the  simu¬ 
lation  of  transient  effects.  This  version  of  MINIMOS 
has  further  been  enhanced  by  a  new,  highly  accurate 
current  integration  method. 

The  computational  complexity  of  three-dimensional 
transient  simulations  is  tackled  by  preconditioned  it¬ 
erative-methods.  We  present  efficient  algorithms  and 
their  implementation  for  the  solution  of  the  large  lin¬ 
ear  systems  of  equations  on  vector  and  parallel  com¬ 
puters. 


1  Transient  Simulation 

Three-dimensional  transient  simulation  of  MOSFET 
structures  is  necessary  to  analyse  both  the  influence  of 
time  dependent  physical  quantities  such  as  the  recom¬ 
bination  rate,  and  .three-dimensionalinon-planar.  ge¬ 
ometries  such  as  the  field-oxide  transition.  The  current 
continuity  equations  are  discretized  in  space  by  the 
Scharfetter-Gummel  method  and  in  time  by  the  fully 
implicit  (backward  Euler)  method.  Time-step  control 
is  based  on  the  functional 

I  =  J  ((n,+1  -  »*)  ln-j-p-  +  (pi+1  -  p')  ln^- 

ft 

+  5(gtad(tii+1 

The  time-step  tj+i  is  chosen  such  that 

2±i  =  6-1 

Ti 

remains  bounded.  For  the  solution  of  the  device 
equations  the  decoupled  (Gummel)  algorithm  is  used. 
The  convergence  of  Gummel’s  algorithm  in  the  linear 
regime  is. accelerated  effectively  by  least  squares  ex¬ 
trapolation  for  the.  update  of  the  electric  potential 


Transient  simulation  is  an  important  method  for  the 
analysis  of  physical  effects, such,  as  e.g.the  kinetjc  of 
deep  traps  in  the,  semi-insulating  substrates  of  GaAs 
MESFETs  [3){4],  and  the  simulation  of  the  charge¬ 
pumping  experiment  (interface  trap  kinetic).  The  deep 
trap  model  in  GaAs  for  the  donor  trap  rate-equation 
is  given  by 


8  (Nt  -  N+) 

at 


—  Rn  ““  Rp 


in  which  the  effective  recombination  rates  R„lP  are 


•R*  =  C„N}n-en(NT-Nf) 
4  =  Cp  (Nt  -  N}fp  -  epN} 


In  these  equations  Nt  denotes  the  total  and  N£  the 
density  of  the  electrically  active  deep  donors.  C„,p  are 
capture  coefficients,  and  c„lP  are  the  emission  rates  (3). 
An  equivalent  formulation  holds  for  the  acceptor  traps. 
For  the  simulation  of  the.  charge-pumping  experiment 
in  silicon  MOSFETs-we  use  a  model  for  the  interface 
trap  kinetic  given  in. (2).  Assuming  the. acceptor  type 
of  the  interface  traps  with  density  Dj,  the  time  de¬ 
pendent  charge-pumping  current  is  obtained  by  inte¬ 
gration  with  respect  to  the  gate-oxide  surface  and  the 
time  (period  length  To).  The  falling  pulse  slope  is  as¬ 
sumed  to  start  at  t  =  t1: 


t’+To 

ICP  ^  =  %CVth  /  /  NT(TMT)drdr 

*'  To 


where  Nt  (t)  is  the  non-equilibrium  part  of  the  trapped 
charge  density,  obtained  by  integration  of  the  rate 
equation 


Nt  (t)  = 
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The  time  interval  5  ( t ,  E)  is  determined  for  a  given  E 
by  the  condition  [Ep  (t  -  6)  -  E  =  0.  -All  parameters 
are  assumed  as  spatial  variables  albng  the  channel  sur; 
face.  Tn  (E)  denotes.the  trap  lifetimes  which  depend 
on  the  energy  in'.the  well-knowri.way  (6j. 

A  selfco'nsistent  transient  solution  of  these  .equations 
enables  the  simulation  of  the  charge-pumping  experi¬ 
ment.  This  facilitates  aproper'design  of  .this  .experi¬ 
ment  and  the  extraction  of  the  spatial  distribution  and 
energy. density  of  the  traps  created  by  a  hot  carrier  in¬ 
jection. 

2  Cwrentvlntegfation' 

After  i  solution' at  some  timestep  has  been-  found  a 
critical  step  is  the  terminal  current  integration.  We 
have  implemented  a  new  method  which  is  based  on 
choosing  weight  functions.ui!  foreach  terminal  15  and 
evaluating  a  volume  integral  instead  of  a  surface  inte¬ 
gral:  E.g.  for  the  electron'current  on  the  terminal  T, 
we  compute 

r„  =  J[graduil  ■  J„  -  <fl)dn 
n 

In  this  formula  J„- denotes tthe  electron  current  density 
and  R  the  recombination  rate.  The  functions  tnj,  are 
smooth' functions  on  fl.  They  have  to  suffice,  homoge¬ 
nous  Dirichlet  boundary  conditions  at  all  terminals 
Tj  yt  Ti,  riori-homogenous  constant  Dirichlet  boundary 
conditions  at  the, terminal  Ti  and  homogenous  Neu¬ 
mann  boundary  conditions  elsewhere.  To  obtain  op¬ 
timal  weight  function  for  e.g.  the  electron  current  on 
terminal  Ti  we  minimize  the  functional 

¥, .  =  /  [^(gradujj,)3  -  dfl 
n 

This  choice  is  motivated  by  the  experimental  observa¬ 
tion  that  a  high  degree  of  accuracy  for  terminal  cur¬ 
rents  is  achievable,  if  the  gradients  of  the  weight  func¬ 
tions  are  minimized  in  highly  doped  regions  of  the  de¬ 
vice  [5].  The  variation  of  the  functional  above  leads  to 
the  elliptic  partial  differential  equation 

div  (ngradtu^)  =  R 

which  is  discretized  by  the  Scharfetter-Gummel 
method  and  solved  by  the  standard  preconditioned 
conjugate  gradient  method.  For  the  deviation  currents 
the  Laplace  equation  is  solved  on  0. 

3  Implementations 

The  backward  Euler  time  discretization  in  general  in¬ 
creases  the  diagonal  dominance  of  the  linearized  dis¬ 


crete  carrier  continuity  equations,  thus  making  pre¬ 
conditioned  iterative  methods  converge  quickly.  Apart 
from  the  classical  conjugate  gradient  algorithm  (CG), 
which  is  used  to  solve  the  discrete  Poisson  and  weight 
function  equation,  we  use  the  conjugate  gradient 
squared-method  (CGS)  for  the  carrier  continuity  equa¬ 
tions. 

The.convergence  rate  on  onehand  and  the  efficiency  of 
the  implementation  on  parallel  computers  on  the  other 
is  determined  to  a  large  extent  by.  the  applied  precon¬ 
ditioner.  Incomplete  LU  factorizations  have  proven 
to  be  a  neatly  optimal  choice  on*  vector  computers. 
We  have  carried  outiyarious  implementations  oh  vec¬ 
tor  supercomputers  such  as  the  Cray-2  and  the  Fujitsu 
VP200  resulting  in  execution  speeds  of  more  than  100 
megaflops  for  the  critical  triangular  solves  of  the  ILU 
preconditioner.  This  is  achieved  by-the  hyperplane¬ 
reordering,  technique  using  list-vectors.  We  have,  in¬ 
vestigated  also  several  massively  parallelizable  precon- 
ditioners,  namely  truncated  Neumann  series'and  mul¬ 
ticolor,  incomplete  factorization  preconditioners.  Ex¬ 
periments  performed  on  a  massively  parallel  architec¬ 
ture,  the  Connection  Machine  (1),  indicate  that  an  in¬ 
complete  factorization  of  the  reduced  system- matrix 
performs  best. 
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Abstract' -  Numerical-methods'  for  '3D  steady- 
state  <and  transient  .simulation  of  semiconducr 
tor  devices  are  considered;, .These  methods,. are 
realized  in  the  devices  simulator" "ALPHA-3" > 
which  is  the -extension  of  our -previous  2D  si¬ 
mulator  "ALPHA”..' 

.1/ INTRODUCTION  ' 

The  development  of  .future,  semiconductor  de¬ 
vices  with  oubmicroh  feature' size ^requires 
the  use  of  mathematical  modeling  fortheana- 
lysis  .of  charge  transport* in  these,  devices 
and  the  estimation  of  their  electrical  cha¬ 
racteristics.  The  behaviour  of  a  semiconduc¬ 
tor  device  Is  described  by  a* nonlinear  system 
of  partial  differential  equations  (drift-dif¬ 
fusion  model). In  general- case  these  equations 
cannot  be  solved  analytically* without  very- 
hard-  simplified  assumptions  such  as  one-di¬ 
mensional  approach,  "constant'  mobilities, 
charge  neutrality,  trivial  doping  profiles 
and  so  on.  Since  for  modern  devices  these  as¬ 
sumptions  are  not  valid,  therefore  numerical 
technique  i3  the  only  tool  for  obtaining  the 
solution. 

Effective  numerical  methods  and  related 
software  for  ID  and  2D  simulation  are* deve¬ 
loped  in  the  present  time.  On  the  other  hand 
for  small  devices  3D  effects  may  be  quite  es¬ 
sential  as  first' results  of  3D  analysis- show 
[1-6] .  In  the  present  paper*  our  experience  in 
the  fiold  of  3D  simulation  is  given.  The  nu¬ 
merical  methods  for  steady-state  and  transi¬ 
ent  analysus  realised  In  the  silicon  device 
simulator  "ALPHA-3"  ore  described. 

II.  MATHEMATICAL  MODEL 

The  physical  behaviour  of  a  semiconductor 
device  Is  described  by  the  following  drift- 


diffusion  model: 

div3n  =  (p.n)+.-~ ,  (D 

dtv 3p  » -R(p,n)  —  »  (2) 

&p  =  n-p-N,  N=NJ-Na  ,  (3> 

3„  =fn  <vn-nvi/>-nv?nnie),  (4) 

dp  =-J<p(vp+pv?-pvZnnie),  (5) 

h  — «> 

3  =3n+3p+3i  (7) 


The  system  .is  written  in  normalized*  form 
and  symbols  have  their  standard,  meaning.,  The 
expressions .  for  mobilities  /<„  and  J<p 
according  to  Caugh'eyrThomas  model  are  “used. 
The  recombination  rate  R  includes  three 
mechanisms:  Shockley-Read -Hall  process.  Au¬ 
ger  process  and  the  cumulative  multiplicati¬ 
on  of  free  carriers  under  strong .electric 
field.  The  effective -intrinsic  concentration 
ntp  depending  on  the  bandgap  narrowing , is 
calculated  according  to  the  -formula'of  Slot- 
.boom  or  may  be  obtained  from  the  solution  of 
the  special  equilibrium  problem  [7j  i 

III.  NUMERICAL  METHODS  AND 
PROGRAM-  IMPLEMENTATION 

The  numerical  solution  of' drift-diffusion 
equations  isr* carried- out*  by  a- finite,  diffe¬ 
rence  method.  For  electron  and  hole  current 
components  ‘Scharfetter-Gummcl  approximations 
[8]  and  the  standard  seven-point  approxima¬ 
tion  of  Laplace  operator  arc  usod.  Steady- 
state  problems' are  solved  by  decoupled  Gum- 
mel’s  method  [9j  ,  in  which  linearized  Pois¬ 
son  equation  and  continuity  equations  are 
iterated -successively  (external  iterative 
, process)  until  the  self-consistent  solution 
is  obtained.  Each-Gummel!s  iteration  requi¬ 
res  the  solution  of  three  linear  systems  of 
elliptical  difference  equations. , For  this 
.purpose  iterative  methods  L10,  lljare  used 
(internal  iterative  process). 

Absolutely  stable  half-implicit  scheme  fl2] 
for  the  analysis  of  transient  processes  is 
used.  In  this  scheme  the  determination  of 
the  potential  of  electric  field  is  carried 
out  in  two  stages.  Firstly  the  total  current 
continuity  equation,  which  is  the  consequen¬ 
ce  of  (1)-(7),  is  solved  and  the  predictor 
for  the  potential Jnobtained,  The  final  value 
of  the  potential  is  determined  from  the  sta¬ 
bilized  Poisson  equation.  The  automatical 
time  step  selection  with  local  error  control 
is  realized. 

Above  mentioned  methods  are  implemented  in 
the  device  simulator  "ALPHA-3">  which  is  the 
extension  of  our  2D  simulator  "ALPHA"  f7j* 

CPU  time  for  one  Gummel  iteration  on  the 
grid  with  28*28x18  knots  is  approximately  3 
min  on  the  soviet  mainframe  ES-1060' (Imips). 
Depending  on  the  injection  level  from  5  to  30 
Gumel  iterations  are  reguired  for  the  calcu¬ 
lation  of  one  point  of  I-V  curve.  CPU  time 
for  one  time  stop  in  the  transient  analysis 
is  also  3  nin  and  200-300  steps  are  suffi¬ 
cient*  for  the  *  simulation  ofptransient  pro¬ 
cess  with  the  tolerance  10  c  « 
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COMPUTER  AIDED  ANALYSIS  OF  METASTABLE  DECAY 
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Abstract:  The  computer  aided  mathematical  treatments  of  two 
problems  connected  with  the  metastable  decay  of  ionized  Van 
der  Waals  (vdW)  dusters  will  be  presented  here.  One  is  the 
mathematical  treatment  and  analysis  of  sequential  decay  senes 
in  nitrogen  clusters  in  order  to  allow  the  meaningful 
determination  of  decay  rates.  Secondly,  we  developed  a 
computer  program  to  facilitate  the  correct  interpretation  of 
metastable  duster  ion  spectra  allowing  to  exclude  after 
identification  all  possible  colnddcnces. 

Introduction:  Small  atomic  and  molecular  aggregates  (dusters) 
are  of  interest  both  from  the  point  of  view  of  basic  and  applied 
sdence.  Clusters  are  seen  as  a  link  between  the  gas  phase  and 
the  condensed  state.  Van  der  Waals  and  especially  rare  gas 
dusters  arc  rather  simple  systems  and  therefore  also  accessible 
to  theoretical  treatments.  Careful  observation  and 
(mathematical)  analysis  of  metastable  decay  reactions  of 
Ionized  vdW  dusters  allows  to  draw  conclusions  about 
stability,  structure  and  internal  energy  transfer  in  these 
systems. 

Experimental  Method:  Neutral  clusters  are  produced  by 
expanding  pure  or  seeded  gas  through  a  small  nozzle  Into  a 
vacuum  chamber.  The  ensuing  supersonic  beam  passes  a 
skimmer  and  Is  crossed  10  cm  downstream  at  right  angles  by  an 
electron  beam  of  variable  energy.  Ions  thereby  produced  are 
extracted  at  right  angles  to  the  plane  of  the  neutral  cluster  beam 
and  electron  beam,  accelerated  to  a  typical  energy  of  3  keV  and 
mass  selected  In  a  high  resolution  double  focussing  mass 
spectrometer  system, 

Important  for  the  present  studies  of  metastable  dissociations  is 
the  existence  of  two  field  free  regions.  The  first  one  between  the 
end  of  the  acceleration  and  the  entrance  slit  of  the  magnetic 
field  and  the  second  field  free  region  between  the  two  sector 
fields,  A  possible  metastable  decay  of  an  Son  (mass  m^xharge 
qj)  into  (m2^2)  be  detected  either  in  the  first  field  free 
region  bv  tuning  the  magnetic  sector  field  to  a  nominal  mass 
m  «(n>2 '<te^)/(n>i/qi)  an£l  electnc  sector  field  to  a 
voltage  E*»E(m2/q2)/(mj/q|)  (with  E  the  voltage  ot  the 
electnc  analyzer  for  direct  mass  spectra)  or  In  the  second  field 
free  region  by  tuning  the  magnetic  field  to  mj/Qj  and  the 
electnc  sector  field  to  E*«E(m2/q2)/(nVqi),  In  the  present 
study  however,  we  used  an  additional  technique  to  study  the 
occurrence  of  successive  metastable  decay  series.  In  this 
alternative  mode  of  operation  it  is  possible  to  detect  a  decay  of 
(mj.qi)  into  (m^q^  In  the  first  field  free  region  followed  by  a 
sequential  metastable  decay  of  this  (m2,q2)  into  (m^qj)  in  the 
second  field  free  region  by  tuning  the  magnetic  sector  field  to  a 
nominal  mass  m*  •  (n^^V^l^l)  ^  the  electric  sector 
field  to  an  electric  field  E*«E(m3/q3)/(mj/qj). 


Results:  Magnera  et  al  /I /s recently  reported  that  N 2-cluster  ions 
do  not  only'  lose  one  monomer,' but  depending  on  the  cluster 
size  they  observed  well  structured  decay  patterns  For  example 
(N2)45+*  cluster  ions  are  loosing  with'  high  probability  2,7,12,17 
and  22  monomers,  .They  could' not  determine  whether  these 
monomers  are  lost  m  one  step  or  sequentially.  With 'the  above 
mentioned  technique  we  were  able  to  demonstrate,  that  all  of 
‘  these  decays  are  sequential  /2/. 

The  next  step  was  to  develope.a  program  which  allows  to 
evaluate  from  the  experimental  data  apparent  decay  rates  The 
following ,  set  of  coupled  dlfferental 1  rate  •  equations  are 
describing  these  decays: 

dN0(t)/dt-.koN0(t) 

dNi(t)/dt.koNo(t).kjNi(t) 

dNj(t)/dt  ■  Iq.iNuW  •  JqN\(t) 

The  solution  of  these  equations  is  very  simple  leading  to 
exponential  dependencies,  but  the  actual  calculation  of  the 
decay  rates  is  only  ■  possible  with  a  numerical  method  For 
example  for  the  (N2)s+*  the  following  decay  channels  can  be 
measured; 

5  mothcrsignal 
5- 1- >4,  5  *2*>  4, 

5  -1  ->  3,  5  *2.>  3,  5  •!■>  4  -2->  3 

5  -1>  2,  5  *2»>  ^  5  *!•>  4  -2->  2,  5  •!•>  3  -2*>  2 

5  -l->  1,  5  *2*>  2,  5  *!•>  4 .2->  1,  5  -1>  3  -2->  l.  5  -l->  2  -2*>  l 

(where  5  •!->  4  »Unds  (or  *  dtay  o(  (N^j*  Into  (Nj)|*  in  (hr  first  field  frre 
regtoev  and  5  -»•>  4  -2>>  2  SUM*  far  «  sequrntUl  decay  where  (N3>j+  drays  to 
(N2)4+  in  the  first  field  (tee  region  fcflowvd  by  a  dray  o(  this  Into 
In  lh<  second  field  free  region). 

In  general  for  a  n-mer  */2n(n+l)  experimental  data  are 
therefore  available  for  the  determination  of  n  variables.  Every 
measured  decay  channel  is  leading  to  one  exponential  equation, 
ie,  in  the  5-mer  case  leading  to  15  equations  for  only  5 
variables.  Nevertheless,  using  a  combination  of  a  Gaussian  with 
a  Monte  Carlo  method  it  was  possible  to  obtain  decay  rates, 
which  describe  the  experimental  data  very  well,  i  using  the 
derived  decay  rates  it  was  possible  to  calculate  the  ion  signals 
for  every  decay  channel  in  good  agreement  with  the 
experimental  result.  Table  1  shows  the  results  in  case  of  (^3* 

Another  problem  in  dusterphysics  is  the  occurrence  of 
coincidences.  In  direct  mass  spectra  of  analytical  chemistry 
coincidences  are  a  well  known  problem  In  particular  when  a 
sample  consists  of  many  different  hydrocarbons,  the  mass 
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spectrometric  fragmentation  patterns  of  the  orgamcal  molecules 
are  very  much,  overlapping '/3/.  The  situation -becomes  even 
worse  if  one  is  using  clusters.  Table  2  shows  as  ah  example 
some,  of  the  coincidences  of  .propane  and  mixed  C02-H20/ 
dusters  at  a  nominal  mak  of  100  Daltons. 

For  the  identification  of  metastable  peaks  (which  is  even  more 
difficult  than-direct  spectra)  in ,magnetiCv sector-fields ma»' 
spectrometer  systems  up  to  now ’a  book  by  Beynon  and 
coworkers  /4/  could  be  used  containing  a  list  of  all  possible  - 
decays  up  to  a  mass  of  500  Daltons.  Besides  the  fact  that  it  is  not 
very. convenient  to  search  for  a  specific  decay  in  this  book,. the 
maximum  mass  of  500  Daltons  is  much  too  small  for  theuse- 
with  clusters.  We  have  developed  a  hew  computer  program 
(CLUSTER)  which  allows  to  calculate  all  the  possible  decays  for 
a  certain  cluster  series  (also  mixed  clusters  and  multiply 
charged  ones)  and  to  list  them  in  a  file  or  on  a  printer.  This 
program  also  takes  into  consideration  all  the  possible 
fragmentations  of  molecular  monomers  For  Instance,  for  mixed 
CO2-H2O  dusters  there  are  9562476  possible  metastable  decays 
of  duster  ions  up  to  a  mass  of  1000  Daltons  Table  3  illustrates 
•for  several  duster  ions  possible  coincidences  as  given  on' the 
printer  readout  of  this  new  program  (CLUSTER). 
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Table  l:  Comparison  of  metasuble  fraction*  of  the  measured  decay*  of  fN’j)}** 
dusterions  and  the  calculated  data  using  the  evaluated  decay  rate*. 


Decay  channel 

•  experimental 
metastabie  fraction 

calculated 
metastable  fraction 

difference 

3  •!•>  3  »2->  2 

1.41252% 

1.41312% 

000060% 

3*1*>3*2->1. 

000674% 

000376% 

’0.00298% 

3«l*>2*2->2 

M7512  % 

847537% 

000025% 

3  •!•>  2  »2->  1 

0.00433% 

0.00161  % 

000272% 

3  *l->  X  -2->  1 

008949% 

005614% 

0  03375% 

fromfCjHj),, 

(CjH^C 

C3H4(CH^4 

CjHa(ai)2(CH3)2 

CjHjCCf^CH^ 

(012)5(013)2 


from(C02)1(H20)|. 

(CO^C 

(H20)4CO 

C02(C0)2 

H200j(OU>2 

H2OC4(OH>2 

(H20)2C3CO 


Table  3  Comadences  in  metasuWe  windows  Am  =  5  Daltons,  AE  =  5  Volt, 
, maximum  mass  1000  Daltons. 

nominal  mass  m*  *  200  Daltons,  nominal  sector  field  E  =  400  Volt: 


'A'ifOjlm  example 

AipOjO*  -!->  Ar60’ 

.  A'5W<V>-A"eP2>3: 

,  in  total  4  coinddence* 

m* 

199805 

199805 

E* 

398.829 

398.829 

(q<4):  example 

•!•>  Ar,> 

Ar^*  •!•>  Arjj2* 

In  total  2  coinddence* 

m*‘ 

200556 

202.800 

E* 

404542 

398580 

(c3H8>n:  example 

KjWW  (C3W3W 
(CjlWW  -I->  (CjHjJjCjH,* 
in  total  43  coinddence* 

m 

204.000 

201 190 

E* 

400784 

395.417 

nominal  mass  m*  ■  500  Daltons,  nominal  sector  field  E 

>  400  Volt 

Arn(°2W  example 

ArjoO*  »!•> 

Ar7(9^2*s  Ar7(®2)3^* 

in  total  1 02  cdnddtnces 

m 

501961 

504.000 

E* 

400.784 

397444 

ArnT*  (q<4):  example 

A'llT  AtlC 

f*i\3  Ai£** 

In  tceal  8  coinddence* 

m* 

499512 

503.607 

E* 

'  398.829  *  - 
402.098 

(c3H«)n:  example 

(CjlWllC*  (CsHs),Cj* 

(CjiwiAl**  •!•>  (CtfWcjH* 
lntotal96co«nddences 

V 

496000 

502.912 

E* 

399734 

.400.919 

nominal  mass  m*  ■  $00  Daltons,  nominal  sector  field  E* 

a  400  Volt 

A,nl°2W  example 

Art$(02>204  AtijOj* 

m* 

800.000 

E* 

403690 

IntotaUOcanddmce* 

ArnS*.(q<4);  no  coinddence*  In  this  window  found 

‘(c3Hs)n-  example  m* 

(C^,^  -2->  (C3H3),4C*  804000 

In  total  $ccindd<nce* 


Table  fc  Coincidences  of  propane,  and  mixed  COj  HjCWuster*  In.  the  direct 
spectrum  at  a  nominal  mas*  to  charge  ratio  of  100  Daltons. 
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Abstract.  We  describe  a  computer.algorithm  that 
generates  models  of  covalently  bonded  amorphous 
semiconductors  that  are  characterized  by  tetrahe- 
drally  coordinated  random  networks,  the  prototyp¬ 
ical  example  being  amorphous  silicon.  The  starting 
point  is  a  supercell  in  the  diamond  cubic  struc¬ 
ture.  Periodic  boundary  conditions,  are  imposed 
so  as  to  eliminate  surface  effects.  The  algorithm 
uses  an  elementary  topological  rearrangement  that 
is  randomly  and  progressively  introduced  into  the 
diamond  cubic  structure  until  a  random  network 
has  been  generated.  This  is  followed  by  an  anneal¬ 
ing  process  that  allows  topological  relaxation  of  the 
structure,  leading  to  a  structural  model  that  is  in 
good  agreement  with  experiment  as  determined  by 
the  density  and  radial  distribution  function.  We 
have  recently  modified  the  algorithm  so  as  to  gen¬ 
erate  networks  having  only,  or  mostly,  even  rings, 
as  would  be  expected  for  a-GaAs.  In  anticipation  of 
the  likelihood  that  a  few  odd  rings  may  remain  af¬ 
ter  the  annealing  process,  we  have  begun  a  program 
to  quantify  and  visualize  the  topological  defects. 

I.  INTRODUCTION 

Covalently  bonded  amorphous  semiconductors 
such  as  amorphous  silicon  (a-Si)  are  characterized 
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as  random  network  structures,  lldealiy  such  a  struc¬ 
ture  is  a  well. defined  network  of  nearest-neighbor 
bonds,  which  may  be  analyzed  in  topological  terms, 
for  some  purposes.  An  algorithm  has  been  de¬ 
veloped  based  on  simulated  annealing  and  bond¬ 
switching  (illustrated  in  Fig.  lj  that  successfully 
models  the  homogeneous  region  that  is  believed  to 
be  representative  of  the  bulk  material  in  a-Si  [1,2,3]. 
We  are  not  attempting  to  directly  model  the  phys¬ 
ical  proccss.of  equilibration,  hence  the  method  has 
much  in  common  with  the  method  of  simulated  an¬ 
nealing  described  by  Kirkpatrick  ci  a  I  [4].  Such  a 
model,  and  the  process  for  its  generation,  has  been 
given  the  name  sillium  [3,6].  The  topology  of  sil- 
lium  is  characterized  by  a  distribution  of  ring  sizes, 
including  a  large  fraction  of  odd  rings. 

There  is  considerable  interest  in  extending  the 
computer-modeling  of  amorphous  structures  to 
such  covalently  bonded  materials  as  GaAs;  For  such 
binary  compounds  as  GaAs  there  can  be  only  a  few 
“wrong  bonds”. (Ga-Ga  or  As- As),  and  in  the  ideal 
case  there  should  be  no  wrong  bonds  and  thus  only 
even  rings. 

There  are  algorithms  that  ensure  the  generation 
of  an  even-ring  random  network  starting  from  a  net¬ 
work  with  only  even  rings  such  as  the  diamond- 
cubic  structure,  but  the  bond  switching  mechanism 
is  a  bit  more  complicated  than  that  of  Fig.  1  and, 
what  is  more  important,  it  introduces  much  more 
strain  into  the  network.  One  of  these  methods  has 
been  described  by  Rivier  ef  al  [5].  However,  the 
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Rivier  model  ,  and  all  similar,  models  that  ;we  have 
generated  by;computer:over  a  period  of  some  five 
years  -  duffer  froim  very  Targe ,  geometric:  distortions  • 
and'are  quite1  imphysical;  '  -  '  - 

It  seems  that  the  best  prospect  for  the  computer 
generation  of  an  even-ring  model  may  he  to  return 
to  the  simplicity  of  the;original  bond-switch  of  Fig. 

I  but  modified  by  a  weighting  factor  to  bias  against 
the  creation  of  odd  rings': 'The  goal  is  the  generation 
of  an  ideahzed'mbdel’of  the  homogeneous  bulk  re¬ 
gion  of  materials  such  as  GaAs,'  Oii  the  other  hand, 
even  a  model  with ‘just  a'few  odd  rings  woiildbe  of 
interest,  as  the  bccasibnal'inclusioh  of  odd  rings  is 
not  unexpected  as  a' topological  defect  in  real  struc¬ 
tures: 

II.  SILLIUM 

The  generation  of  sillium  has  been  thoroughly 
described  in. earlier  publications  [2,3,6],  Here  we 
present  only  a  brief  description  of  the  essentisd  in¬ 
gredients. 

The  starting  point  is  a  supercell  in  the  diamond 
cubic  structure,  subject  to  periodic  boundary  con¬ 
ditions,  that  contains  enough  atoms  to  provide  a 
model  of  reasonable  size.  We  have  used  supercells 
containing  from  216  to  4096  atoms. 

The  starting  crystal  structure  is  then  disordered 
by  a  sequence  of  randomly  selected  bond  switches, 
as  illustrated  in  Fig.  1.  The  bond  switch  illustrated 
there  gives  the  simplest  topological  rearrangement 
that  preserves  tetrahedral  bonding  and  introduces 
the  minimum  possible  strain  into  the  network.  It 
also  introduces  &-  and  7-fold  rings  into  the  other¬ 
wise  perfect  diamond  cubic  structure.  This  process 
of  introducing  topological  disorder  is  continued  un¬ 
til  there  is  no  remaining  memory  of  the  original 
crystal  structure. 

The  most  stringent  test  of  any  remaining  long 
range  order  is  the  structure  factor  |S(j)|s  associated 
with  those  reciprocal  lattice  vectors  labeled  (111) 
for  the  diamond  cubic  structure  [3,7].  The  structure 
factor  is  initially  of  zero  intensity  for  most  values  of 
the  reciprocal  lattice  vectors  for-the  chosen  super- 
cell.  It  is  nonzero  only  for  some  q-values  appropri¬ 
ate  to  the  smaller  unit  cell  of  the  diamond  struc¬ 
ture.  The  introduction  of  disorder  redistributes  the 
structure  factor  among  all  of  the  q-values  of  the 
supercell.  We  require  that  random  bond  switch¬ 


ing  continue.until  the  structure  factor  is  of- roughly, 
equal  intensity -for  ail  q-values. 

Every  time"two;bonds  are  switched,,  the  struc¬ 
ture  is.relaxed  by  the  method  of  Steinhardt.ei  al  [8] 
to  the  geometrical  configuration,  that  minimizes  its 
energy  as-givemby  thelKeating  potential  [9]. 


Figure  1:  Local  rearrangement  of  bonds  used  to 
generate  random  networks  from  the  diamond  cubic 
structure,  (a)  Configuration  of  atoms  and  bonds 
in, the  diamond  cubic  structure;  and  (b),  relaxed 
configuration  of  atoms  and  bonds  after  switching 
bonds. 

After  the  network  has  been  randomized,  further 
bond  switches  are  randomly  selected  on  a  trial  ba¬ 
sis.  The  switches  are  accepted  or  rejected  accord¬ 
ing  to  the  usual  Monte  Carlo  prescription:  They 
are  accepted  if  A E  <  0;  they  are  accepted  with 
probability  p  =  exp(-AE/kT)  if  A E  >  0.  (The 
Monte  Carlo  prescription  is  actually  followed  from 
the  outset  in  most  cases.  That  means  the  original 
randomization  is  carried  out  at  a  finite  temperature 
[3]  rather  than  the  infinite  temperature  implied  by 
the  randomization  process  described  above.  This 
takes  a  bit  more  computer  time  for  the  randomiza- 
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tiomprocess  but  'leads;to'Iess'liighly.strained:rani 
domized;  starting  structures ;ahd ,  reduces,  the  total; 
time  for  generating  a  satisfactory  ;model.): 

at  frequently  happeMjduringithe;ahnealing  pro¬ 
cess  that  a  structure'will  be  trapped  in  a  metastable, 
state  from  which  no  allowed  bond  switches  will  lead', 
directly  to  a  lower  energy.  The  Boltzmann  factor 
is  essential  in  order  to  ^provide  an  escape  .mecha¬ 
nism  from  these  local  minima.  It  does  this  by  dcca- 
sionally  allowing  . a  transition  to  a  somewhat  more 
strained  state  from  which  another  path  to  lower  en¬ 
ergy  states  can  be  found. 

The  structure  is. annealed  by  generating  a  se¬ 
quence  of  networks  while  lowering  the  temperature 
in  small  increments.  Typically,  we  require  that  at 
each  temperature  there  be,  as  smany  actual  bond 
switches  as  there  are  bonds,  ..in  order  to  achieve 
equilibrium.  The  result  is  a  model  that,  for  sil¬ 
icon,  always  agrees  very  well  with  experiment  as 
determined  by  the  radial  distribution  function. 

in.  THE  TRAVELING  SALESMAN 

It  is  illuminating  that  the  algorithm  we  have  de¬ 
veloped  for  modeling  a-Si  cm,  with  trivial  modifi¬ 
cations,  be  used  to  find  a  nasonailc  solution  to  the 
traveling  salesman  problem  [10], 

The  simplest  version  of  the  traveling  salesman 
problem  is  this:  A  traveling  salesman  must  visit  N 
cities  and  return  to  his  home  base  by  the  shortest 
route.  His  possible  routes  of  travel  and  distances 
between  cities  are  given  by  a  road  map.  The  algo¬ 
rithm  then  treats  the  cities  as  atoms  and  the  roads 
between  cities  as  bonds  between  atoms.  The  energy 
in  the  Boltzmann  factor  becomes'  a  path  length  and 
the  temperature  becomes  a -  pseudo-temperature. 
(There  is,  of  course,  a  question  of  how  to  inter¬ 
pret  the  temperature  even  when  modeling  a-Si,  but 
the  connection  with  a  “real”  temperature  is  at  least 
qualitatively  and  conceptually  direct  for  that  case.) 

Unlike  the  three-dimensional  network  needed  to 
model  a-Si,  the  closed  path  connecting  N  cities  is 
two-dimensional.  This  makes  it  a  simple  matter  to 
explicitly  demonstrate  a  metastable  state. 

It  happens  that  in  the  course  of  switching  pos- 
sible -  paths  between ‘cities  a  route-is  occasionally 
found  such  that,  given  the  rules  of  the  algorithm, 
there  is  no  single  switch  in  paths  between  cities  that 
will  further  reduce  the  distance  traveled  in  a  com¬ 


plete  circuit.  Since  there ; are , N (Nrl) !/2  possible; 
routes,- if  N  is  large,  the  probability  that' the  route' 
is;the’niimimum!path  'Unessentially,  zero,  'One.  has 
..almost  certainly  found  a  metastable  state!  It, is  .easy, 
to  depict  such  a  state  graphically  [10]. 

IV..EVEN-IIING  MODELS 

•GaAs  is  the  prototypical  binary  covalent  semi¬ 
conductor.  -In  the  idealized  .amorphous  form  it  is 
assumed  to  be  characterized  by  a  random.network 
having/only  even-fold,  rings;  but,  .it  is  otherwise 
taken  to.be  equivalent  to  sillium,  that  is,  it  has  no 
dangling.bonds,  employs  periodic.boundary  condi¬ 
tions,  and  is  generated' by  the  same  simple  bond 
switching  mechanism. 

Clearly  something  must  be  added  to  the  prescrip¬ 
tion  for  generating  models  if  only  even-fold  rings  are 
wanted.  We  have  attempted  to  accomplish  this  by 
including  an  additional  term  in  the  energy  of  the 
structure  without  modifying  the  Keating  potential 
itself.  The  additional  term  is  an  effective  energy 
that  is  proportional  to  the  number  of  irreducible 
odd-rings.  In  practice,  we  have  thus  far  counted 
only  Wold  rings.  We  express  the  energy  as 

E  =  Ejecting  +  r/Rs 

where  JRS  is  the  number  of  5-fold  rings  and  q  is  a 
parameter  to  be  optimized  by  (computer)  experi¬ 
ment. 

The.only  place  that  Its  actually  enters  the  mod¬ 
eling  process  is  in  the  Boltzmann  factor.  It  biases 
against  the  introduction  of  5-fold.rings.  Put  more 
positively,  it  favors  the  elimination  of  Wold  rings 
that  are  necessarily  introduced  during  the  original 
randomization  process. 

We  have  chosen  not  to  count  7-fold  and  higher 
rings  for  two  reasons:  It  is  very  time  consuming 
to  count  large  rings;  and,  many  higher  odd-fold 
rings  are  reducible  to  Wold  rings  and  thus  disap¬ 
pear  when  the  associated  5-fold  ring  is  eliminated. 

To  understand  how  7-fold  reducible  rings  may 
disappear  when  a  Wold  ring  is  eliminated,  consider 
what  is  meant  by  a  reducible  ring.  If  there  are  two 
atoms  on  the  ring  that  are  connected  by  a  shortcut 
that  does  not  lie  on  the  ring,  the  ring  is  reducible. 
A  typical  example  of  a  reducible  ring  is  a  7-fold  ring 
made  from  a  4-fold  ring  and  a  Wold  ring  that  share 
a  common  bond.  The  outer  7-fold  ring,  consisting 


1700 


of  all  the  atoms  fromboth.the  4-fold  and  5-fold  ring, 
and  all  the  bonds  from  those  rings  except  the  one 
shared  in  common,  is  reduced  by  the  .common  bond; 
shared  by  the  4-fold  and  5-fold  ring.lFor  that;case, 
eliminating  the  5-fold  ring  by,  say,- converting  it  to;a< 
5-fold  ring,  also  eliminates  the  reducible  7-fold  ring 
by  converting  it  to  an  8-fold  ring. 

We. have  been  successful  in  eliminating'-'at'.  least- 
85%  of  the  5-fold  rings  usually  present  in  a  random 
network  structure  like  silliiim.  We  believe  it  is.pos- 
sible  to  eliminate  nearly  all  odd'rihgs  with  fiirther 
refinements  in  the  algorithm. 

V.  RIVIER  LINES 

In  anticipation  of  the  likelihood  that  all  odd  rings 
may  not  be  removed  during  the'wnealing  process, 
and  thus  that  it  would  be  desirable  to  characterise 
the  topology  of  the  structure  associated  with  the 
odd  rings,  we  have  simultaneously  begun  a  program 
to  quantify  and  visualize  the  topological  defects. 
This  program  is  being  carried  out  in  collaboration 
with  J.  Koch  and  will  be  described  in  detail  in  work 
to  be  published. 

The  essence  of  the  program  is  that  all  irreducible 
rings  in  the  structure  are  identified  and  triangu¬ 
lated.  Each  triangle  is  identified,  numbered  and 
stored  along  with  the  parity  of  the.parent  ring.  A 
cubic. grid  of  points  is  defined'with  a  mesh  scale 
finer  than  the  structure  of  the  random  network.,  A 
search  is  then  made  for  a  path  that  lies  on  the  grid 
and  which  passes  through  only  odd  rings  and  either 
makes  a  complete  circuit,  returning  to  the  starting 
point,  or  extends  to  infinity.  .The  latter  is  possible 
here  because  of  the  use  of  periodic  boundary  condi¬ 
tions.  We  refer  to  these  “odd  lines”  as  Rivier  lines 
after  their  promoter  N.  Rivier  [11], 

We  have  found  the  two  Rivier  lines  that  thread 
their  way  through  the  four  irreducible  5-fold  rings 
rind  the  four  irreducible  7-fold  rings  that  are  created 
when  a  single  pair  of  bond  switches  is  introduced 
into  the  otherwise  perfect  diamond  cubic  structure. 
These  lines  are  easily  found  by  inspection  and  thus 
make  a  good  first  test  case  of  the  correctness  of 
the  computer  algorithm.  One  of  the  lines  circles 
the’ bond  between  atoms  1  and  5  in  Fig.  lb.  The 
other  circles  the  bond  between  atoms  2  and  6  in 
Fig.  lb.  Each  Rivier  line  passes  through  two  irre¬ 
ducible  5-fold  rings  and  two  irreducible  7-fold  rings. 
We  are  now  testing  the  program  with  more  disor- 


■  dered;structures,  including  the  kinds, of  amorphous 
models  whose  generation  has  been  discussed  here. 

VL  CONCLUSION 

Historically,  the  study  of  random  structures  (e.g., 
;liquids)-has-mainly  concentrated  on  various  distri¬ 
bution  functions  that  express  geometrical  proper¬ 
ties.  In  common  with  some  recent  problems  in  mag¬ 
netism  (spin  glasses,  etc.)  the  study  of  amorphous 
semiconductors  has  brought' topological  properties 
.to-the  fore;.  While  the  search  for;  direct’probes  of 
topologicrdly.defined'quantities  has  proved  frustrat¬ 
ing-in  practice;(despite  several-good  ideaadn  prin¬ 
ciple),  the:  topological  approach  to  model-building 
andstructure  analysis  remams-a  source  of  fascinar 
tion  to  the  theorist  and  stimulation  to  the  exper¬ 
imenter.  The  work  described  here  highlights  the 
topological  aspect,  while  respecting, the  constraints 
upon  realistic  geometrical  arrangements. 
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■ABSTRACT 

The  development  of  semi-conductor  devices  involves  several 
iterations  of  trial  and  error  in  the'fabrication  until  a  specified  goal 
in  terms  of  design  conditions  is' reached.  The  application  of  device 
,  models  can  now  decrease  the  number  of  trial  and  error  steps.  The 
reflection  accompanying  the  development  will  be  more  productive 
if  the  modeling  tools  themselves  do  not  require  too  much 
attention.  We  have  developped  a  software  package  whose  aim  is 
to  be  used  as  easily  as  a  usual  word  processor. 


New  technologies  applied  to  scmi-conductor'stmctures  imply 
more  and  more  complex  behavior  related  to  bi -dimensional  effects 
and  irregular  profiles  in  miniaturized  layers.  Traditional  analysis  is 
often  based  on  analytical  expressions  derived  from  solutions  of 
simplified  models  (often  in  one  dimension),  and  so,  becomes  less 
and  less  adapted.  Numerical  simulation  appears  as  a  precious  tool 
for  analysis  and  development  or  to-day’s  electronic 
components[l],l21,l3],  its  use  definitely  is  becoming  widespread, 
however,  it  appears  that  the  closed  form  formulation  is  still  widely 
used:  two  reasons  can  be  invoked  to  explain  this: 

-<i)  the  physical  interpretation  based  on  the  influence  of  each 
observable  parameter  is  relatively  easy  to  extract  from  a  formula, 

-(ri)  the  use  of  a  software  tool  needs  a  time  investment  in 
learning  which  cannot  be  always  afforded  by, the  product 
designers. 

Keeping  in  mind  these  reasons  {difficult  man-machine 
relationship),  we  have  developed  a  software  tool  that  can  be  run 
on  a  personal  computer,  its  access  being  comparable  to  present 
word  processor.'  It  is  implemented  on  an  Apple  Macintosh™ 
computer  and  takes  advantage  of  its  editing  facilities  and  user- 
friendly  approach:  menus,  windows,  ' mouse  "click  and  point " 
operations... The  user-friendly  approach  has  to  be  developped  on 
both  sides  of  the  numerical  modeling  :  inputs  (edition  of  the 
device)  and  outputs. 

BASIC  IDEAS 

Rather  than  a  coupling  with  a  preprocessor  and  post  processor 
(as  in  traditional  procedural  device  simulators),  it  is  a  complete 
software  with  ceveral  mixed  parts :  edition  of  the  device,  2-D 
simulauon  of  diffusion  processes  and  2-D  simulation  of  electrical 
behavior  (electrostatic  potential  and  current).  Graphic  control  and 
questioning  the  computer  is  made  available  during  all  the  phases 
of  simulations.  The  basic  mechanism  is  an  'event  loop"  (as  in 
usual  productivity  software)  controlling  the  instantiation  and  the 
treatment  of  data  structures.  Object-oriented  programming  1  ideas 
have  guided  the  development  of  this  software.  Events  are  of 
different  kinds :  the  traditional  ones  such  as  general  messages  to 
windows  (mouse  clicks  to  scroll;  to  move,  to  close...),  drawing 
actions,  menu-commands,  user’s  answers  to  dialogs,  as  well  as 


,  specific  ones  such  as  the  end  ofcomputation  of  a  local  doping,  or 
the  convergence  of  the  iterative  computation  of  .potentials,  and 
charges?  %  '  '  * .  *  1  v  ‘ 


The  description  is  made  through  the  use  of  the  mouse  m 
"painting1'  the  different  zones  of  the  studied  device.  This  operauon 
is  consistent  with  the  traditional -work  done  .withdrawing 
softwares  such  as  MacDraw™,  Mac  Paint™... the  fcolors"  of  the 
palette  are  here  substrate  N,or  P,  oxide,  contact  diodes,  gates ... 
or  dopant  implantations  •'to  allow  precise  description,  scaling  is 
,  provided  and  coordinates  in  nm  are  displayed  while  the  user  is 
.drawing  or  modifying  his  zones.  The  zones  are  defined  for  the 
software,  as  polygonal  objects  with  information  on  position, 
,  geometry;  type,  flag$.J..and  with  specific  data!  All  these  objects 
■  are  chained  to  form  a  list  which  is  the  studied  device.  The  device 
is  coupled  with  the  window  in  "which  it  is  drawn  and  with  a  file 
where  it  can  be  saved  for  later  use.' 

Figure  1  shows  the  computer  screen  with  the  palette,  on  the 
left,  the  drawing  window,  and  three  result  windows  (Ust,  plot  net 
windows)  after  computation  of  a  local  distnbution  of  dopants  (sec 
next  paragraph) 


figure  1 :  The  computer  screen 


iTNs  software  has  beeq  developed  In  Think  C™  version  4  which 
supports  objetes  oriented  programming  (dose  to  C++). 


PROCESS  SIMULATION 

The  software  scans  tbe  structure  to  find  ztccs  concerned  by^ 
the  process  simulation;  pure  doping  implants  and  contacts2 .  Aw 
detailed  2-D  profile  has  to  be  evaluated  Mainly  foe  Impic!s.  It  is 
done  through  a  diffusion  process  based  on, the  numerical' 
resolution  oftbe  heat  differential  equation  on  a  rectangular  mesh. 
For  each  zone,  diffusion  parameter  values  such  as  temperature, 
lime  and  coefficients,  are  proposed  to  the  user  (by  ’dialogs’)  and 
can  be  adjusted  Tbe  software  is  able  to  build  a  finite  difference 
system  of  equations  which  models  the  diffusion  of  the  dopant  in 
[he  local  area  aroundlht  implant.  This  depends  of  course  on  the 
local  environment :  oxide,  substrate  or  boundaries...  Several 
algorithms  have  been  tested  (alternate* direction  iteration), 
incomplete  factorization  on  a  Crank-Nicfcolson  scheme  in  2-D 
(Buleev-Stonef4J)  appears  to  be  the  most  efficient,  especially 
when  the  two  dimensions  (vertical  and  horizontal’  largely  differ. 

Tbe  evolution  of  the  diffusion  is  also  shown  through  douds 
of  dots  evolving  on  the  screen  during  the  computation.  The  user 
can  also  look  for  -  a  concentration  cross-section a  list  of  values 
-  and/or  a  critical  value  at  a  mousOdicked  point 

DEVICE  SIMULATION 

A  rectangular  global  mesh  is  built  The  addition  of  all  the  local 
distributions  of  dopants  gives  the  global  distribution  When  this  is 
done,  a  new  mesh  is  computed  based  on  the  variations  from  point 
to  point,  and  interpolated  values  are  evaluated.  The  procedure  of 
re-meshing  can  also  be  launched  manually.  To  prepare  the  device 
simulation,  a  table  of  characteristics  of  each  point  is  built :  The 
characteristics  specify  the  nature  of  the  equations  corresponding  to 
each  point :  Known  voltages,  boundary  with  zero  electric  field, 
oxide  (zones  without  charges)... 

A  new  finite  difference  scheme  is  automatically  built,  indexed 
on  the  geographical  changes  in  the  device .  POISSON’S  equation 
ind  continuity  equations  are  solved.  Coefficients  for  the 
;orres ponding  'difference  equations  are  computed.  Several 
discretization  schemes  for  the  cuncnt  have  been  tested  to  tackle 
now  well-known  problems  of  consistency  (1],12),(3J,(5J 
involvedby  the  equation  of  the  cuncnlfMOCKs  stream  functions 
16],  MO  l  i fcl's  intermediate  conrinuily[7J).  Taking  advantage  of 
memory  available  even  now  on  small  computers,  large  (and 
sparse)  systems  are  built  and  solved  by  factorization  in  a  sequence 
of  iterations  according  to  the  NEWTON-RAPHSON  algorithm. 
Here,  the  GUMMELI5]  iteration  (decoupled  equations)  is  used. 
Each  simulation  of  an  electrostatic  stale  starts  from  an 
approximative  equilibrium  (to  begin),  or  from  a  preceding  state  as 
an  initial  condition,  or .  The  computation  tint?  q*nds  naturally 
on  the  number  of  points  of  the  mesh  and  the  voltage  values  given 
to  the  simulator. 

EXPLOITATION 

The  user  is  asked  to  give  potentials  for  the  contacts  and/or 
gates.  The  simulator  then  solves  the  equations  to  evaluate  the 
distribution  of  potentials  and  charges,  from  which  arc  derived  the 
currents.  A  dually  only  static  behavior  and  relatively  simple  forms 
of  equation  are  taken  into  account  (Boltzmann  statistics,  constant 
mobilities),  but  the  structure  of  the  software  should  allow  easy 
evolution. 

Numerical  simulations  often  produces  an  impressive  quantity 
of  data  from  which  it  is  difficult  to  extract  what  is  relevant  to  the 
user’s  problem.  Our  approach  offers  a  great  interactivity  at  this 
stage.  The  results  (electrostatic  potentials,  charges  ...)  arc 
accessible  cither  using  menus  and/or  through  mouse-clicks  on 
appropriate  representation :  net  representation,  lists  of  values  in 
several  Macintosh  windows  .For  instance,  a  click  on  cqui- 
potemial  curves  produces  a  new  window  with  a  crc<s  section  at 


tbe  corrcspocxErg  location  Rrsuhs  are  also  easily  tansfemKcto 
typical  word  processors  spreadsheets  or  other  traditional 
productivity  softwares. 

CONCLUSION 

This  software  tool  because  of  its  intuitive  access  aid  direct 
connections  with  desktop  work  may  be  used  in  a  great  variety  of 
situations:  besides-teaching  and  demonstrations  of  device 
behavior,  the  designers  can  "measure’  tbe  influence  of  a 
parameter,  look  at  some  bidrroena'ocal  effects,  without  having  to 
spend  boos  on  leaning  bow  to  use  the  looL  Tbe  snudare  of  tbe 
software,  allows  customization  to  take  into  account  specific 
aspects.  Some  versions  of  this  softwareI8]  have  been  used  to 
study  aspects  of  noise  in  MOS  transistors,  as  well  as 
antibioocrmig  designing  in  CCD  cellsf9]. 
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Abstract -We  present  an  accurate  circuit  simulator  for  power 
semiconductor  devils  by  using  2-D  physical  device  models  and 
hierarchical  p.  '<r  ~  of  system  equation  based  on  Newton* 

Raphsotfs  meii  d.  renting  the  SI  thyristor,  we  not  only  use 

the  circuit  simukio  analyzing"  its  turn-off  process  hot  also 
design  its  superior  c.  sectional  structure.  Final] w  we  describe 
schemes  for  accdei^ng  the  hierarchical  process!^  j  terms  of 
computcraidtitecturc 

I.  INTRODUCTION 

As  for  the  DC  analysis  of  semiconductor  devices,  it  is 
satisfactory  to  compute  only  device  modek  1 1 ).  However,  transient 
analysis  requires  to  compute  system  equations  comprising  circuit' 
equations  and  device  equations.  Sane  researchers  have  used  the 
Gummcrs  algorithm  |2]  that  simply  iterates  to  calculate  both 
equations  as  shown  in  Fig.  1  (a)  (31,  |4J.  However,  the  G'uhme! 
loop  docs  not  always  theoretically  ensure  the  convergence  of 
iterative  solution.  Then,  hierarchical  processing  has  been  presented 
for  system  equations  as  shown  in  Fig.  I  (b)  by  employing  Newton- 
Raphsorfs  method  to  circuit  equations  that  implicitly  contain  one¬ 
dimensional  ( I  -D)  physical  device  equations  [5J. 


(a)  (b) 


Fig.  I  Processing  of  system  equations,  (a)  Gummcl 
iteration;  (b)  Newton  iteration. 

We  use  the  hierarchical  processing  for  a  circuit  simulator  of 
power  semiconductor  devices  that  should  be  approximated  by  2-D 
physical  models.  2-D  physical  device  models  located  in  a  lower 
hierarchy  are  quickly  and  accurately  calculated  to  determine  the 
currents  in  circuit  models.  Thus,  wc  realize  the  circuit  simulator 
that  is  physically  as  well  as  mathematically  accurate.  Then,  we  use 
the  circuit  simulator  for  analyzing  the  tum-off  process  of  a  basic 
circuit  containing  the  static  induction  (SO  thyristor. 

II.  ALGORITHMS  OF  THE  HIERARCHICAL  PROCESSING 

Ncwton-Raphsorfs  method  yields  for  circuit  equations 

§  Msv,]  ffl(»1.v2,l„l2) 

dvi  SV2  __  (1) 

^  liT  5v2  r2(V|,v2,l„i2) 

(dvi  0V2JI  J 

where  fj,  v,,  iit  (t»l,2)  represent  a  function  of  a  circuit  equation,  a 
terminal  voltage,  and  a  terminal  current,  respectively.  Currents  a.-'d 
their  partial  differentials  that  are  needed  for  processing  cq.  (1)  are 
computed  from  terminal  currents  of  2-D  physical  models. 
Therefore,  the  processing  of  Eq.  (1)  is  hierarchical  as  shown  in  Fig. 
*»  where  top-duwn  traversal  is  done  whenever  a  trial  circuit  solution 
is  given  in  processing  Eq.  ( 1). 


*  Tobofai  University,  Sendai  980,  JAPAN 


Fig.2  Data  structure  formed  in  orocessing  Newton  iteration. 


We  use  again  Ncwton-Raphsorfs  method  for  2-D  physical  model 
equations  in  computing  device  solutions.  The  divergence  of  a 
solution  in  each  of  two  nesting  Newton  loops  is  nipped  in  the  bud, 
which  is  mathematically  allowable.'  Then,  a  converged  system 
solution  is  physically  i  erified  by  examining  whether  the  locus  of  an 
mput'butput  woriCrg  point  is  involved  in  a  respective  load  surface. 

In  order  to  succcssfully  obtain  a  converged  system  solution,  it  is 
important  toquickly  and  accurately  compute  2-D  physical  device 
models.  The  processor  is  occupied  mostly  by  matrix  equations 
whose  computing  time  depends  on  both  the  number  of  mesh  pants 
and  algorithms.  In  order  to  decrease  the  cumber  of  mesh  points, 
wc  use  an  irregular  mesh  structure  (6J.  Moreover,  wc  define  an  x/y 
differential  mesh  that  w  convenient  to  the  Scharfetter-Gummcrs 
integral  [7],  though  a  first  order  Taylor  series  is  alternatively  used  in 
the  place  where  the  electric  field  intensity  is  nearly  equal  to  zero. 
As  for  algorithms,  CG  methods  recently  arc  used  more  popularly 
(81,  (9].  Nevertheless,  it  has  not  yet  been  discussed  completely  that 
CG  methods  are  more  effective  than  SOR  methods  [10].  Hence, 
we  employ  the  SLOR  method  [  11 J.  The  simplicity  of  the  SLOR 
method  is  preserved,  though  coefficient  matrices  are  rather  complex 
due  to  both  the  irregularity  of  the  mesh  structure;  and  an 
extraordinary  boundary  condition  assumed  at  an  internal  gate  center. 

Terminal  currents  are  accurately  calculated  by  counting 
displacement  currents.  Within  a  region  of  a  device,  the  sum  of  a 
main  component  of  conduction  currents  and  that  of  displacement 
currents  takes  a  constant  value,  which  can  be  considered  as  a 
terminal  current.  Moreover,  we  partly  make  use  of  the  Kirchholfs 
low  that  mathematically  stands  among  the  terminal  currents,  because 
recombinatior/generation  rates  of  electrons  and  holes  are  assumed  to 
be  equal  each  other. 

Time  for  computing  full  periods  of  transient  operation  is  strongly 
concerned  with  time  increments  At’s.  As  for  computing  device 
solutions,  At’s  are  not  restricted  by  any  condition,  because  wc  have 
used  the  backward  Euler  method  that  is  unconditionally  stable, 
though  large  At’s  make  it  difficult  to  determine  trial  device  solutions. 
Moreover,  Eq.  ( 1)  docs  not  explicitly  contain  At.  However,  large 
At’s  tend  to  make  circuit  solutions  divergent  because  terminal 
currents  vary  drastically  by  the  small  change  of  terminal  voltages. 
Consequently,  At’s  are  detetmined  in  this  paper  so  that  the  variation 
of  a  main  terminal  current  during  (t-At,  t)  is  almost  constant,  which 
is  suitable  for  converging  both  Newton  loops. 

in.  TURN-OTF  PROCESS  OF  THE  SI  THYRISTOR 

Device  analysts  have  not  paid  much  attention  to  the  crystal 
orientation  of  wafers  to  which  process  designers  have  attached  great 
importance.  Outside  the  ohmL  region,  drift  velocities  exhibit 
amsot tuple  behavior  with  respect  to  the  orientation  ot  af  electric 
field  applied  to  silicon  crystals  in  common  environment  ( 12|,  which 
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naturally  reminds  us  of  the  plane  index  dependency  of  device 
characteristics.  Therefore,  we  basically  view  carrier  mobilities  as  2- 
D  rank  tensors  113].  Moreover,  tensor  elements  are  assumed  to 
have  separate  relationships  for  electric  field,  temperature,  and 
impurity  density,  respectively.  _  ,  .  7 

We  esc  die  circuit  simulator  for  simulating  the  turn-off  process  of 
a  baric  circuit  containing  the  SI  thyristor  [14  J.'  Computed  results 
arc  physically  as  well  as  mathematically  self-consistent  judging  from 
Fig.  3.  It  is  made  clear  from  the  simulation  that  the  SI  thyristor  is 
hard  to  cause  current  crowding  phenomena  in  turn-off  slates.  On 
the  other  hand,  die  SI  thyristor  may  base  long  tail  periods. 


Fig.  3  Locus  of  an  output  operating  point. 

This  problem  has  generally  been  avoided  by  composing  a  shorted 
anode  emitter  or  consciously  doping  lifetime  killers  in  a  base. 
However,  the  former  lacks  reverse  voltage  blocking  capabilities,  and 
the  latter  causes  large  forward  vol  tage  drops.  Thinning  bases  is  a 
fundamental  method  to  decrease  stored  holes  or  shorten  tail  periods, 
which  is  also  useful  for  making  fotward  voltage  drops  low.  The 
cross  sectional  structure  shown  in  Fig.  4  with  larger  impurity 
density  of  both  gate  center  and  anode  surface,  smaller  impurity 
density  of  n'base,  and  shorter  channel  width  should  have  shorter 
Wf.  higher  forward/ireverse  voltage  blocking  capabilities,  and  lower 
forward  voltage  drops  [  1 5  J.  The  n  base  layer  added  on  the  anode  is 
useful  for  not  only  avoiding  punch  through  phenomena  but  also 
shortening  holr/electron  lifetime  in  low  injection  level  owing  to  their 
impurity  dependencies,  which  is  reasonably  neglected  In  this  paper. 
It  is  interesting  to  investigate  the  optimum  wave  form  of  the  gate 
source  voltage  for  quick  tum-olT operations. 

IV.  CONCLUDING  REMARKS 

Many  issues  have  been  made  apparent.  A  physical  quantity  is 
defined  in  cither  differential  meshes  or  a  potential  mesh.  When  one 
of,the  two  mesh  types  is  used  In  a  finite  difference  formula 
containing  both  groups  of  quantities,  the  quantities  belonging  to  the 
other  mesh  type  should  be  modified,  which  neglects  the  definition  of 
the  mesh  types.  The  distribution  of  space  charge  density  may  be 
hard  to  converge  in  iterative  solutions  as  well  as  forward  differences 
(16),  which  is  not  critical  for  the  function  of  circuit  simulators. 
Avalanche  multiplication  phenomena  should  not  have  been  omitted. 

We  will  consider  two  ways  to  accelerate  the  processing  of  Eq. 


anode,  electrode 


Fig.  4  Superior  cross  sectional  structure  of  the  SI  thyristor. 

(1)  by  viewing  the  tree  shown  in  Fig.  2  in  terms  of  computer 
architecture.  '.One  is  to  process  in  parallel  three  leaves  that  belong  to 
the  node  of  a  trial  circuit  solution  by  using  multiprocessors,  though 
the  tree  traversal  has  been  limited  to  serial  processing  in  this  paper. 
The  other  is  to  traverse  the  tree  from  its  bottom  to  its  top,  which  is 
realized  by  building  up  database  of  device  solutions  in  advance. 
Moreover,  we  will  determine  as  long  At*s  as  possible  to  quickly 
compute  full  periods  of  transient  operations  by  investigating  the 
relation  between  two  Newton  loops  and  the  change  of  a  main 
current. 
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Abstract:  The  FEM  with  linear  shape  function  does  not 
provide  a  correct  expression  for  the  coefficients  of 
FEM  equations  at  axial  points  for  the  rotational 
symmetric  and  sultlpole  scalar  potentials.  A  correction 
of  the  coefficients  for  axial  points  Is  described. 

J.  Introduction 

FEM  is  used  for  nost  computations  of  fields  of 
electrostatic  and  Magnetic  lenses  and  deflectors  in 
electron  optics  [1 ].  High  accuracy  of  field  computation 
is  required  e.g.  for  -direct  ray  tracing  of  charged 
particles.  We  use  the  FEM  with  a  topologically  regular 
mesh  of  ssall  quadrilaterals,  subdivided  further  Into 
triangles,  where  a  linear  shape  function  is  used.  The 
resulting  matrix  has  a  simple  structure,  and  its 
solution  is  very  fast  with  the  preconditioned  conjugate 
gradient  method.  However,  in  the  two  dimensional 
computations  by  FEM  In  cylindrical  coordinates  -  (r.z), 
there  are  some  serious  problems  due  to  the  use  of  the 
linear  shape  function  in  the  evaluation  of  coefficients- 
of  FEM  equations. 

For  rotational  symmetric  magnetic  lenses,  the 
discontinuity  of  the  normal  derivative  of  the  vector 
potential  A(r,z)  on  the  triangle  boundary  can  give  a 
large  error  c.g.  In  the  case  of  high  permeability 
cylindrical  rods  on  the  axis.  The  coefficients  of  the 
FEM  equations  depend  on  the  way  in  which  the 
Integration  of  the  term  A/c  in  the  energy  functional  is 
performed.  We  have  shown  that  It- Is  possible  to  find  a 
method  for  the  evaluation  of  coefficients  which 
overcome  this  problem  I2J. 

For  the  axially  symmetric  scalar  potential 
problems  the  linear  FEM  provides  the  same  coefficients 
as  the  five  point  finite  difference  method  in 
rectangular  meshes,  except  for  the  points  on  the  axis 
I3J.  In  the  computation  of  rath  Fourier  harmonics  of  the 
scalar  potential  $a  for  deflectors  and  other  multipoles 
close  to  the  axis  is  proportional  to  r*.  and -thus 
the  shape  function  linear  In  r  does  not  provide  correct 
results.  This  Is  possible  to  overcome  by  a  formulation 
of  the  FEM  equations  for  (4).  Even  then  tho 

FEM  equations  for  the  points  on  the  axis. do  not  have 
the  required  accuracy.  The  paper  presents  a  suitable 
method  to  correct  for  this  behavior,  valid  both  for  the 
axially  symmetric  potential  b0fr,z)  and  for  the 
function  *m(r,z). 


In  electron  optics  the  axially  symmetric  lenses 
are  mostly  placed  along  the  same  axis.  The 
electrostatic  and  magnetic  deflectors  and  other 
multipoles  are-  frequently  defined  on  rotational 
symmetric  formers,  used  In  the  presence  of  rotational 
symmetric  materials  and  placed  along  the  sane  electron 
optical  axis.  For  some  of  the  frequently  usee 
geometries  of  multipoles  It  Is  possible  to  evaluate  the 
fields  of  these  multipoles  by  making  a  decomposition 
Into  individual  Fourier  harmonics.  For  each  eth 
harmonics  the  distribution  of  scalar  potential  *m(r,z) 

1  The  work  was  performed  as  a  part  of  FOM  project 
I0P-IC-DTN  45.006. 
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can  be  evaluated  as  a  two  dimensional  problem,  with 

bn  the  axis  and  on  the  boundary  far  from  the 
elements.  In  the  vicinity  of  the  z  axis 

*m(r,z)*r°[da(z)-dtJ‘(z)rz/4(w>+l)+ 

*dmw ~(z)r*/32(m*2)(it*l)-  ...J  ( 1 ) 
For  m 0  Is  *0(r.z)  the  axially  symmetric  scalar 
potential,  which  Is  not  equal  to  zero  on  the  axis.  The 
potential  Qm(r,z)  is  not  suitable  for  the  linear  FEM  If 
a>l  because  of  the  dependence  on  r  as  r*  Cl).  Instead 
of  we  define  the  FEM  for  the  potential  function 
+m(r,  z)*$m(r,  z)/r 

The  radial  derivative  of  the  new  function  </m(r,z) 
is  close  to  the  axis  proportional  to  the  distance  from 
the  axis  and  It  is  equal  to  zero  on  the  axis.  The 
linear  shape  function  overestimates  the  effect  of  the 
radial  derivative,  and  the  field  is  next  to  the  axis 
incorrect,  e.g.  for  applications  like  the  direct  ray 
tracing.  The  consequence  of  this  Is.  that  even  for 
regular  rectangular  mesh  the  coefficients  of  the  FEM 
equations  have  lower  accuracy  than  the  five  point 
finite  difference  formula  (31.  For  linear  FEM,  popular 
in  electron  optics,  the  mesh  lines  in  the  paraxial 
region  are  perpendicular  to  the  axis. 

Given  the  fora  of  the  potential  function  near 
the  axis,  a  straightforward  solution  might  be  the  use 
of  r2  as  a  new  variable  (5).  The  way  In  which  the 
finite  difference  formula  for  the  axial  points  Is 
derived  Is  also  based  on  this  approach  (61. 

The  energy  functional  for  the  nth  component  «>m 
contains  the  sum  of  squares  of  the  r  and  z  derivatives. 
For  the  energy  functional  Is 

£  'C  JI{[  (Sf*  (S“)1r2‘"  *  (2) 

with  c*(w/2)(l*6n0)p  for  magnetic  problems,  for 

electrostatic  ones  Is  permeability  p  replaced  with  the 
dielectric  constant  c,  Is  the  Kronecker  6.  The 

first  part  of  this  energy  functional  (2)  can  be 
expressed  in  terms  of  shape  function  linear  in  r  and  z 
coordinates  In  a  triangular  finite  element  ai 

“.-S,  \  »> 

where  Dl"Zl(r2r^*Zz(*'z~ri )*z3(rl~r2)  Is  twice  the 

area  of  the  triangle,  J.J*1,2,3,  are  the  values  of 
*m(r,z)  at  the  vertices,  b,«r2-r3,  c,»z3-z2;  b,.  c,  for 
1«2,3  are  obtained  by  cyclic  Interchange.  £  Is  a 
penalty  factor  equal  to  1  everywhere  except  In  the 
triangles  with  two  vertices  on  the  axis.  This  local 
penalty  factor  will  be  used  later  to  correct  for  the 
too  large  effect  of  the  radial  term.  Rn  Is  an 
expression  in  terns^oj*  coordinates  of  the  vertex 

*«-  i  i  r>k'V,v'-1''  • 

k-0  1-0 

Another  possible  expression  can  be  derived  by 
using  a  shape  function  linear  In  u»r2  and  z,  again 
without  the  last  tern  In  (2),  as 

V  51 II  4  2<£iu*.2>a*d‘]w  ,4) 

with  D2*zl(u2-a3)*z2(u;i-ul )*z3(u,-u2J,  d,«u2-u3,  d2  and 
d3  cyclically.  Un  Is  given  by  the  same  expression  as 
before,  only  using  u  Instead  of  r. 
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Fig.  1.  A  part  of  the  rectangular  aesh  next  to  the 

axis  and  its  subdivision  Into  triangles:. 

Let  us  take  a  part  of  the  fine  aesh  next  to  the 
axis  (fig.  1).  For  simplicity  we  shall  consider  a 
rectangular  aesh  with  a’ step  p  In  the  radial  direction 
and  a  step  <.ln  the  z  direction.  The  FEM  equations  are 
obtained  by  the  variation  of  the  su*  of  terms  In  (3) 
for  all  triangles'  next  to  the  central  point  with 
respect  to  potential  there  as 

V>0  [(2o+3)  2CP2]  ■  [(2b+3K2]*z+  £p2  (v>5  ♦  *,)  (5) 

or  by  variation  using  U2  as 

<2+  2p2]  -  PZ(v^5  *  ^l) 

If  we  substitute  the  expression  (1)  for  near 
the  z  axis  into  (5),  we  get 

,  .  \[r  2a+3  L  - 

^0*^0  *  (2a*3)s}*Z&z  4(0+1}  J  0 

*  <7> 
and  slallarly  substituting  Into  (6)  we  get 

Oo-^o  *  (if*  sr^)d°"  <8! 

We  can  see  that  the  error  In  (7)  Is  of  the  second  order 
in  the  aesh  size  while  the  error  In  (8)  Is  only  of  the 
fourth  order. 

If  we  now  choose  the  factor  ^•(l/4)(2o+3)/(o+l) 
for  the  triangles  with  two  vertices  on  the  axis,  and 
use  £•!  for  all  other  triangles,  we  get  the  accuracy  of 
the  fourth  order  also  for  the  points  on  the  axis, 
without  disturbing  all  the  other  features  of  FEM  using 
shape  functions  linear  In  r  and  z  coordinates. 

In  case  of  steps  <  different  on  the  left  and  right 
hand  sides  of  the  point  sonevhat  aorc  coapllcated 
expressions  are  obtained.  As  expected,  the  first  error 
tera  Is  proportional  to  the  nean  stepslze  and 
difference  in  the  stepslze.  If  only  the  subdivision 
shown  In  fig.  1  Is  used  Using  also  the  other  possible 
subdivision  by  a  diagonal  In  the  quadrilaterals  (7), 
this  error  tera  vanishes,  and  the  first  error  tera 
laproves  by  one  order  of  Magnitude  In  aesh  step.  It  Is 
therefore  necessary  to  avoid  large  change  of  aesh 
density  by  using  a  aesh  with  saoothly  varying  stepslze 

3.  An  example 

An  example  of  the  effect  of  the  correction  on  the 
field  coaputatlon  will  be  given  for  an  electrostatic 
axially  syaaetrlc  potential  for  an  arrangcaent  of  a 
staple  geometry  consisting  of  three  electrodes.  Fig.  2 
gives  the  axial  potential  calculated  in  a  rectangular 
aesh  with  different  nuaber  of  points  within  the  radius, 
with  and  without  the  correction.  The  effect  of  the 
correction  changes  the  axial  potential  In  the  opposite 
direction  than  the  analytical  solution  Is,  due  to  the 
sign  of  the  second  derivative  of  the  axial  potential 
40?z>.  If  no  correction  is  applied,  the  axial  field  at 
a  distance  of  one  aesh  line  has  an  error  of  about  10  %. 
For  aultlpoles  a  slailar  effect  on  the  axial  multlpolc 
function  dn(z)  can  be  obtained. 


Fig.  2.  Axial  potential  for  the  geoaetry  shown  In 
the  corner  (the  distance  between  the  outer 
electrodes  and  the  Inner  electrode  diameter  being  20 
km ):  The  outer  electrodes  are  at  zero  potential,  the 
central  one  has  a  unit- potential.  The ’correction  to 
the  maximum  axial- potential  .is  about  \V.  for  a  coarse 
aesh  with  4  points  between  the  electrodes. 

4, Programs 

Fast  and  accurate  field  computations  can  be 
performed  on  PCs.  Inside  the  standard  640  kbyte  aeaory 
In  DOS  we  can  use  noshes  with  up  to  8400  aesh  points 
(81.  The  field  coaputatlon  prograas  are  written  In 
Fortran  77  and  they  run  also  on  larger  machines.  They 
have  a  suitable  user  Interface  written  In  Turbo  Pascal 
for  IBM  compatible  PCs,  allowing  a  graphical  Input  and 
Modification  of  data,  autoaeshlng,  and  they  provide  the 
Input  file  for  conputations.  They  also  allow  the 
graphical  display  of  outputs  (e  g.  equipotentials, 
axial  fields,  etc)  cn  screen  or  for  hardcopy. 

5.  Conclusion 

The  advantage  In  using  the  Improved  coefficient 
coaputatlons  are  that  we  can  use  the  FEM  with  linear 
shape  function  and  staple  aesh  layout  to  obtain  fast 
and  accurate  results  Very  significant  effect  on  the 
accuracy  of  coaputatlon  brings  the  use  of  the  aultipole 
potential  function  whereas  the  other  corrections 
have  only  a  slight  effect  on  the  axial  field  value 
Although  this  effect  as  such  aay  seea  very  small  and 
visible  only  for  small  aesh  sizes.  It  Is  an  Important 
improvement  for  the  direct  ray  tracing,  c.g  with  the 
slice  aethod  for  obtaining  fields  from  aesh  potentials 
C9J. 
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Absfract-Fowler-Nordheim  tunnelingfl]  and  internal 
field  emission[2]  models  have  been  incorporated  into 
the  2-D  numerical  device  simulator  HFIELDS[3j.  These 
phenomena  are  especially  important  when  modeling  a 
range  of  devices  including  DRAM;  non-volatile  mem¬ 
ories  and  MOSFETS.  The  numerical  implementation 
of  the  equations,  the  assumptions  made  and^their  jus¬ 
tification  will  be  discussed  as  well  as  a  comparison  of 
measured  and  simulated  device  characteristics. 


I.  Introduction 

The  scaling  of  semiconductor  devices  as  the  ULSI  era  approaches 
is  becoming  increasingly  difficult.  This  is  prompting  the  need 
for  more  advanced  design  tools  to  allow  the  accurate  simula¬ 
tion  of  advanced  semiconductor  structures  prior  to  fabrication. 
More  specifically  the  incorporation  of  F-N  tunneling  and  inter¬ 
nal  field  emission  models  into  II FIELDS  allows  the  efficient  de¬ 
sign  of  EEPROM  (Electrically  Erasable  Programmable  Read 
Only  Memory)  and  flash  EEPROM  memory  cells  (Fig.I)  by 
simulating  the  transient  writing  characteristics[4)[5). 


II,  Fowler-Nordheim  Tunneling  (FN) 

The  model  of  Lenzhnger  and  Snow  has  been  used  to  implement 
this  phenomenon  where  the  tunneling  current  density  Jfn  can 
be  expressed  as; 


Jfn  = 


qm 

8xhro*x<£ 


E’«p(- 


3qtlE 


(1) 


where  the  symbols  have  their  usual  meaning.  This  is  a  1-D 
model  integrated  over  a  suitable  surface  to  calculate  the  total 
tunneling  current.  The  2-D  electric  field  values  are  chosen  so 
that  the  calculation  takes  proper  account  of  the  fringing  fields. 
The  integrating  surface  normally  lies  on  the  Si/SiOj  boundary 
where  the  local  field  value  is  used  for  the  calculation.  This 
should  be  accurate  as  long  as  there  is  no  significant  charge 
trapped  in  the  oxide  bulk,  which  is  the  case  in  a  non-degraded 
device.  Charges  trapped  at  the  interface  do  not  effect  the  ac¬ 
curacy  of  the  model.  Furthermore,  different  coefficients  are 
selected  for  the  FN  equation  depending  on  whether  the  emit¬ 
ting  surface  is  mono  or  polycrystaline  silicon.  In  the  case  of 
electrons  tunneling  through  the  silicon  dioxide  into  monocrys* 
talme  silicon,  electron- hole  pair  generation  occurs  because  of 
the  large  energy  difference  between  the  oxide  and  silicon  con¬ 
duction  bands.  The  carrier  generation  rate  is  fixed  at  1.8  per 
tunneling  electron  winch  is  typical  of  that  reported  in  the  lit¬ 
erature^].  The  model  has  been  verified  on  a  tunnel  oxide  ca¬ 
pacitor  where  a  comparison  of  measurement  and  simulation  is 
shown  (Fig  2).  If  a  floating  gate  is  present,  as  is  the  case  of 


an  EEPROM,  then  the  charge  boundary  value  is  continually 
updated  with  time  according  to  the  F:N  current  flowing.  The 
model  has  been  tested  on  an  EEPROM  device  (1.2/im  tech¬ 
nology)  during  programming  where  measured  and  simulated 
tunnel  currents  show  good  agreement  (Fig-3). 

III.  InternalField  Emission 
This  is  commonly  called  band-to-band  tunneling  (BBT)  and 
occurs  in  regions  of  semiconductor  where  high  electric  fields 
and  large  band  bending  are  present.  The  model  used  is  similar 
to  the  well  known  1-D  model  of  Kane[7].  In  BBT  the  carrier 
transition  probability  can  be  expressed  as, 

^  ,  irm)4  \ 

T~eXP(~2,/2ehE)“P('  ehE  *  (2) 

In  this  implementation  the  local  field  is  used  to  calculate  the 
carrier  transition  probability  which  introduces  some  error  when 
the  spatial  variation  of  the  electric  field  is  not  a  constant.  This 
error  is  largest  when  the  spatial  derivative  of  the  electric  field  is 
a  maximum.  However,  since  the  most  significant  internal  field 
emission  in  a  gated  diode  structure  occurs  in  a  very  localized 
depleted  region  in  the  structure  (Fig.4),  in  general  choosing 
the  local  field  introduces  only  a  small  error.  Furthermore  the 
amount  of  band  bending  must  be  evaluated  at  each  element 
in  the  device  since  electrons  will  not  tunnel  into  the  forbidden 
band.  The  results  of  the  model  are  shown  in  Fig.5  where  a 
gated  diode  structure  has  been  simulated  and  a  comparison 
with  measured  values  shows  good  agreement  using  only  a  sin¬ 
gle  fitting  parameter.  Also  shown  in  Fig  6  are  the  simulated 
(BBT)  substrate  currents  seen  during  the  erasing  of  a  flash 
EEPROM  device,  emphasizing  the  sensitivity  to  oxide  thick¬ 
ness. 


IV.  Conclusions 

The  techniques  used  to  implement  the  F-N  and  internal  field 
emission  equations  have  been  outlined  and  the  models  have 
been  verified  by  comparison  with  measured  data.  The  ap¬ 
plications  of  these  new  models  in  II  FI  ELDS  has  been  briefly 
described  in  te*-ms  of  non- volatile  memory  simulation  which 
demonstrates  the  power  of  the  tool  as  an  aid  in  advanced  mi¬ 
croelectronics  device  design. 
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ABSTRACT 

Bipolar  transistors  designed  for 
room-temperature  operation  suffer  serious 
current  gain  degradation  at  low  temperatures 
and  have  not  been  widely  investigated  for 
their  operations  and  performances  at  low 
temperatures.  In  this  .paper,  bipolar 
transistors  fabricated  in  the  n-well  of  a 
C-MOS  process  have  been  simulated  using 
BIPOLE  program  .  at  low  temperatures.  It  is 
shown  that  at  a  temperature  as  low  as  125°K, 
it  is  possible  to  obtain  a  cut-off  frequency 
of  7  GHz  and  a  current  gain  of  20  which  is 
sufficient  for  some  applications. 

INTRODUCTION 

Recent  advances  in  high  temperature 
superconductors  "(1 )  are  giving  new  impetus  to 
the  understanding  of  bipolar  and  MOS  devices 
operation  and  performances  down  to  liquid 
nitrogen  temperature.  BICMOS  has  also 
received  considerable  attention  as  an 
attractive  room  temperature  VLSI  technology. 
Very  high  speed  poly-emitter  bipolar 
transistors  fabricated  in  the  n-well  of  a 
CMOS  process,  having  room  temperature  cut-off 
frequency  of  about  16  GHz  and  an  ECL  gate 
delay  time  of  65  pSec  have  been  reported 
[2,3} .  Little  attention  has  been  paid  to 
date  regarding  the  performances  of  these 
devices  at  low  temperatures.  Numerical 
simulation  has  been  used  (4-6)  as  a  tool  to 
study  the  influence  of  different  process 
parameters  on  bipolar  device  terminal 
behavior  without  going  into  a  costly 
fabrication  and  measurement  procedures. 

The  computer  simulation  results  obtained 
using  the  BIPOLE  device  simulator  have  been 
validated  first  with  the  reported  room 
temperature  experimental  results  (2,3}  in 
relation  to  the  current  gain,  cut-off 
frequency  and  ECL  gate  delay  time  and  then 
the  results  for  low  temperature  simulations 
are  presented. 

STRUCTURE  INVESTIGATED 

The  transistor  studied  in  the  present 
simulation  is  an  integrated  circuit  device 
with  a  polysilicon  emitter  and  uses  oxide 
isolation.  The  emitter  dimensions  are  0.6  x 
2.4  jiiTand  the  e-b  and  c-b  junction  depths 
were  0.04  and  0.14  jim  respectively.  The 
extrinsic  base  sheet  resistance  was  3.0  0/sq 
and  the  collector  was  characterized ,  by  an 
epitaxial  layer  doping  of  8  x  10A  cm  . 

ANALYSIS 

The  simulations  have  been  performed  using  the 
BIPOLE  program  m  over  the  temperature  range 
from  300  to  77v  K.  It  should  be  noted  that 
BIPOLE  program  includes  physical  effects  such 
as  heavy  doping  bandgap  reduction  (8), 
doping-level-dependent  mobility, 


field-dependent  mobility  (9)  and 
recombination  rate  versus  doping  level  (10). 
The*  program  is  also  capable  of  identifying 
the  contributions  of  charges  in  the  various 
regions  to  overall  delay  time  (11).  Special 
attention  has  been  given  to  the  device 
parameters  such  as  current  gain,  cut-off 
frequency  (fT)  and  ECL  gate  delay  time.  It  is 
seen  that  the  room  temperature  simulation 
results  as  described  below,  agree  well  with 
the  experimental  results  reported  in  (2,3] - 

Figure  1  shows  the  variation  of  cut-off 
frequency  (ft)  as  a  function  of  collector 
current.  The  reported  cut-off  frequency  is 
about  16  GHz  (cf. figure  15  (3))  where  as  the 
simulated  cut-off  frequency  is  seen  to  be 
about  18  GHz. 


Figure  2  shows  the  variation  of  current  gain 
as  a  function  of  temperature.  The  reported 
room  temperature  current  gain  is  about  100 
(cf.  Fig.  11  (3))  and  simulated  current  gain 
is  about  105.  It  may  be  seen  that  even  at 
125  K  a  gain  of  about  15-20  may  be  achieved 
for  the  device  considered  and  is  sufficient 
for  some  applications. 


The  experimental  ECL  gate  delay  time  for  21 
stage  ring  oscillator  is  about  65  pSec  for 
fanout  of  1  and  tree  current  of  around  0.4mA 
with  a  logic  swing  of  0.5V  at  CL-0.0  pF.  The 
simulated  delay  time  (computed  using  the 
expression  reported  in  Ref. (12))  is  about  75 
pSec  as  shown  in  Figure  3. 

Figure  4  shows  the  variation  of  peak  cut-off 
frequency  as  a  function  of  temperature  and  it 
is  seen  that  even  at  125  K,  it  is  possible 
to  have  an  fr  of  about  7  GHz.  The  variation 
of  ECL  gate  delay  time  as  a  function  of 
temperature  is  shown  in  Figure  5.  It  is 

interesting  to  note  that  around  0.3  mA 
collector  current  the  minimum  gate  delay 
time  of  ?bout  70  pSec  is  obtained. 

CONCLUSIONS 

The  computer  simulation  results  agree  well 
with  the  experimental  results  reported  and 
are  sufficiently  accurate  to  give  confidence 
in  interpreting  the  results.  More 
experimental  measurement  data  are  necessary 
at  low  temperatures  to  confirm  the 
temperature  dependence  of  ft  and  current  gain 
of  bipolar  transistors. 
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Abstract:  A  3-D  hybrid  finite  element  model  for  the  stationary  semi¬ 
conductor  equations  is  presented.  Symmetry  group  theory  is  used  for 
each  of  the  three  PDE's  to  derive  seven  weak  current  compatibility 
equations  for  an  8-nodcd  tril  inear  brick  element  leading  to  a  stable 
(i  e.  avoiding  spurious  modes)  hybrid  element  discretization.  It  is 
also  shown  that  more  than  seven  linear  independent  weak  current 
compatibility  equations  can  not  be  introduced. 

I.  INTRODUCTION. 


Several  authors  have  transferred  the  concept  of  "Hybrid  Elements"  or 
"Mixed  Element  Discretization"  developed  in  computational 
mechanics  12), (3)  to  semiconductor  device  simulauon  (1J,  (4),  (5), 
[6]  during  the  last  three  years.  This  dimension-independent  concept 
reduces,  in  the  1-D  case,  to  the  standard  Scharfctter-Gummcl  current 
discretization  scheme  (9)  widely  used  in  semiconductor  device 
simulauon.  Numerical  experiments  showed  that  2-D  hybrid  elements 
lead,  at  least  in  some  2-D  examples,  to  more  accurate  results  than  the 
conventional  box-method  in  conjunction  with  the  Scharfetter-Gummel 
current  approximation  (1).  Crucial  in  a  hybrid  element  model  for  the 
semiconductor  equations  is  the  "correct"  selection  of  the  trial  current- 
and  weight  vectors,  and  the  corresponding  additional  equations  in 
order  to  determine  the  inner  degrees  of  freedom  (DOFs)  introduced 
for  the  current  assumption  (6).  A  "bad"  selection  of  trial  vectors  lead 
to  rank  deficiency  of  the  clement  stiffness  matrix,  resulting  in  the 
phenomena  of  "spurious  modes"  [7). 


Punch  and  Atluri  18)  used  group  theory  to  derive  trial  functions  for 
the  stress  tensor  components  (mechanics  problems)  leading  to  a  stable 
3-D  hybrid  element  (i  e.  no  spuriour  kinematic  modes)  which  is  also 
optimal  with  respect  to  computation  time. 


The  goal  of  our  paper  is  to  present  a  3-D  hybrid  brick  element  model 
(section  HI)  to  solve  the  stationary  semiconductor  equations,  i.e.  the 
Poisson  and  the  continuity  equations,  for  the  unknown  scalar  fields, 
namely  the  electric  potential  and  the  Slotboom  variables  (cf.  model 
problem  defined  in  section  II).  Coordinate  invariance  entails  certain 
symmetry  relations  between  the  coordinates.  These  relations  are 
governed  by  group  theory.  Group  theory  is  used  for  each  PDE  to 
define  seven  weighting  vectors  with  polynomial  components  of 
minimal  order,  corresponding  to  exactly  seven  weak  current 
compatibility  equations  (wcce's)  which  guarantee  a  stable  brick 
element  discretization  with  an  8x8  stiffness  matnx  of  rank  seven. 
Eight  trilinear  shape  functions  corresponding  to  the  eight  comer 
DOFs  are  used  in  this  case  for  the  unknown  scalar  field.  These 
weighting  vectors  guarantee  zero  current  distribution  for  constant 
scalar  field  distribution  (rigid-body-motion  condition).  Furthermore, 
these  weighting  vectors  are  divergence  free:  No  Lagrangian 
multipliers  need  be  introduced.  The  two  following  statements  will 
also  be  proven:  a.)  Each  introduction  of  additional  wcce's  with  a  new 
weighting  vector  linearly  independent  from  the  previously  defined 
seven  wcce's  and  compatible  to  the  rigid-body  motion  condition  is 
inconsistent,  i.c.  results  in  a  condition  for  the  inner  DOFs  tn  the 
current  assumption  only.  This  condition  is  independent  from  the 
global  DOFs  of  the  unknown  scalar  field,  b.j  Leaving  out  anyone  of 
the  seven  specified  wcce's  leads  to  instability  (spurious  modes). 


H.  BOUNDARY  .VALUE  MODEL  PROBLEM. 

We  consider  the  boundary  value  problem 

.div  E  s*  p.  in  f2  (1) 

for  an  unknown  (^-continuous  scalar  field  f,  related  to  the  vector 
field  E  by  E  “  a(f,  x,y,z)gradf.  The  scalar  function  a(f,  x,y,z)  is  as 
yet  unspecified.  The  boundary  30  of  the  3-D  simulation  domain  O  is 
split  up  into  two  disjoint  parts.  Dirichlet  boundary  conditions  are 
applied  on  the  first  part  and  homogeneous  Neumann  conditions  (E  Y 
*  0,  Y  being  the  outward  unit  normal  vector  to  the  boundary)  on  the 
second  pan.  The  inhomogeneous  part  p(f,  x,y,z)  of  (1)  is  a  given 
function  of  f  and  the  coordinates  (x,y,z)  The  regularity  conditions 
for  E  and  p  in  order  to  guarantee  existence  and  uniqueness  of  the 
solution  of  this  problem  are  summarized  e.g.  in  (9].  Each  of  the  three 
stationary  semiconductor  device  equations  is  of  the  form  (1)  using  the 
Slotboom  variables  u  and  v  (exponential  functions  of  the  scaled  Fermi 
potentials)  as  independent  variables.  An  extension  of  the  following 
derivation  for  a  set  of  other  independent  variables  is  possible  [1].  It 
should  be  also  observed  that  the  derivation  applies  to  both  Poisson's 
and  the  continuity  equations. 

III.  3-D  FE-HYBRID  ELEMENTS  FOR  THE  SEMICONDUCTOR 
EQUATIONS. 

Following  Kan's  hybrid  model  (2),  an  independent  assumption  for 
the  vector  E  is  introduced  within  each  element  volume  flg  of  a  given 
mesh  in  the  3-D  domain  Q 
M 

Ea  -  XPjBj(w)  in  <2> 

J-l 

with  M  DOFs  pj. 

Moreover,  another  (^-continuous  assumption  is  introduced  withm 
each  element  fer  the  scalar  field  f,  defined  by 

N 

fa»  £fiHi(x,y,z)  in  Oe,  (3) 

with  N  nodal  degrees  of  freedom  ft  and  with  shape  functions  Hj 
This  assumption  is  Cg-continuous  over  the  whole  simulation  domain 
fl  However,  compatibility  between  the  vector  field  Ea  and  the  scalar 
field  fa  is  not  guaranteed  within  each  element  flg. 
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According  to  the  weighted  residual  method,  two  sets  of  discretized 
equations  are,  considered,' firstly  the  weak  form  of  the  vector  field 
compatibility  (wcce  in  the  case  of  a  continuity  equation)  for  each 
dement 

Jr^j  a' 1  <Ea  -  E  <fa.  ^  f?y)  )  <M>  =  0  in  £Je  ,  (4) 

0=1  —  p»  p<=  M) ,  (assuming  a*1 «  a_1(fa>x,y,z)  *  0). 

with  weighting  vectors  Wj>  and  secondly,  the  weighted  conservation 
of  flux  of  the  vector  field  E  over  the  whole  domain  Cl 
JflHiWlvEa-pidn-O,  (5) 

with  CQ-continuous  weighting  functions  Hj  corresponding  to  each 
DOF  fj.  For  a  further  derivation  of  element  residual  equations  and 
generalized  element  stiffness  matrix,  see  (1). 

The  question  is:  How  are  the  3Kj(x,y,z)  and  Bj(x,y,z)  defined  and 
what  are  the  additional  equations  if  P<M  ? 

IV.  APPLICATION  OF  GROUP  THEORY  TO  THE  WEAK 
CURRENT  COMPATIBILITY  EQUATIONS. 

Consider  a  cubic  element  with  element  faces  parallel  to  the  global 
coordinate  axis,  8  trilinear  element  shape  functions  Hj  are  introduced 
corresponding  to  the  8  element  comers,  sec  Figure  below. 


Consider  the  eight  symmetrized  functions 

&0*  -  +.  h2  +  h3  f  H4  +  Hs'+ h7  +  ^8 ) 

hI»(.H1+'H2  +  H3.'H4  +  H5.  Hg-  Hj+Hg)^ 
h2»  (H1+H2^H3  -H4- 

h3=  (H1,-^H2,+  H3-  H4-  H5r  Hg+Hy^+Hg)  ^ 

h4=  (Hj  -  H2-H3+H4+H5-.H6-  Hy.+  Hg)  1^ 

%3  (•  Hj  +  H2  •  *H3  +  H4  -  H5  -  H5+H7  +Hg ) 

h6»(.Hj-H2  +  H3  +  H4-  H5  +  IV  H7  +  H8)  laac 

h7»(«H|  -  H2 -  H3 -  H4  +  H5  +  H$  +  H7  +  Hg)  I3qc  (8) 

(the  shape  functions  are  restricted  to  the  element  boundary).  Note  that 
this  is  a  regular  transformation. 

Result:  The  7x8  matrix 


r,j- 


is  an  "upper-diagonal"  matrix:  Tjj  /  0  for  H-l*j  and  Tjj »  0  for  i+l/j. 
It  follows  that  the  rank  of  T  is  equal  to  the  rank  of  G  and  also  to  the 
rank  of  the  stiffness  matrix. 

Introducing  a  new  weighting  vector  >J£j,  I>7,  the  row  Tjj  can  be 
described  by  linear  combinations  of  rows  of  Tjj,  i<»7  (a  constant 
scalar  distribution  farovcr  the  clement  leading  to  zero  current 
distribution  implies  Tj  j«  0). 

Theoretical  background:  The  decomposition  of  matrix  T  is  a  direct 
consequence  of  the  generalization  of  Schur’s  lemma  (10]:  Consider 
the  symmetry  group  C  consisting  of  all  48,  rotations  and  reflections 
which  leave  the  cube  invariant.  The  linear  maping 


Figure.  Element  corner  numbering 

Writing  the  wcce's  (4)  in  raatnx  form  and  applying  one  of  Green's 
formulas  to  (4),  cf.  (1),  the  matrix 


G:  Hj - >&jY  00) 

whose  matrix  form  is  defined  by  T,  commutes  with  every  other 
element  symmetry  transformation.  As  a  consequence,  irreducible 
representation  spaces  of  C  contain  the  eigenspaces  of  G. 

Thus  the  \¥j  and  hj  transform  according  irreducible  representations  of 
C  (10).  Equations  (8)  arc  the  projection  operators,  well  known  in 
representation  theory. 


has  to  be  evaluated.  This  matnx  describes  the  coupling  between  the 
pj  in  (2)  and  the  fj  in  (3). 

The  seven  divergence-free  weighting  vectors  are  (local  coordinate 
origin  at  element  centroid): 

(1. 0, 0]T  ^4  » (0,  z,  y)T  Y£7=iyz,  zx,  xy]T 
a2-(0. 1.0]T  JVj  -  (z,  0.  x)T 

i^3  *  (0. 0,nT  « [y,  x,  0|T  (7) 


i 


I 


V.' CONCLUSION. 


Using  symmetry  relations,  we  showed  that  at  least  seven  wccc's  are 
needed  to  avoid  spurious  modes  for  a  cubic  trilinear  element,  A 
similar  derivation  of  wcce's  is  possible  if  quadratic  and  cubic  or  other 
shape  functions  Ht  are  introduced.  An  extension  of  the  theory  to  brick 
elements  is  possible.  It  is  hoped  that  , the  selected  weights  are 
"adequate"  for  moderately  distorted  brick  elements,  compare  [il]. 

In  the  case  of  a  «  constant,  the  same  weighting  vectors  can  be  used 
for  the  vector  field  assumption  (Poisson's  equation).  In  this  case, 
the  first  term  in  equation  (4)  splits  in  blocks  along  the  diagonal,  cf. 
[10],  resulting  in  some  computational  speed  up. ' 

If  a  ^  constant,  at  least  seven  vectors  have  to  be  introduced  in  the 
vector  field  assumption  (continuity  equations).' 

If  mere  than  seven  trial  current  vectors  are  used  in  Ea,  additional 
equations  have  to  be  defined  (for  example  integrabihty  conditions  for 
a*  ^  or  inhomogeneity  conditions  for  divfj  -  p ). 

Note  that  in  the  case  of  fij  *  Sj  for  j»  i, 7  leads  to  symmetric 
element  stiffness  matrices. 
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ON  AN  INTEGRAL  EQUATION  DESCRIBING  THE  SWITCHING  BEHAVIOUR 
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The  standard  model  describing  the  switching  behaviour  of  a 
PN-diode  consists  of  the  well-known  Drift-Diffusion-Equations  of- 
van  Roosbroeck;  Under  simplifying  assumptions  (zero  space  charge 
approximation,  low  injection  limit)  an  integral  equation  just  in  terms- 
of  the  time-dependent  current  I(t)  can  be  obtained  from’thesc 
coupled  partial  differential  equations.  A -singular. perturbation 
analysis  of  the  integral  equation  yields  a  mathematical  justification  of 
the  physically  observable  constant-current  phase. 


I.  INTRODUCTION 

The  investigated  physical  system  is  an  electrical  circuit  that 
consists  of  a  voltage-source  generating  a  potential  drop  U(t),  a  serial 
Ohmic  resistance  ©-and  a  PN-diode.  U(t)  is  assumed  to  be 
piecewise  constant  for  t  >  0  and  t  <  0,  respectively.  At  t «  0,  U(t) 
changes  abruptly  from  +W  (biasing  in  forward  direction)  to  -U 
(reverse  biasing).  The-  physically .  most',  interesting  quantity 
describing  this  abrupt  switching  is  the  time-dependent  current  I(t), 
Via  identification  of  the  PN-diode  with  a  bounded  domain  ft  c  Rn, 
n»l,2.3,  see  Fig.l,  the  standard  (semi-classical)  model  describing 
the  diode's 


Cn 


cP 

Fig,l. 

dynamics  given  by  the  (scaled)  Drift-Diffusion-Equations  of  van 
ROOSBROECK  [vanROOSBROECK.1950]  is  applicable.  (For  the 
scalings  see  {MARK.R1NG,SCH,1990).> 


)?AV  =  n-p-C  (Dl) 

(D2) 

Jp  =  -Mp(Vp+pW)  (D3) 

divJnasD(n+R  (D4) 

divJpa-9(p~R  (D5) 


The  physical  quantities  are  the  electric  potential  V,  the  (non¬ 
negative)  concentrations  of  electrons  (n)  and  holes  (p),  the  doping 
profile  C  (which  i$  strictly  positive/negauve  in  the  N-region/P- 
region  /  ftp)  and  the  electron  and  hole  current  densities  Jn  and 
Jp-  Mn  and  |ip  denote  the  electron  and  hole  mobilities,  respectively, 
which  are  assumed  to  be  functions  of  the  spatial  variable  x  taking 
positvie  values.  The  differential  equations  involved  are, 
respectively,  the  Poisson  equation  for  the  electric  potential  V,  the 
continuity  equations  for  electrons  and  holes  and  the  electron  and 
hole  current  relations.  The  spatial  variable  ranges  over  ft,  the  time 
variable  t  ranges  over  (0,*°).  R  denotes  the  Shockley-Read-Hall- 
recombination  term 


_ _ np-54 

tn(p+62)  +  tp(n  +  52) 


(R) 
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based  on  mass-action- models  of  the  electron-hoTe-interaction. 
t0  and  are  the  mean  life-times  of  electrons  and  holes.  They  are 
assumed. to  be, constant  in, the -N-region  and  the  P-region, 
respectively.  Equations  (D 1  )-(D5)  are  subjected  to  the  boundary 
conditions  (B I )  along  CN ,  Cp : 

n-p-c  =  0 
np  =  64 

V  =  Vjy(x)4 U +C0l(t)  ,  X€Cn 

V  =  Vy(x)  ,  xeCp  (Bl) 


where 

Ift)  =  /(J„  +-V  ?-23t(Vv) )  •  v  do 


Sc+JEl 

1  258 


(i) 

(V) 


to  homogeneous  vonNcumann  conditions  along  C  . 

(Vn)*e  =  (Vp)*e  c(Vv)-eaO  (B2) 


and  to  initial  conditions : 

n(.,0)  =  n‘(.) .  p(.,0)  =  p‘(.) ,  V(..0)  =  v'(.)  (D!) 

where  n1,  p1,  V1  are  solutions  of  the  stationary  Drift-Diffusion- 
Equations,  Under  simplifying  -  but  physically  reasonable  * 
additional  assumptions  (smoothness  of  ft  and  the  physical 
parameters,  zero  space  charge  approximation,  low  injection  limit)  a 
nonlinear  Voltena  Integral  Equation  of  the  second  kind  with  I(t)  as 
unknown  function  can  be  obtained  (see  IUNTERREITER,19913, 
(SCH,  UNT.  WEISS,  1991)): 


c<-U-wI(t»  oi+clp+«I-Ip)*g)(i'  (la) 

c<w-“»f>  =  l  +  clp  (lb) 

Ip  denotes  the  (scaled,  dimensionless)  constant  cutrent  that  has 
flown  through  the  device  for  t<0.  c  cumulates  physical  parameters 
(such  as  the  mean  life-time  of  electrons  and  holes)  of  the  diode. 
denotes  the  convolution  operation.  The  integral  kernel  g  is  an 
infinite  sum: 

S(l>=  £<Kj)2«P<V>  (2) 

i*l 

g  is  a  completely  monotone  U(0.—)*function  with 


iim(l*g)(0  =  c 


(3) 


Die  set  l(Xj,K,))  ts  deteimtned  by  the  eigenvalue-problem 

UbJl  =  >-i«i  (4a) 

where  (  Y  -  W  > 

dom(L)  c  =  (f  6  H'W-flc.  =  O.a  e  (N.P), 

^yflp=  const)  (4b) 

UH»  (u)  =  -/7<iiv(^  V(^yu))  -  ^  (4  c) 

(extended  weakly  to  dom(L)) 
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and 

Kj=VY*,'r  fti) '  - 

p  and  i  arc  the  mobilities  and  mean  lifetimes  of  the  minority  charge 
carriers  in  the  N-  and  P-region  of  the  diode. 

Furthermore;  0  >Xj  £  X2  £  X3  £  an(* 


HmXj=-«>  •  (4e) 

(See  (UKTERREITER,  1 99 U  for  further  details.) 


n.-THE  EXISTENCE  OF  A  CONSTANT-CUF RENT  PHASE  (*) 

One  of, the  most  significant  features  _i  the  physically  observable 
cunent  I(t)  is  its  time-independence  :or  a  certain  period  [O.TJ,  The 
justification  of  the  existence  of  such  a  “constant-current  phase"  as 
well  as  the  calculation  of  its  duration  can  both  be  facilitated  by  a 
singular  perturbation  analysis  of  the  integral  equation  (la). 

The  scaling  factor  of  the  potential  U  in  (la)  is  given  by  the  so-called 
"thermal  voltage"  UT  which  is  typically  0(0  025Volt).  UT  is  rather 
small  in  comparison  with  the  unsealed  versions  of  U,WD  - 
W-coIp  «  0(1  Volt).  Consequently,  it  is  advisable  to  introduce 


1 

su+wD 


‘10- 


(5) 


with  U  and  WD  being  scaled.  On  the  other  hand,  the  derivation  of 
(la)  essentially  depends  on  the  assumption  that  I(t)  is  well-scaled 
during  the  whole  switching  process  (low  injection).  Therefore, 


na 


1(0+) 

h 


is  a  well-scaled  quantity.  Introducing  z(t)  :*  *coI(t)  and 

p  T 

(la)  becomes  in  Rims  of  z<t) 

exp(s-i)  =  i — — - — -*g- 
£  c(l+ep) 


_en_ 

c(l+ep) 


t  g 


(6) 


t»> 


Referring  ro  (M1LLER,UNTCRRE1TER,1991)  and  (SCH.  UNT, 
WEISS.  1991) . 


z(0)  £  z(t)  ^  z^  zs  lim  z(t) 


(8) 


z(0)  ■  (1/e)  suggests  the  ansatz  z  ■  (1/c)  +  y  for  sufficiently  small 
t.  (7)  becomes  in  terms  of  y 


ey  =  1- 


1+H 

c(l+ep) 


en 

c(i+ep) 


y*g 


(9) 


As  long  as  y»  o(l/e)  for  e— 0  on  an  interval  (0,T1  ,  y  can 
approximatively  be  neglected  and  z  is  almost  constant. 

Theorem 

Let  0  £  e  <  (q/p)  and  define  ^>0  such  that 

<io> 

1)  If  y « o(l/e) for e-0 on (0,T),  then  T£tg. 

2)  If  g(t) o 0(1*°) for  t—0 ,0£a<  1, then  y « 0(lne) ■  o(l/e) on 
(0,101- 


Part  2  of  this  Theorem  guarantees  the  existence  of  a  constant- 
current  phase  -  provided  it  can  be  proven  that  g(t)  defined  by  (2) 
posse ses  at  most  an  algebraic  singularity  at  the  origin: 


.Theorem. 

If'iKj)2  £  K*  and  IXy^l- 1 ,  p  >  1 ,  as  j-« ,  then 
•gM-OfrO®)  as  t-0. 


■When  dealing  with' one-dimensional  models  ofPN-diodcs 
and  provided  C  and  n  arc  constant  in  the  N-region  awl  the  P-region; 
respectively,  the  eigenvalues  X»  are  the  zeros  of 


V,  ian]K,(X)lN1  |  Y-  tan[K,(>.)lp|  _  0 
14  K+W,  R-  K-(X) 


where;  the  .indices:  V  denote>thc  -N-region-/-P-region, 
respectively,.-^, and,  lp  are  the  scaled -lenghts  (one-spatial 
dimension!)  of  these  regions  and 

K=r?  <i2> 


Kj  Canbc  computed  from 

k2  ( iN — - 

Y+sin-(ic+lN)  Y+it+ 5in(K*lN) 

,  I  1 _ 1.  .COS(Kjp)  2 

PY-sm?(x.lp)  Y-K-  sin(K_lP)  1 


(13) 


it  is  easy  to  sec.  that  3.j-0(j2)  as  j— °°-and  that  (K,)2  stays 
bounded  as  j— «>.  Hence,  in  the  case  of  one-dimensional  modelling 
of  PN-diodes  the  physically  observable  constant-current  phase  has 
been  justified  from  a  purely  mathematical  point  of  viewi 


(*)  The  proofs  of  th^  cited  theorems  can  be  found  in 
(UNTERREITER,  199IJ. 
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A  NOVEL  COMPUTATIONAL  PARADIGM: 
MUCH  MORE  EFHOENT  THAN 
VONNEUMANN  PRINCIPLES 

R.W.  Hinecodn,  H.  Rekuj.  M.  RiedrauUer,  K.  Schmid: 

Univcxsias  Kisersliutcc,  Fjchhatidi  Infornuuk,  Bin  12 
Postfadi  3049,  DrW-6750  KaisenJamera,  Gejmsiy 
phone  (+49-631)205-2606,  Fax:  (449-631)205-3'00 


Afakact 

Compnam  (baled  oo  voa  Xcmiaj  principles)  nr  caraudy 
ifleffioent.  Thff  wfcytHs  pager  iosmfaccs  z  pprd  coMpoMDoual 
paradigm  based  oo  dcw  hardware  machine,  principles. Such 
machines,  called  “xpaicrs”  avoid  most  of  the  bookaecksknown 
from  (voa  Kecaam)  cocpcttn.  so  that  a  hardware  efficiency  is 
obtained  wtach  is  jn^er  by  sevgal  orfas  of  m^mmde- By  means 
of  a  few  algorithm  examples  the  new  peadsgm  will  be  hxrodoccd  as 
a  new  programming  paradigm,  winch  is  data-procedonl  (which  is 
mote  direct  than  th c  octroi-procedural  voa  Neumann  paradigm). 
Finally  the  paper  gives  a  survey  oo  the  novel  application 
development  environments  needed  for  xptaers  and  their  advantages 
over such  tools  for  compeers.  Such  application  support  for  xputers 
includes  two  alternative  source  levels:  high  level  programs,  or  very 
high  level  algorithm  specifications. 


The  Machine  Paradigm  of  Xputers 
In  prograrscing  and  coding  von  Neumann  machines  their  basic 
paradigm  causes  massive  overhead  (control  -  flow  overhead, 
addressing  overhead  and  other  kinds  of  overhead  11]].  which  cats 
op  most  of  the  processor's  throughput  and  its  primary  memory 
bandwidth-  Doc  to  the  control-driven  operation  principles  by 
multiplexer-based  instruction  sequencing  the  individual  processor 
permits  only  very  limited  intra-ALU  parallelism  (1]-  Parallel 
■computer  systems  suffer  from  high  hardware  cost  and  from  massive 
imer-processor  communication  overhead  [2].  In  addition  to  that 
most  of  the  processors  arc  idling  (a  very  few  applications  are 
exceptions  from  that).  Data  flow  machines  suffer  from  massive 
token  flow  overhead  needed  for  agination  and  from  other  kinds  of 
overhead  [2]  and  debugging  is  extremely  difficult. 


fig- 1:  basic  blocks  of  xputers 

Xputers  use  a  data  sequencer  (instead  of  an  instruction  sequencer} 
including  a  powerful  address  generator  to  avoid  addressing 
overhead  and  control  flow  overhead  whenever  possible  (see  also 
fig.  1).  Instead  of  an  ALU  driven  by  an  instruction  sequencer, 
xputers  use  a  reconfigurable  ALU  (called  rALU),  such  thai  powerful 
compound  operators  exhibiting  tmra-ALU  parallelism  may  be 
defined  at  compile  time.  More  hardware  details  have  been  published 


cbewhoc  (1-3J.  A  smart  regsscr  file  (called  scat  cache,  being  a  sac 
adjustable  window  to  memory  space)  inchxSng  an  imelligcm  daa 
memory  interface  supports  ixwcom?*Je  time  optinczatica  smegjcs 
bczog'xnore  efficient  than  those  which  can  be  mapped  os  voa 
Neumann  hardware  only.  Algorithm  execution  examples  wfchm 
other  sections  of  this  pxper  will  pvc  a  flavour  of  the  high  efficiency 
of  this  novd  machine  organization. 

For  computers  die  RAM  (random-access  memory)  is  the  central 
technology  plezfonn  which  provides  flexibility  and  umvcrsality.  For 
xputers,  however,  the  flexibility  and  cni  venality  is  derived  from 
using  iraerconnea-reprogranunable  media  (compare  fig.  1),  also 
called  field-programnuble  media,  which  2re  comoacially  available 
from  a  bBfiooUS-doBareicle  of  fee  iaagzadcncmt  market  (1990: 
world-wide). 

Programming  and  Co.^&iiglipn  for  Xputers 
The  Xpmer  machine  paradigm  is  also  visible  from  a  programmer's 
point  of  view.  Figures  2  and  3  show  two  simple  algorithm 
examples,  which  will  be  used  to  illcsaxe  programming  for  xputers, 
execution  of  programs  on  xputers.  as  well  as  the  compilation  for 
xpuers.  Fig.  2a  shows  the  textual  notation  of  a  simple  recurrent  8 
step  loop.  The  right  ride  of  fig.  2b  shows  its  equivalent  graphic 
notation:  a  signal  flow  graph  (SFG)  which  reveals  the  regular  data 
dependences,  for  programming  an  xpmer  from  such  a  specification 
the  following  code  dements  are  needed  (to  be  generated  by.  a, 
compiler):  a  rALU  subnet  {compound  operators:  saving  memory 
cycles,  since  intermediate  results  are  not  stored)  has  to  be  derived 
(eg.  see  fig.  2b:  by  just  picking  an  iteration  from  the  SFG);  ad  data 
map  has  to  be  derived  (2-dim.  data  map  example  in  fig.  2c:  mapping 
the  SFG  onto  a  grid);  the  scan  cache  size  has  to  be  selected  (eg.  1- 
by-4  words:  see  fig.  2d);  and  an  address  sequence  (called  scan 
paacrtc  see  fig  2d)  has  to  be  selected. 

Fig.  2d  also  illustrates  xputer  operation  in  executing  the  algorithm 
example  from  figures  2a  and  2b.  First  the  data  sequencer  makes  the 
scan  cache  jump  onto  the  leftmost  column  of  the  data  map.  fig.  2d 
also  illustrates  the  auto-apply  mode  and  the  auto-copy  mode.  This 
means  that,  whenever  the  scan  cache  is  placed  onto  a  particular 
memory  location,  it  automatically  (L'e.  without  needing  a  controller, 
thus  avoiding  control  overhead)  invokes" operation  of  the  rALU 
subnet  currently  selected,  as  well  as  a  cache/memory  communication 
cyclc-Since  a  register  has  been*  added  to  the  rALU  subnet  to  save 
cbJ.  only  four  memory  read  cycles  arc  needed  (see  left  ride  of  fig. 
2d).  Due  to  the  scan  pattern  in  our  example  this  is  repeated  8  times 
until  finally  the  scan  cache  arrives  at  the  rightmost  position  of  the 
data  map.  The  tagged  control  word  (TCW)  found  there  tells  where 
to  store  the  final  result  and  provides  the  linkage  code  to  select  the* 
next  scan  pattern  etc.  So  o;  .  out  of  34  memory  semi  cycles  has- 
been  used  for  control,  what  we  call  sparse  control  or  residual  control 
(also  compare  fig.  1). 

Fig.  3  shows  another  simple  algorithm  example:  a  2-dimensional 
filtering  example  where  a  simple  video  scan  is  used  as  a  scan  pattern 
(fig.  3b)  to  scan  a  2-dim.  data  map  (fig  3b)  by  a  3-by-3  word  scan 
cache  (fig.  3a)  Fig  3c  shows  the  very  powerful  compound 
operator,  which  needs  relatively  few  hardware,  since  smaller  data 
path  width  (8  bits)  could  be  traded  for  more  operators  This 
illustrates  the  high  flexibility  of  xputerc  By  code  analysis  it  has 
been  found,  that  in  the  von  Neumann  version  of  the  same  a]gor.„hm 
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(oe  Vax-117750)  935*  of  coepodci  time  is  cscd  for  address 
compaoaoa.  The  MoM  (2]  xpmer  vexshm  avoids  this  addressing 
owncad  by  its  briwed  address  generator  (3]  rcmriag 
i  nai  ■  [Mfj-ffrff  rr~i1rfiTmTif|T  nrir— rilTfirMrr).  ■briritr ~ 
dim.  memory  organization  permits  eves  berrr  performance.  A 
spreadsheet  rrprrwrmioa  of  the  data  map  (4j  supports  a  highly 
artaathcaKrimerface  far  xpmer  appSearioas. 


fig.  2:  A  simple  algorithm  example:  a)  textual,  b)  graphic 
specification,  c)  deriving  a  data  map.  d)  xpuier  operation  illustration 


Hlflh  Performance  Applications 
fig-  4  shows  a  number  of  acceleration  factors  having  been  obtained 
experimcnttlly  from  a  number  of  algorithm  implementations  on  the 
MoM-1  and  MoM -2  xputers.  compared  to  implementations  on  a 
Vax-1 1/750.  These  results  show,  that  for  a  number  of  application 
areas  (Digital  Signal  Processing,  image  processing,  computer 
graphics,  uniform  equation  systems,  etc.)  xputers  are  competitive  to 
many  ASIC  solutions,  to  imp’emcntatio.is  on  special  processors 
(e.g  digital  signal  processors),  or  to  implementations  on 
supercomputers.  A  so  called  retargeting  feature,  being  available 
commercially,  permits  a  direct  path  from  an  implementation  on  a 
programmable  xputer  to  a  very  cheap  customized  silicon  solution: 
machine  code  can  be  directly  submitted  for  gate  array  fabrication, 
such  that  no  expensive  ASIC  design  process  is  needed  (2). 


Conclusions 

A  newel  high  performance  caanac  pxxSgm  of  xpufcrx  has  been 
ioooduced.  wfcxh  for  a  very  large  class  of  algorithms  is  by  several 
onto  of  magnitxfc  acre  rfTekxt  tirtn  the  vo«  Neumann  paradigm. 
The  data-feocednral  principles  of  operation,  whsch  are  more  intuitive 
and  more  direct  than  tratScccd  pradpte,  have  been  Skstraaed  by  2 
examples,  two  cxpcritncscdxpegrarchaecaac  examples  (the  MoM- 
1  and  the  MoM-2)  have  been  implemented  by  breadboard 
techniques.  Highly  promirieg  acceleration  factors  for  a  number  of 
algorithms  have  beak  obtained  experimentally  by  comparing  MoM 
impfaacaaootts  verses  cnpigaesasocs  co  Va*4 1/750. Tberesahs 
show,  dat  xposers  are  competitive  to  cany  ASIC  solutions  as  well 
as  lo  many  algoridea  raplexcrsatiocs  oo  supercomputers.  A  third 
xpmer  ardeteoarc  (the  MoM-3)  stressing  ffcubifity  and  uctivexsaHiy 
isaarendybacgis^femsagdarKasg^agiCTn. 


application  example 

factor 

CMOS  design  ndc  check 

>2000 

sup.  electrical  roles  check 

>300 

Lee  routing  wfchoct  obsseks 

>160 

with  obstacles 

>800 

2-dancnsioQal  fiherisg  (3  by  3) 

>300 

vector-matrix  multiplication 

150 

fig.  4:  acceleration  factors  obtained  ecp-riv  -tally  (xputer 
implementation  versus  optimized  Vax-1 1/750  it^x^uaation 
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fig.  3:  A  2-dimensiocul  filtering  example 
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Abstract  -  In  the  paper  the  fteaoryless 
method  of  analysis  of  dynamic  linear  systems 
described  by  convolution  integral  is 
presented-  The  eeaoryless  method  is 
applicable  if  one  of  the  convolved  functions 
is  a  sun  of  exponential  teres  with  real  or 
complex  coefficients.  The  algorithm  based  on 
the  proposed  method  is  numerically  simpler 
then  the  known.  di  r  ect  algorithms  of 
convolution  in  the  time  domain. 

1.  INTRODUCTION 


2.  TOE  BASIS  OF  TOE  MEMORY LESS  ALGORITHMS 


If  in  eq.  C2)  coef 1 1 c*  ents  .  for 

i»1.2 . L.  denote  the  real  numbers  then 

to  solve  eq.  Cl>  in  the  n-th  step  of 
calculations  the  following  memoryless 
algorithm  can  be  used  131 


y;  -  r  -y; 

y"*iy» 

where  C* 


'U 


expC-a.  •£>, 


C35 

CO 

C55 


The  natural  way  for  the  derivation  of 
the  output  signal  yCt>  of  any  linear. 
tiee-I  nvariant  system  is  to  calculate 
numerically  the  convolution  Integral 
t 

yCO  =  kCt5*xCt5  *  J  kCt-r5 -xCOdx.  05 
O 

where:  kCO-the  pulse  response  of  the 

analysed  system.  xCO-the  Input  signal,  for 
the  sequence  of  the  tiae  saaple  values 
tnCn*l,2..  .  ,M.  The  direct  algorithms  in 
tho  tiae  domain  for  convolutions  are 
numerically  complex,  because  the  nueber  of 
aul tipi  1  cations  in  tho  realization  of  these 
algorithms  for  n  aaples  is  proportional  to 
n2.  Usually,  in  applications  numerous 
techniques  based  on  the  Fourier  or 
Laplace  transformations  are  used  [11  If  one 
of  the  convolved  functions,  e.  q.  kCt5  is  of 
the  fora 

L 

kCt5  ».  £  Ai  *  expC-a  -O  .  C25 

let  1 

then  the  memoryless  algorithm  described  ir. 
tho  paper  (2)  can  be  used  In  the  paper  the 
memoryless  algorithm  in  the  time  domain  for 
the  analysis  of  a  larger  class  of  linear 
systems  is  presented 


The  memory  less  algorithm  can  also  be 
formulated  for  the  case  In  which  the  kCO 
function  is  given  of  the  form  of  the  sum  of 
trigonometric  functions.  Idea  of  such 
algorithm  will  be  explain  for  the  simple 
case,  when 

kCO  °  B*e  a  *'*cos«t.  C85 

Becouse  of 

Be  aS:oswt  -*  ~*expl  -Ca-JuOt  J  ♦ 

♦  |*oxpl*<a+J«5t),  C9) 

than,  after  taking  advantage  of  linear 
properties  of  the  convolution  integral,  we 
receive 

y<t>«xCt>  *•  |  expt -Ca-JaDt)  ♦ 

♦  xCt5*|  expl  -Ca+J«5t)  'y^O  y^CO .  005 

where 

ylCO  ■  y2  Ct°  C11> 

that  moans 

Re  (y  CO)  *  Re  CygCOJ  025 

Im  fy  CO)  *  -  1m  tygCO)  035 
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So.  in  the  n-th  step  of  calculations 

yn  -  2  *  2  R<*y^-  <14i 

For  the  clarity  of  algorithm,  in  the 
further  consideration#  let's  take 

yn  -  2  RoCy^X  .  C15> 


Tnu~.  the  parameters  in  egC2>  can  be 
described  by  the  following  dependences 


and  from  cq  C5>  we  have 
£  «  e  a^Cc osuA  *  J  slrxcAX 


C16) 

07) 


Cl  8) 


After  including  eqs.C3,4.6,7.14-18X  for  the 
n-th  step  of  calculations,  the  memory  less 
algor i tho  is  of  the  fora 


*  **Vxn-l  *  Re<Vxn 

loCyS-  q-UnCy*.^.^  *  Re<y*_,3-«s  * 

'  «  V*W  *  «V*S. 

yn  -  3  ReCy^> .  <193 


y  -  2-  f  t  R^y‘n?  *  I  3 

n  li=l  cn  k**l  sn 

Re<ycn3^-tReC4n-135-6l1-Il:<y^n-l>:,*4J 


[*“  <yc>  -  4 


J - A‘V*2  . 


^y.n3  “  4‘tl!*ysCn^354*ReCys<n-13,43 


•  VA  *3 


v‘.  4 


where  6* 

Series  of  nuaerlcal  calculations 
performed  with  the  use  of  computer  prograa 
based  on  the  described  algoritha  confirm  the 
correctness  and  effectiveness  of  the 
memory  less  method. 


4.  SUMMARY 


where 

hj=  q-CB-f-fij  ♦  j  <203 

h_«  8-7  ,  <213 

8  4 

and  q  ®  c  *  dj  *»  coswt,  6 ^  »  si  not. 

In  the  same  way  we  can  formulate  the 
momoryless  algorithm  for  the  function  kCt> 
of  the  form  of  si nut. 

3.  ThE  MAIM  ALGORITHM 

The  ■no’noryless  algorithm  from  section  2  can 
be  generalized  for  the  Fourier 
roprosontaUon  of  function  of  kCO  of  the 
form 

n. 

kCt^  a  J7A  cxpC-a .  *Ocos».  t  * 

1=1  1  1  1 

♦  jf'B,  expC-b.  t>sinw.  t  .  C22> 

1-1  K 

In  •Ms  case  the  consecutive  samples  of  the 
convolution  y<t)  are  derived  according  to 
equations 


In  the  paper  the  new  algorithm  of  the 
convolution  integral  calculation  is 
proposed.  This  algorithm  is  applicable  for 
the  special  form  of  the  convolved  functions. 
The  principal  advantages  of  the  proposals 
algorithms  consists  in  theirs  numerical 
simplicity  and  in  theirs  "memory! ess” 
properties,  that  means,  for  the  calculation 
of  any  output  samplo  only  the  actual 

input  sample  value  and  the  preceding 

output  sample  value  yn_j  are  sufficiert. 

REFERENCES 

1  Ayarval  R.C.  .Cooley  J.  W.  :  New  Algorithm 
lor  Digital  Convolutions.  IEEE  Trans  on 
Acoust.  .vol  1977  N.  4.  .p.392. 

2  Janko  W.  .Zarebykl  J  *  Fast  Algorithm  for 
the  Special  C*se  Convolution  and 
Deconvolution  Calculations,  IEEE  Intern. 
Symp.  on  Circuits  and  Systems.  New 
Orleans,  Louisiana, 1990. vol .  3,  p.  2377. 


1720 


\ 


A  REGULARIZED  SOLUTION  FOR  DECONVOLUTION  OF  SIGNALS  WITH  EDGES 
BY  AN  ADAPTIVE  ITERATIVE  VERSION  OF  TIKHONOV-MILLER  METHOD 


CARMEN  SANCHEZ  AVILA 
Department  of  Applied  Mathematics 
E.T.S.I.  Telecom  unicaci6o.  U.P.U. 
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Abstract:  An  adaptiTe  regularized  iterative  algorithm  which 
solves  the  'ill-posed*  problem  of  deconvolution  of  signals 
with  edges  in  a  numerically  stable  way  by  incorporating  an 
Adaptive  Projection  Operator  is  described.  Considering  the 
joint  detection-estimation  character  that  edges  deconvolu¬ 
tion  problem  have,  we  introduce  an  adaptively  contracted 
(projection)  selection  operator  to  detect  the  sharp  changes 
in  intensity  (2-P)  or  the  edges  (I-D)  of  the  solution, 
which  can  be  combined  with  iterative  algorithms  based  in 
the  Tikhonov-Miller  method.  A  numerical  example  shows  the 
improvement  of  this  new  adaptive  method  proposed. 


I.  INTRODUCTION: 
DESCRIPTION  PROBLEM 


In  signal  deconvolution  the  ultimate  goal  is  the  re¬ 
covery  of  the  original  signal  from  a  degraded  version- 


or  formally: 


k(x,t)f(t)dt 


g(x) 


a  s  x  s  b  (1) 


Hf»r  (2) 

We  will  consider  the  discrete  version  of  (2) 

H  f  •  g  (3) 

In  addition,  the  degraded  signal  is  nearly  always  corrupted 
by  random  noise.  We  model  our  noisy  degraded  signals  as  fo¬ 
llows- 


H  f  ♦  n  «»  g  (4) 

where  H  Is  an  impulse  response.  We  will  here  concetrate  on 
situations  in  which  H  is  assumed  known,  and  the  abrupt  cha¬ 
racter  of  /  Is  also  known  a  priori. 

Strictly  speaking.  (4)  establishes  a  detection  plus 
estimation  problem  First,  we  have  to  identify  the  edges  of 
/  (original  signal),  then,  to  estimate  the  values  of  this 

sharp  changes. 

Is  well-known  that  this  problem  corresponds  to  resolve 
a  Fredholm  integral  equation  of  first  kind,  and,  in  Hilbert 
spaces  is  generally  an  “Ill-posed*  problem.  We  will  say 
that  a  problem  Is  “well-posed*,  in  the  Hadamard  sense  ill, 
if  verifies  the  following  conditions-  i)  there  is  a  solu¬ 
tion  for  each  g,  ill  the  solution  is  unique,  and  ill)  the 

solution  is  a  cent  incus  function  of  the  data  g.  that  is 
the  problem  is  stable  to  small  fluctuations  of  g.  If  some 
of  these  characteristics  is  not  satisfied,  wc  will  say  that 
the  inverse  problem  Is  “ill-posed*  Then.  In  our  case  a)  a 
solution  In  the  classic  sense  doesn’t  exist,  and  It’s  ne¬ 
cessary  to  define  what  we’ll  call  an  approach  solution  to 
the  problem  (2’*,  b)  the  problem  is  not  stable  as  far  as  g^ 

and  g2  are  close  elements  and  however  their  respective  so¬ 
lutions  fi  and  f ^  may  be  substantially  different. 

In  recent  years  Iterative  signal  restoration,  an  ite¬ 

rative  signal  deconvolution  in  particular,  has  been  given 
considerable  attention. 

However,  most  of  the  existing  iterative  algorithms  are 
derived  without  explicitly  taking  into  account  the  presence 
of  noise  in  the  degraded  signal.  As  a  result  excessive  noi¬ 
se  amplification  may  occur  and  heavy  ringing  effects  will 
be  visible  when  the  number  of  iterations  Increases  It  can 
be  shown  that  both  effects  are  due  to  the  *,ill-posedness“ 
or  “ill-conditlonedness"  of  the  deconvolution  problem  (II 
In  order  to  solve  the  “ill- posed*  signal  deconvolution  pro¬ 


blem,  a  priori  knowledge  abcut  the  original  signal  has  to 
be  included  In  the  derivation  of  the  deconvolution  algo¬ 
rithm.  Such  an  approach  is  commonly  referred  to  as  “regula¬ 
rization*  (I). 

Our  approach  is  to  find  a  regularized  solution  to  an 
“ill-conditioned*  system  of  equations.  The  regularization 
approach  we  take  here  is  based  in  the  Tikhonov-Miller  me¬ 
thod,  which  we  will  revise  in  section  II.  In  section  ill  we 
propose  a  way  to  do  this  method  adaptive.  In  section  IV  we 
present  a  numerical  example  which  shows  the  advantages  of 
this  new  method. 

II.  THE  TIKHONOV-MILLER  METHOD 

The  theory  of  “ill-posed*  problems  and  the  methods  to 
solve  them  began  to  be  extensively  developed  after  funda¬ 
mental  works  by  Tikhonov  [2-3J.  The  most  important  disco¬ 
very  In  this  field  was  the  concept  of  approximate  solutions 
of  "ill -posed*  problems  (21,  the  means  of  finding  such  so¬ 
lutions  being  based  on  the  concept  of  regularizing  algo¬ 
rithms  (3).  Tikhonov  has  suggested  the  following  scheme  for 
obstructing  a  regularizing  algorithm. 

Let  us  consider  the  “ill-posed*  problem  (2),  where  / 
and  g  €  H  (H  is  a  Hilbert  space  finite-dimensional),  and  H 
is  a  continuous  linear  one-to-one  operator  from  H  to  H 
Let  us  consider  the  functional¬ 
ly/)  ■ 

where  a  is  called:  regularization  parameter.  It  can  be  seen 
that  under  the  above  conditions  the  functional  M^C/)  (for 

any  aX))  is  very  convex  and  it  will  have  a  ’ower  bound  on  H 
only  at  the  point  f ^  This  extremal  fa  of  functional  M ft(/) 

is  found  from  the  solution  of 

H1  H  f  »  o  /  =  nT  g 

Additionally  from  the  regularization  theory  111,  it  is 
well  known  that  physically  meaningful  solutions  to  “ill- 
posed*  problems  can  be  obtained  by  incorporating  a  priori 
information  about  the  original  data  or  the  noise  into1  the 
solution  method 

In  signal  deconvolution  we  usually  have  available  an 
estimate  of  the  norm  of  the  noise  present  in  the  degraded 
signal;  so 

0  g-l//  O^c  (5) 

The  bound  c  is  related  to  the  amount  of  the  noise  present, 
and  it  is  assumed  a  prion  known 

The  Tikhonov-Miller  regularization  (4J.  provides  a  se¬ 
cond  kind  of  a  prion  knowledge  To  regulate  iocoitj  the 
tradeoff  between  the  noise  magnification  error  and  the  re¬ 
gularization  error,  wc  propose  to  impose  an  upper  bound  vii 
the  norm  of  the  filtered  signal  L  f*.  where  L  is  the  li¬ 
near  regularization  opera* or  (here,  L  is  a  high-pass  fil¬ 
ter) 

a  L  f'  0“  SE  (6) 

The  bound  E  is  assumed  to  be  (approximately)  known  a 
priori 

Following  the  Miller  regularization  approach  (4 1  we 
combine  (5)  and  (6)  into  a  single  quadrature  formula 

iff)  .  i  *  -  II  t  i*  «  «  n  L  /  0*  (7) 

The  regularization  parameter  has  the  fixed  value  a=je/E)2 
Among  the  solutions  satisfying  (7),  a  physically  reasonable 
choice  is  the  solution  f ^  ,  called  the  Miller  regularized 
solution,  which  minimizes  the  functional  $1/  )  The  solu¬ 
tion  of  »his  minimization  problem  is  given  by  the  normal 

gallons  (//'//  *  o  tTI  I  f™  =  ll'g  (8) 
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Fig.  1.  The  original  signal.  F ig.  3.  Moore-Penrose  pseudoinverse 


Here,  the  solution  f  “  is  approximated  by  using  an  iterati¬ 
ve  method,  which  simultaneously  offers  the  possibility  of 
imposing  a  deterministic  constraint  on  the  solution. 

We  rewrite  (8)  as* 

r  *"'»  (i-«u'u  r“*  e  ht  (g  -  h  /“  )  =  at")  ») 

where  0  is  an  useful  relaxation  parameter  to  insure  the 
contractiveness  of  the  mapping  C. 

III.  AN  ADAPTIVE  T1KHONOV-MILLER  METHOD 

The  knowledge  about  .he  abrupt  character  of  the  origi¬ 
nal  signal  would  be  equivalent  to  introducing  a  square  se¬ 
lection  matrix  P  in  (4). 

It  P  f  +  n  *  g  (10) 

We  can  considered  P  as  a  Projection  Operator,  such  f 
is  a  fixed  point,  l.e  P  /  °  /.  We  define  the  Projection 
Operator  P  as 

P  •  B'1  F  B 
0 

where  B  is  a  basis  change  matrix,  such  that  Bf  represents  f 
in  the  basis 

B  a  (  d(n-i)-5(n-l-J)  >  V  i 
Pp  will  be  a  diagonal  matrix,  with  all  the  elements  equal 

to  zero  with  the  exception  of  those  corresponding  to  nonze¬ 
ro  values  of  B  f  Unfortunately,  P^  is  not  available  in 

practice  However  we  can  try  to  define  adaptively  in  each 
iteration.  So: 

/•  ”  •  B'1  P  “  B 
0 

For  the  new  system  (10).  the  algorithm  (9)  Is  given  by 

f™"  »(I-aO  llpflLP))/™  ♦  0  PT  flT  (  g  -  II  P  ) 
Considering  that  PT»P  and  that  we  need  to  define  a  se¬ 
lection  matrix  at  each  step,  we  arrive  at. 


Fig.  5.  Result  with  the  Tlkhonov-Mlller  method.  IflZO. 


(  1  ir  |  ft  (0  -  /e  (i-1)  |  >  Tm 

ro  <i,n  ’jo  ir  |  t"M  -  f"u-i)  |  s  rm 

whtre- 

T  »  max  {  T  .rnln|/  -S  |k«A> 

m  m-l  •  e  «  1 

y  being  a  empirical  constant,  0  <  y  <  1,  and  A  indicating 
the  set  of  points  selected  by 

IV.  A  NUMERICAL  EXAMPLE 
We  present  the  results  of  a  computer  simulation  to 
illustrate  the  performance  of  the  method  //(m.n)  corres¬ 
ponds  to  the  values  of  the  impulse  response  h(m-n)  of  a 
low-pass  filter.  Fig.l  shows  the  original  signal  /,  where 
the  only  nonzero  differences  are  /(40)-/(39)*2.  /( 46)- 
/(4S)-I.  /(56)-/(55)«-2.  and  /(6S)-/(64)«-I.  Fig  2  shows 
the  degraded  signal  g  °  II  f  *  n,  where  n  is  a  zero  mean 
Gaussian  white  noise  with  e=0  02  Fig  3  demostrates  how  the 
applications  of  a  classic  method  (non  regularized)  the 
Moore-Penrose  pceudoinverse  does  not  provide  a  useful 
result.  Fig  4  shows  the  final  result,  ,  applying  the 
proposed  method  with  =  0,  and  0  *  1.5.  Finally.  Fig  S 


/""  *u*«f  p  Vip  ”Vt"  ♦  e  p  "  h\-iip  "  O 

We  will  assume  here  a  contraction  of  the  projection 
domain  (l.e.,  the  nonzero  elements  of  P  must  be  a  sub- 
o 

set  of  those  of  P^°  )  Assuming  such  a  contraction,  we  can 

write* 

=  Pip  mf"  .  3  ii'  ig-iip  “  t™  n 

We  will  define  P^m  after  obtaining  f™  by  means  a 

threshold  procedure  proposed  by  Papoulis  and  Chamzas  in  an 
extrapolation  context  (SI 


shows  the  result  obtained  applying  the  version  iterative  of 
Tikhonov-Miller  method  (9)  without  the  Adaptive  Operator 
We  can  see  the  improvement  introduced  by  the  new  adaptive 
regularized  method. 
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FEASIBILITY  OF  LYAPUNOV  FUNCTIONS 
FOR  POWER  SYSTEM  TRANSIENT  STABILITY  ANALYSIS 
BY  THE  CONTROLLING  U.E  P.  METHOD 


A.  M.  ESKiCIOfiLU 
Computer  Engineering  Department 
Middle  East  Technical  University 
06531  Ankara,  TQrlcey 

direct  -  Recently  a  great  deal  of  improvement  has  been  achieved  in 
reducing  the  inherent  conservativeness  of  the  direct  method  of  Lya¬ 
punov  in  its  application  to  the  transient  subibty  analysis  of  multi- 
machine  power  systems.  One  major  discovery  is  the  recognition  of  the 
fault  location  as  a  critical  factor  for  the  determination  of  the  boundary 
of  the  stability  region.  The  controlling  u.e  p.  method  tales  into  consid¬ 
eration  the  relevant  unstable  equilibrium  points  to  construct  Lyapunov 
surfaces,  local  approximations  to  the  stability  boundary,  giving  more 
accurate  estimates  In  this  paper,  three  common  Lyapunov  functions 
are  used  in  a  comparative  evaluation  to  determine  their  easibility  for 
the  local  Lyapunov  surface  approach.  It  is  shown  that  the  usefulness 
of  different  Lyapunov  functions  varies  greatly  in  various  cases  of  insta¬ 
bility  occurring  in  the  applications. 


I.  INTRODUCTION 

There  has  been  a  substantial  amount  of  research  iu  the  use  of  direct 
methods  for  the  transient  stability  analysis  of  multi-machine  power 
systems  in  the  last  two  decades  (1)*(4).  Among  the  various  techniques 
proposed,  Lyapunov’s  direct  method  has  attracted  much  attention  and 
proved  to  be  a  promising  tool  of  analysis  for  off-line  and  on-line  studies. 
Initially,  the  results  obtained  using  Lyapunov’s  direct  method  were,  in 
general,  practicably  too  conservative  in  spite  of  the  fact  that  numer¬ 
ous  Lyapunov  functions  were  developed  and  tried  The  main  reason 
for  this  conservativeness  was  later  understood  to  be  determination  of 
the  stable  region  in  the  state  space  independent  of  the  fault  location. 
In  this  original  approach,  commonly  called  the  closest  u  e.p  method 
(4],  a  global  approximation  to  the  sepaiatrix,  the  stability  boundary, 
is  obtained  by  constructing  a  single  Lyapunov  surface  using  the  un¬ 
stable  equilibrium  point  which  gives  the  minimum  value  of  the  chosen 
Lyapunov  function. 

To  alleviate  the  inherent  conservative  nature  of  the  closest  u  e.p. 
method,  recent  studies  were  focused  on  identifying  the  unstable  equi 
librium  point  corresponding  the  fault  under  consideration,  and  em¬ 
ploying  it  for  the  construction  of  the  local  Lyapunov  surface.  Referred 
to  as  the  controlling  u.e.p,  method  (4],  the  new  approach  provides 
a  local  approximation  to  the  separatnx,  producing  much  improved 
estimates  for  critical  clearing  times.  The  fundamental  problem  with 
this  method  is  the  difficulty  in  determining  the  controlling  u  e  p  ,  and 
several  methodologies  have  been  proposed  for  a  solution  (5]-(10)  It 
is  evident  that  both  the  closest  and  the  controlling  uep  methods 
give  the  same  result  only  when  the  closest  unstable  equilibrium  point 
happens  to  be  the  relevant  unstable  equilibrium  point. 

The  controlling  uep.  method  exhibits  a  close  similarity  to  the  method 
of  tangent  hyperplanes  fllj  or  the  method  of  tangent  hypersurfaces 
(12j,  AR  mate  use  of  the  so-called  type  1  points  in  constructing  local 
approximations. 

The  use  of  local  Lyapunov  surfaces  for  the  determination  of  transient 
stability  regions  is  repotted  in  a  previous  work  (13).  A  commonly  used 
Lyapunov  function  was  taken  and  the  results  obtained  by  the  local  and 
global  approximations  were  compared  Similar  investigations  with  dif 
ferent  Lyapunov  functions  are  also  available  m  the  literature  The 
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major  aim  of  this  study  is  to  make  a  comparative  evaluation  of  three 
well-known  Lyapunov  functions  with  the  purpose  of  determining  their 
feasibility  for  transient  stability  analysis  using  the  controlling  u  e.p 
method. 


II.  POWER  SYSTEM  MODEL 

In  the  following  analysis,  the  classical  power  system  model  is  used  In 
this  model,  the  state  equations  of  motion  are  mathematically  expressed 
as 

j,  =  u„  isl,...,n 

Ci,  =  1/M,(P„,  -  D.u,  -  E]G„ 

-££,£,y„costf1-;,-e„)|,  .  =  i„ 

i«i 

where 

6 ,  -  rotor  angle  of  the  ith  machine 

v,  ss  rotor  speed  of  the  ith  machine 

t  =  2*  ft  (/  =  system  frequency,  t  s  time  m  seconds) 

H,  =  inertia  constant  of  the  ith  machine 

Af,  =  (4t/)  R, 

A  =  (2*/)  d,  (d,  =  damj  mg  coefficient  of  the  sth  machine) 

Pm  =  mechanical  power  input  to  the  ith  machine 
Ex  =  constant  voltage  behind  the  direct-axis  transient 

reactance 

Y%)  =  modulus  of  the  i;th  element  of  the  reduced  system 

admittance  matrix 

0,,  =  argument  of  the  yth  element  of  the  reduced  system 

admittance  matrix 

0  =  d/ir 

It  is  stated  that  within  the  Lyapunov  context,  the  correct  mathemati 
cal  model  should  be  described  by  the  relative,  and  not  by  the  absolute, 
rotor  angles  and  speeds  [1]  Taking  the  nth  machine  as  reference  and 
defining  the  state  vector  as 

Z  =  ($ln>Jin,  ,$n-l.»,Wln,tJ3A,  .  ,U»«-i,,)r 

the  above  equations  lead  to 

—  <^n,  I  ”  1 »  ,R—1 

W,n  =  /,{£,«) 

=  l/Af,(P„,  -  £?G„)  -  -  PjG,,) 

-l/M,(£,E«y„  cos(J,,  -  e„) 

+ ”f;£,£,y,,  £<*(«„ -jjn-e„)| 

j-i 

)*> 

+1/M.E  £,£»y,n  CO S(<„  +  0,.)) 

1*1 

»  =  1,  ,«  -  1 

where  C  =  J?,/Af,  for  all  i 
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III.  THREE  COMMON  LYAPUNOV  FUNCTIONS 


For  (17], 


The  three  Lyapunov  functions  considered  for  comparison  are  widely 
used  and  frequently  discussed  in  the  literature.  They  are 

P|  =  Vil  +  P«+P»  (H), 

-  where 

v.l  =  E  E  1/2 

IB] 

VK  = 

IBl  JS1+I 

V,S  =  -E  E  (cos(<,.-<).)-c«(C-i;.)] 

tal  Jii4l 


Vi  ~  Vix  +  Vn  -r  Vn  (15) , 

where 

v»  -  E  E  u^f 

ml  ;ai+l 
n-1  n 

Vji  =  -E  E 

ml  jaitl 

^  =  -EE  UE,E,B„ 

ial  ,o.+l 

Ps  =  V«  +  VM  +  Va3  (16), 

where 

V),  =  1/2  Em, w?, 

... 

=  -E(P„.  -S?C„K4,-%) 

»=t 

n-i  n 

V»  =  ~E  E  S.Ejfl.;  [cO^A,,,  -  A„)  -  VX{i'„  -  A/j] 

in]  ;«-i+I 

In  the  above  expressions,  superscript  s  denotes  value  of  the  angle  at 
the  stable  equilibrium  point,  and 

c.,  =  A1,(P„,  -  E;C„)  -  bU(P„,  -  E]G„) 

M  =  E  Af, 

»*: 

BtJ  =s  susceptances 


It  should  be  noted  that  Vj  has  already  been  used  in  (13].  However, 
in  the  modulus  of  the  ij-th  element  of  the  reduced  system  admittance 
matrix  transfer  conductances  were  not  assumed  to  be  zero,  producing 
rather  different  results  especially  for  cases  of  multi-generator  instabil- 
ity  It  should  also  be  noted  that  the  Lyapunov  function  given  in  (5), 
which  is  aiso  common,  has  been  shown  to  be  identical  to  Vj  and  hence 
can  be  assumed  to  give  the  same  results  reported  for  Pi 

To  account  for  transfer  conductances,  Gl}l  the  expressions  for  the  first 
Lyapunov  function  can  be  ’corrected*  by  assuming  a  linear  faulted 
tr&jMory  in  the  state  space  and,  hence,  adding  a  fourth  term 


V,<  =  E 

ml  j=»+l 

(<„- A*. -(«.-«,)) 

-Af.  E  -  A,,)  -  dn(<;.  -  AJ.)] 

lei 

[A.„  -  Aj*  -  w.  -  a;„)]/[a,,  -  h,  -  (*;.  - «;,)]] 

A  correction  term  for  the  Lyapunov  function  P2  is  derived  in  (18)  as 

=  -E  E  (Af  -  '>MJ)EiEJG,j 

,  ISl  J*I+1 

H«,‘.  -  t;n)(e;„  -  a;,  -  (A,„  -  a,,)) 

-w«;.-g-*(4.-g)] 

This  term  is  obtained  by  the  exact  integration  of  a  part  of  the  non- 
integrate  terms  hither  to  neglected. 

Finally,  the  fourth  term  for  the  Lyapunov  function  p$  is  taken  to  be 
(16] 

yM  =  -E  E  fte,ft,(«in(A,.-AJ.)-!in(A,‘,-A;.)] 

ml  jo.+l 

The  effect  of  adding  the  above  terms  in  the  expressions  for  the  given 
functions  is  also  checked  for  the  test  systems  used  in  this  work  to 
establish  their  validity. 


IV.  APPLICATION  TO  TEST  SYSTEMS 

Two  power  systems  are  used  m  the  application  of  the  controlling  u  e  p 
method  together  with  the  closest  u  e.p  method.  The  smaller  is  a  4- 
generator  system  (16)  which  has  extensively  been  tested  in  numerous 
studies.  The  other  system  (19]  is  provided  by  the  South  of  Scotland 
Electricity  Board  and  has  12  generators. 

Stability  characteristics  of  the  above  systems  are  investigated  by  ap¬ 
plying  a  3-phase-to  ground  fault,  one  of  the  severest  that  a  power 
system  may  be  subjected  to,  at  each  of  the  generator  terminal  buses. 
The  fault  is  assumed  to  be  cleared  without  line  tripping.  Because  of 
the  properties  of  the  classical  model,  only  the  first-swing  instability  is 
taken  into  considera’ion  The  value  for  uniform  damping  is  assumed 
to  be  zero.  This  is  the  usual  choice  as  accounting  for  uniform  damping 
improves  the  quality  of  systems  stability  but  not  its  quantity  (l).  How¬ 
ever,  the  analysis  presented  in  this  paper  was  also  carried  out  using 
a  nonzero  value  for  damping.  Because  of  space  limits,  the  results  are 
not  included  here  As  a  passing  remark,  it  can  be  said  that  they  are 
slightly  worse  than  those  for  zero  damping 
4.1  Results  for  4« Generator  System 

Three  type  1  (unstable)  equilibrium  points  are  found  for  tne  system. 
They  represent  the  modes  of  instability  for  generator  1,  2  and  3  The 
estimates  computed  for  critical  clearing  times  (CCT’s)  are  listed  in 
Table  l.  CCTi’s  and  CCTff’s  denote  the  CCT’s  estimated  by  the 
controlling  u  e  p  method  and  the  closest  u.e  p.  method,  respectively. 


Table  1.  Critical  Clearing  Times  for  4-Generator  Syttezn 
(a)  Estimates  of  the  Critical  Clearing  Times  by  using  V_ 


Faulted 

Bus 

^*7 

Actual  CCT 
<*«> 

CCT, 

(sec) 

Error 

(%)- 

CCT, 

(sec) 

Error 

(54) 

1 

137755* 

0.557 

0.500 

-10  2 

0.500 

-10  2 

2 

170267 

0.441 

0434 

-1.9 

0.391 

*11.3 

3 

216719 

0.491 

0483 

-1.6 

0.396 

-19.3 

(b)  Estimates  of  the  Critical  Clearing  Times  by  using  Vj 


Faulted 

Bus 

V„ 

Actual  CCT 
(sec) 

CCT i 
(sec) 

Error 

(54) 

CCT, 

(sec) 

Error 

(55) 

1 

137246* 

0.557 

0.511 

-8.3 

0.511 

-8.3 

2 

160721 

0  441 

0429 

•2.7 

0402 

-8.8 

3 

195373 

0491 

0476 

-3.1 

0.414 

-15.7 

(c)  Estimates  of  the  Critical  Clearing  Tinea  by  using  V3 


Faultad 

Bus 

V., 

Actual  CCT 
(sec) 

CCT, 

(sec) 

Error 

(54) 

CCT, 

(«c) 

Error 

(%) 

1 

1.6585* 

0.557 

0  493 

-10.5 

0.498 

-10  5 

2 

2.1518 

0441 

0.433 

-1.8 

0  390 

-11.6 

3 

2.7239 

0491 

0.483 

-16 

0397 

•19.3 

*  Mirim  tm  ralae  of  tbe  L,»p«aor  function. 


4.2  Results  for  12-Generator  System 

Eleven  type  1  points  exist,  one  of  which  corresponds  to  the  cluster 
of  generators  from  1  to  10  losing  synchronism  with  the  other  two. 
The  remaining  tea  points  represent  the  geaeiators,  excluding  5  and  7, 
falling  individually  out  of  step  with  the  rest  of  the  system.  The  CCT 
estimates  are  shown  in  Table  3. 


Table  3.  Critical  Clearing  Times  for  12-  Generator  System 
(a)  Estimates  of  the  Critical  Clearing  Times  by  using  V\ 


Faulted 

Bus 

Vn 

Actual  CCT 
(see) 

CCT, 

(sec) 

Error 

(%) 

CCT, 

(sec) 

Error 

(S) 

1 

161597“ 

0.117 

0113 

•3  4 

0.113 

-3  4 

2 

903737 

0.278 

0.274 

-14 

0.138 

-504 

3 

196096 

0301 

0  347 

15  3 

0326 

8.3 

4 

808543 

0.387 

0.371 

-4.1 

0.207 

-46.5 

5 

171611 

0.287 

0.142 

-50.2 

0.139 

-51.7 

6 

82S5S2 

0311 

0326 

4.8 

0.185 

*40.5 

7 

171611 

0  221 

0.111 

-498 

0.108 

-51.1 

8 

495784 

0324 

0.342 

56 

0.236 

•27.2 

9 

899148 

0300 

0.302 

07 

0.144 

-520 

10 

242594 

0401 

0  295 

-26.4 

0.253 

-36.9 

11 

6457812 

0278 

0  318 

14.3 

0063 

-77.3 

12 

8348150 

0311 

0  355 

14.1 

0062 

-80.1 

Examining  the  results  m  Table  1  leads  to  the  pnm«iy  conclusion  that 
local  Lyapunov  surfaces  yield  always  better  approximations  through 
which  much  improved  CCT  estimates  are  obtained.  It  is  noted  that 
very  similar  result*  are  obtained  by  using  the  three  Lyapunov  func¬ 
tions.  This  can  be  explained  by  the  site  and  simplicity  of  the  power 
system  which  does  not  exhibit  multi-generator  instability. 

Next,  the  fourth  terms  are  included  in  the  functions  and  the  estimates 
for  CCT’s  are  recomputed  as  shown  in  Table  2.  It  is  interesting  to 
see  that  the  additional  term  in  the  first  two  functions  brings  about  an 
improvement  in  both  CCTr’s  and  CCT,’s,  the  smallest  errors  being 
provided  by  Vj.  Strangely,  the  inclusion  of  the  fourth  term  in  Vj  leads 
to  relatively  worse  estimates. 


Table  2.  Critical  Clearing  Times  for  4-Gcnerator  System 


(a)  Estimates  of  the  Critical  Clearing  Times  by  using  V\  and  Vn 

Faulted 

v* 

Actual  CCT 

CCT, 

Error 

CCT, 

Error 

Bus 

(sec) 

(sec) 

(tec) 

w 

1 

132391* 

0.557 

0502 

•9.9 

0  502 

-9  9 

2 

172051 

0441 

0  437 

.09 

0394 

-10.7 

3 

215030 

0491 

0  489 

-04 

0404 

-17  7 

(b)  Estimates  of  the  Critical  Clearing  Times  by  using  Vj  and  Via 

Faulted 

V* 

Actual  CCT 

CCT, 

Error 

CCT, 

Error 

Bus 

(sec) 

(sec) 

(%) 

(sec) 

i%) 

1 

146499“ 

0557 

0  519 

-6.8 

0519 

-6.8 

2 

181546 

0.441 

0.443 

0.5 

0.406 

-7  9 

3 

217816 

0  491 

0.488 

-06 

0418 

-14.9 

(c)  Estimates  of  tbe  Critical  Cleanse  Times  by  mine  V,  and  V„ 

Faulted 

V„ 

Actual  CCT 

CCT, 

Error 

CCT, 

Error 

Bus 

(sec) 

(sec) 

(%) 

(see) 

(%) 

1 

1  66349* 

0  557 

0.490 

-12  0 

0.490 

•  12  0 

2 

2.14772 

0441 

0  428 

-2  9 

0  385 

-12  7 

3 

271009 

0491 

0.470 

-4.3 

0.383 

-22.0 

(b)  Estimates  of  the  Cniical  Clearing  Times  by  using  Vj 


Faulted 

Bus 

V„ 

Actual  CCT 
(sec) 

CCT, 

(sec) 

Error 

(%) 

CCT, 

(sec) 

Error 

(%> 

1 

394067 

0.117 

0126 

77 

0110 

-6  0 

2 

1701419 

0278 

0  303 

90 

0,127 

-54  3 

3 

267375* 

0301 

0  324 

76 

0  324 

76 

4 

1262844 

0.387 

0  354 

-8  5 

0.163 

-57  9 

5 

1752938 

0.287 

0  315 

9.8 

0.145 

-49.5 

6 

1275566 

0311 

0  342 

10  0 

0  185 

-35.5 

7 

1752938 

0  221 

0.257 

16  3 

0.114 

-4S4 

8 

792931 

0  324 

0.344 

62 

0.224 

-30  9 

9 

1542023 

0.300 

0  304 

13 

0.126 

-58  0 

10 

425643 

0.401 

0.270 

■32.7 

0.219 

-45.4 

11 

5358124 

0278 

0  304 

94 

0  083 

-701 

12 

6499550 

0311 

0.336 

80 

0086 

-72  3 

A  comparison  of  the  estimates  m  Table  3  shows  that  with  only  one 
exception,  local  approximations  again  provide  much  more  accurate  es¬ 
timates  than  the  global  one.  Aa  reported  earlier  (13],  CCT,'s  obtained 
by  Vi  is  considerably  pessimistic  and  almost  equal  to  CCT/s  for  the 
cases  of  multi  generator  instability  The  use  of  Vj,  however,  not  onl> 
eliminates  this  inefficiency,  but  also  produces  quite  uniform  and  rea¬ 
sonable  errors  in  CCTj’s  (This  uniformity  is  also  ooserved  to  some 
extent  in  the  analysis  of  the  4-generator  system}.  For  i  ase  10,  d  *ea 
sonable  estimate  could  not  be  obtained  either  by  V,  or  Vj  There  seems 
to  be  no  apparent  reason  for  such  anomaly.  The  estimates  computed 
by  the  method  of  hypersurfaces,  for  instances,  are  not  worse  that  those 
for  other  cases  It  was  no.  possible  to  use  V3  for  the  12-gcnerator  sys¬ 
tem  because  it  assumed  negative  values  at  all  type  1  points  Even 
though  the  function  itself  was  found  to  be  monotonically  increasing 
in  the  neighborhood  of  the  stable  point,  neither  the  closest  nor  the 
controlling  u  e  p  method  was  applicable 

The  effect  of  including  the  fourth  terra  in  ach  Lyapunov  function, 
given  in  Table  4,  is  not  positive,  in  general  The  most  significant  con¬ 
tribution  of  Vj«  is  that  the  over- pessimistic  CCTj’s  are  removed  and 
replaced  by  highly  accurate  ones  The  term  V24  does  not  seem  to  con¬ 
tribute  much  to  the  improvement  of  CCT;  in  fact  the  uniformity  of 
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resells  is  *  little  lost  Nevertheless,  it  is  interesting  to  note  the  reduc¬ 
tion  of  error  in  CCT/s  by  about  5  percent  on  the  average.  This  must 
normally  be  due  to  the  inclusion  of  the  effect  of  transfer  conductances 
rendered  by  the  term.  The  function  V3  together  with  its  fourth  term 
took  on  negatise  values  at  the  type  1  points  again,  and  thus  could  not 
be  evaluated. 

Table  4  Critical  Clearing  Times  for  4-Generator  System 
(a)  Estimates  of  the  Critical  Clearing  Times  by  using  Vi  and  Vu 
Faulted  Vn  Actual  CCT  CCT/  Error  CCT,  Error 


Bus 

(s«) 

(sec) 

(%) 

(sk) 

(X) 

1 

240985 

0.117 

0.111 

-  5.1 

0  096 

-17.9 

2 

919S58 

0278 

0.23S 

-14  4 

0.102 

*63.3 

3 

164018* 

0.301 

0  278 

•7.6 

0.278 

•7.6 

4 

752599 

0-3S? 

0  292 

-24.5 

0.127 

-67  2 

5 

1253517 

0  287 

0.283 

•1.4 

0116 

•59  6 

6 

733336 

0311 

0.2S6 

'8  0 

0.152 

•51.1 

7 

1253517 

0.221 

0,230 

4.1 

0.091 

*58.8 

8 

444016 

0  324 

0.2S2 

•13  0 

0.185 

*429 

9 

891269 

0  300 

0245 

•18  3 

0.102 

•66  0 

10 

255495 

0  401 

0.218 

-450 

0177 

-55  9 

11 

4I6S475 

0.278 

0.278 

00 

0  066 

-76  3 

12 

5173594 

0311 

0310 

-03 

0.068 

-78.1 

Vj  does  not  improve  CCT/’s  m  general  but  reduces  the  error  in  most 
of  CCT /s.  Function  V3  with  Vm  was  again  not  applicable. 

4)  On  the  whole,  V2  is  understood  to  be  the  most  appropriate 
function  among  the  ones  examined  here  in  generating  local  Lyapunov 
surfaces. 

It  is  apparent  that  the  performance  of  different  Lyapunov  functions  in 
transient  stability  analysis  of  power  systems  varies  quite  considerably. 
This  performance  difference  is  exhibited  especially  when  using  the  new 
approach,  the  controlling  a  e  p.  method.  The  major  conclusion  to  be 
drawn  out  of  this  study  js  that  although  the  use  local  Lyapunov  sur¬ 
faces  decreates  the  conservativeness  of  the  direct  method  of  Lyapunov 
substantially,  the  controlling  u-e  p.  method  cannot  always  be  employed 
as  an  effective  and  reliable  tool  as  it  is  understood  that  its  accuracy 
varies  a  great  amount  in  different  cases  of  instability  occurmg  in  power 
systems. 
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Another  interesting  observation  concerning  Table  4  is  the  location  of 
Korn-  For  Vx  this  minimal  value  is  obtained  at  the  type  1  point  as¬ 
sociated  with  case  1  When  Vji  is  added,  or  when  V2  or  Vj  4.  Vu  is 
considered,  however,  VWa  i*  computed  at  the  type  1  point  of  case  3. 


V.  CONCLUSIONS 

Three  common  Lyapunov  functions  are  compared  in  ,he  application  of 
the  closest  and  controlling  u  e.p.  methods,  which  give  global  and  local 
approximations  to  *he  separatrix.  The  results  obtained  using  two  test 
systems  can  be  summarized  as  follows. 

1)  For  the  smaller  test  system,  ail  the  functions  provide  similar 
results.  The  inclusion  of  the  forth  term  improves  the  estimates  CCT/’s 
and  CCTj’s. 

2)  For  the  largei  test  system,  function  Vx  produces  over-pessimistic 
CCTi’s  (which  are  approximately  equal  to  the  corresponding  CCT,’s) 
in  the  cases  of  multi-generator  instability  Through  function  Vj,  how¬ 
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3)  When  the  functions  ai  corrected  with  the  additional  terms, 
no  general  improvement  is  gamed.  An  important  contribution  of  this 
correction  in  V\  is  the  remarkable  refinement  of  the  estimates  for  cases 
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Abstract  -  The  design  of  a  digital  communication  system  is  the  result 
of  some  trade-offs  between  several  system  characteristics.  Fora  well 
defined  system  an  optimum  design  is  usually  difficult  to  achieve 
analytically.  For  that  reason  it  is  usual  to  resort  to  the  help  of 
appropriate  software  simulation  and  numerical  optimization  tools 
The  problem  is  further  complicated  if  a  complete  characterization  of 
the  transmission  channel  is  not  known,  as  is  (he  case  when 
designing  a  system  to  operate  m  the  customer  access  connection 
(CAC)  or  in  the  customer  premises  network  (CPN).  This  paper 
presents  a  software  package  SPOT  ( Simulation  Package  and 
Optimization  Tool ),  that  calculates  the  optimally  tolerant  equalizer 
for  a  digital  transmission  system  with  unknown  channel  length  in  the 
absence  of  noise.  In  addition,  a  case  study  corresponding  to  the 
CPN  environment  is  presented, 

Li  Introduction 

The  equalizer  of  a  digital  transmission  system  is  designed  to 
compensate  for  the  distortion  introduced  by  a  particular  channel 
length  However,  in  most  applications,  such  as  the  connection 
between  a  local  exchange  and  a  subscriber,  the  length  of  the  channel 
is  unknown  whereas  different  subscribers  can  be  at  different 
distances  from  ihe  local  exchange.  In  this  case  it  is  usual  to  resort  to 
the  use  of  adaptive  equalizers,  but  this  can  be  a  costly  option  An 
alternative  approach  consists  in  using  an  optimall>  tolerant  fixed 
equalizer. 

This  paper  is  concerned  with  the  design  of  an  equalizer 
optimally  tolerant  to  channel  length  variations  In  section  2.  a  brief 
description  of  the  software  tool  used  to  simulate  the  digital 
transmission  system  is  made  In  section  3,  the  optimization  problem 
is  stated  a.id  the  algorithm  used  is  described.  Some  results 
concerning  ihe  CPN  environment  are  presented  in  section  4  and  in 
section  5  a  bnef  summary  is  made 


A  software  package  SPOT  {Simulation  Package  and 
Optimization  Tool}  is  used  to  calculate  ihe  optimally  (oleum 
equalizei  foi  a  digital  transmission  system  with  unknown  channel 
length  in  ihe  absence  oi  noise.  The  model  ol  ihe  digital  transmission 
system  implemented  by  SPOT  is  presented  in  Fig.  1 


chosen  between  the  elements  of  a  vast  set  which  includes 
Manchester,  CMI,  5B6B,  NRZ,  Walsh-2  and  others 

SHAPING  FILTER  The  pulse  shaping  filter  defines  the 
format  of  the  transmitted  pulses  Two  different  shaping  filters  were 
included  in  our  simulation  system,  a  raised  cosine  with  user- 
definable  roll-off  and  an  ideal  low-pass  filter. 

TRANSMISSION  CHANNEL  +  EQUALIZER.  The 
transmission  channel  introduces  distortion  in  the  signal,  hindering 
the  correct  decoding  at  the  receiver.  This  distortion  must  be 
compensated  by  the  equalizer. 

Several  types  of  transmission  channel  can  be  accommodated. 
Presently  SPOT  simulates  two  types  oi  transmission  channel, 
coaxial  cable  and  optical  fiber  Uj.  The  characteristic  parameters  of 
the  channel  are  normalized  to  the  system  bn  rate  The  models  of  the 
normalized  frequency  responses  of  the  channel,  H(f).  and  equalizer, 
E(f),  are  giver,  by: 


AvTTIwj) 


SC  (I) 


i)  Coaxial  cable: 

H^O-e" 

ii)  Optical  Fiber 


Ho(0  =  e  and  ” c  (2) 

where  A»is  the  channel  distortion  index  ami  Ae  is  the  equalizer 
index 

These  equalizers  are  not  realizable.  Nevertheless,  they  are  used 
as  targets  in  most  realization  methods. 

AC-COUPL1NG.  An  high-pass  filter  models  the  AC-coupling 
effects  in  the  receiver,  which  can  not  be  fully  compensated  by  the 
equalizer.  An  nth  order  high-pass  filter  was  implemented  as  a 
cascade  of  multiple  first-order  high-pass  filters 

Two  parameters,  extracted  from  the  eye  diagram  (ED),  the 
aperture  penalty  P0  and  the  jitter  penalty  Pr  are  used  to  assess  the 
performance  of  the  simulated  system.  The  aperture  penalty  measures 
the  decrease  of  the  vertical  eye  opening  introduced  by  intersymbol 
interference  rfSly.  The  jutei  penally  measures  the  unwanted  pulse- 
postuon  modulation  oi  ihe  recovered  clock,  introduced 
simultaneously  by  ihe  shaping  filtei  and  1S1  Fig.  2  illustrates  a 
typical  eye  diagram _  _ _ _ 
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Figure  1  >  Model  of  a  digital  transmission  system 

ENCODER  The  encoder  maps  the  input  digital  sequence  { Av.} 
into  the  transmitted  digital  sequence  |Bt)  The  line  code  can  be 

*  This  work  was  sypponed  by  RACE  project  R  1052  SPOT  {Signal  Processing 
for  Optical  and  C or  dies*  transmission  ) 


Figure  2  -  Typical  eye  diagram 

The  aperture  penalty  P0  is  given  by 

Po=20log1((o)  (3) 

where  0  is  the  worst-case  normalized  ED  vertical  opening  When 
Ae  =  At  and  there  are  no  AC-couphng  effects,  P0  equals  0  dB 


P^IOO. 


<*) 


where  cfxj  represess  Ac  'Aroic&d  crwacagf  aasafeED,^ 
is  Ac  average  of  e(xj.  2ad  jvt*}  »  Ac  ptofcafccky  dcsssy  faeoca  of 
e(xj.  THs  ctsrespoads  10  Ac  rxts.  of  Ae  sepal  easssxttt  Arocgh 
a  tw  definable  AresboSd. 


?-Oy~i72~m 

3.1)  Sarenxm  of  Ae  problem 

Tbc  transmission  cfciaacl  kngA  is  asssscd  to  vary  btaca 
uppre  and  lower  limrts.  L,aad  L*.  wiA  a  given  probaixby  density 
fcacrioo  p(I)_ 

In  order  10  characterize  Ac  combined  yxxt  and  151  dbuxrka 
introduced  by  a  particular  channel  ksgA.  a  cocpotad  penalty 
measure  is  defined  as 

E.'U'Po+fl -u.jp,  (J, 

Here  u>  is  a  weighting  factor,  describing  the  rdarivc  is^onaacc  of 
jitter  2nd  IS!.  This  parameter  is  dependent  00  the  actsal  hardware 
implementation.  For  example,  a  system  with  a  degraded  clock 
recovery  circuit  must  be  assigned  a  lower  wlae  of  u*.  Vsual  values 
ofw  will  be  near  05. 

A  global  error  metric,  the  average  compound  penalty  E*.  is 
defined  as  the  average  of  the  compound  penalties  over  the  range  of 
permissible  channel  lengths,  ie : 


unmiiion; 

The  optimally  lolerarj  equalizer  is  the  one  with  an  index  A*  that 
minimizes  the  value  E*> 

3.2)  Iterative  Algorithm 

At  the  onset  of  this  work  the  main  requirement  for  the  numeric 
optimization  algorithm  was  robustness.  A  particular  variation  of  a 
constrained  direct  search  method  with  random  jumps  |2)  was 
developed.  The  iterative  procedure  is  started  with  an  heuristic  first 
approximation  and  with  a  high  step  value.  The  algorithm  searches  a 
better  approximation  to  the  optimum  equalizer  using  essentially  the 
following  rules: 

i)Try  the  equalizer  A(nAto+S  (A**  is  the  current  choice  of 
optimum  equalizer  and  S  ts  the  current  step  value),  and  calculate 
E*(A,) 

11)  If  Aeis  a  better  equalizer  than  AM,  then  Ac  is  the  new 
choice  of  optimum  equalizer,  S  is  reduced  to  S/2,  and  point  <;  is 
repeated. 

m)  If  A*  is  not  a  better  equalizer  than  Am,  then  step  $  is 
changed  to  -S/1.5.  and  point  i)  is  repeated. 

iv)  Randomly  (about  once  each  three  iterations)  the  step  size  is 
subject  to  a  random  change  (about  one  fifth  ns  present  value),  with 
the  objective  of  preventing  local  minima. 

v) The  above  rules  are  used  with  two  exceptions.  An 
acceptability  threshold  was  established  for  preventing  the  design  of 
an  equalizer  that  would  be  in  an  average  sense,  or  in  a  particular 
sense,  a  very  bad  one.  if  any  E1C  or  particular  Ec  exceeded  a 
predefined  value  the  equalizer  was  rejected  (in  these  cases,  either 
another  line  code  should  be  tried,  or  adaptive  equalization  should  be 
considered).  Furthermore,  the  value  of  the  optimally  tolerant 
equalizer  index  was  constrained  to  the  range  of  channel  distortion 
indexes  defined  by  the  user. 

vi)  The  above  rules  arc  performed  until  a  stop  condition  is 
reached.  TJis  could  happen  either  by  reaching  a  predefined  accuracy 
ora  predefined  number  of  iterations. 

vii)  The  iterative  procedure  (steps  1  to  vi)  is  always  run  twice. 
This  guarantees  a  further  security  against  the  occurrence  of  local 
minima. 

The  proposed  algorithm  performs  m  average  as  the  classical 
F°.S.c,?,  5  *l£pnthm  (5),  The  interval  reduction  performed  by 
the  Golden  Rule  is  about !  61  .Although  not  oriented  to  an  interval 
reduction  approach,  this  algorithm  performs  a  reduction  between  !  5 
and  2  m  each  iteration,  with  appropriated  chosen  starting  step. 


4  -  ATTfcyVas  »  Cf*y 

week  «  ok  cd  Ac  RACE  pswret  cmncJ  n-i  tie  aa«r 
psrmares  eewd:  E-1035  Cr'N*  (Cuxcrer  ftewa  Xrr*trU 
Has  projra  deaf*  Ac  ssrrmsri  ao  CPN  sa=£*re  *a  Ac  Uxzsz 
fcarg^  Breads  CceacxLL^^ 

assre^rezxs  res  tremsreasrea  csreg  ccoxhl  cubic.  155.52  JQwfcct 
w  race.  CVU  Has  © o£eg  zed  tic  cucacgzacc  wsh  CCTT7 
rcroca=e=dgxxs  a?CP  IH  SU035  oxxdces  a  ssrxaasa 
ciasaracmac  zrarasrejaa  rf  12  f3.«n^xofcg»  2  sacasa 
ks&x  Lj  s  15CVa.  A  easemaa  fcegA  L|  *  SO  o  »  sssssxd.  niteb 
csxTtsTctsds  ajsxw~eiy  ao  a  daseasaric  KK==r»a  of  1  d3 
R.1035  cscs  aa  S  d3  cqeafizer  |3J_ 

ta  Ac  saarfasoB  paces*,  the  caasaearf  poise  sSax  ww 
oodeBed  as  a  -l&Z  roO-cdT  eased  cctsac.  AC  cocp&cg.  wxS  a  cw- 
off  frequency  ©f  150  KHz.  was  also  cxxsndcredL  Tbs  waoc  as 
dcariy  wen^asc  oca  so  were  faced  oeg^Scu  b  was 

zLoaggasd&sefcsredlec^aregpy^ 
is  was  feed  tbt  a  sanriiaoa  wrh  N  ■  20  rf-- — r's  provided  a 
rcaccaNcresoIericc. 

Tbc  resets  obtained  are  preseeccd  sa  Vrp,  3.  As  c  eta  be  scea 
Ac  opcsca  cqeaSzcrsbocid  pccsca  * 

pr3=xrergre«r  to  Il.5dB.rf  tvrexwcabiy 

lower  Aaa  &h  of  Ac  ED  versed  epesag  {w  >  0L$X 

Tbc  average  compocad  pesa&cs  ofcaaed  wtA  RJ035  Si3 
cqtahza  aad  ocr  opereem  eqoaErea  are  abo  dq?*reaJ  a  fig.  3.  Foe 

w»0.9.ucscbcscea  that  AcEjeof  Aefirrtoocis  Ufcgbrr  This 
cocrespoeads  to  aa  average  dS  ED  venxal  cfxxaag  peaahy.  tf 

two  Qisrems  present  do  jscee  peazby. 

It  is  wxxth  to  mecaoa  Aai  Acsc  results  were  oboiaed 
asstmsmg  sa  terfonn  dissribemoo  for  the  leegAs  of  Ac  traasresssaoa 

channel 

■*'  A)  cpfrcaZy  trVrzrs  igabw 
'*•  B;  tic  -  optnxr\ 


wo£hong  factor  n> 


^.Cooclusion 

A  Simulation  Package  and  Optimization  Tool  enabling  the 
design  of  an  equalizer  optimally  tolerant  to  channel  lenght  variations 
was  presented.  An  application  example  related  to  the  customer 
premises  network  environment  was  discussed.  It  was  founo  that  the 
use  of  an  optimally  tolerant  equalizer  can  improve  significantly  the 
performance  of  a  digital  transmission  system  with  a  variable  channel 
length. 
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Aistrzcl  -  The  prcblem  of  rcq^rzietric  es- 
tiaatica  of  disccstissscs  fmetiens  is  con¬ 
sidered.  In  this  case  conventional  methods 
are  inefficient  because  they  lead  to  the 
excessive  smoothing  near  points  of  disconti¬ 
nuity.  To  solve  the  problem  we  suggest  a  spe¬ 
cial  snccthing  method  based  on  L^-aininiza- 

ticn  with  following  I^-aaxinl ration.  The  me¬ 
thod  can  have  saay  applications  in  signal 
and  irage  processing. 

1-  INI RODGCTICS 

Noise  snoothing  is  a  widely  used  operation 
in  signal  and  inage  processing-  The  choice  of 
the  appropriate  algorithm  is  determined  by 
the  bind  of  noise,  by  the  class  of  processed 
signals  and  by  the  statement  of  the  probles. 
In  this  paper  the  nor.pararetric  estimation  of 
discontinuous  functions  will  be  considered. 

When  the  signal  or  inage  is  described  by  a 
discontinuous  function  one  often  needs  to  re¬ 
cognize  points  (curves  in  two  dirensicnal 
case)  of  discontinuity  and  to  determine  their 
positions  and  values  of  jumps  as  accurate  as 
possible.  In  this  case  conventional  sethods 
are  irefficient.  Ke  suggest  a  special  snoo¬ 
thing  method  based  on  the  using  of  ^-minimi¬ 
zation. 


cm  the  set  of  all  functions  satisfying  to 
condition  1-  It  can  be  proved  that  under  some 
conditions  $  consists  of  sore  than  one  ele¬ 
ment.  Denote 

Suppose  the  function  f  (x)  is  estimated  by 
F(x,€)  for  which  the  next  two  conditions  are 
satisfied:  1)  P(x,e)  belongs  to 
2)  fl^(F)  *  sax  J^,(g),  where  sax  is  taken  by 

all  g£$. 

The  estimator  F  is  very  convenient  for  the 
estimation  of  discontinues  functions.  The 
first  step  (minimization  of  Q})  provides  the 
smoothness  (or  more  exactly  nonroeghness)  of 
the  estimate.  The  second  step  (maximization 
of  Q^)  prevents  from  the  excessive  smoothing 
near  points  of  discontinuity.  Note  that  the 
number  a  plays  a  role  of  smoothing  parameter 
(see,  for  example,  (1)). 

The  cube  £ Yj—CC, ,  i*l,  2,  n,  can 

be  changed  into  the  ball  with  the  center 
(YJ#  Yr)  and  some  radius  R  (smoothing 

parameter). 

Nov  let  us  assume  that  there  is  a  sequence 
of  finite  sets: 


2.  THE  BASIC  IDEA 

The  underlying  idea  of  the  method  is 
as  follows.  Consider  the  regression  problem 
where  we  have  observations  Yj  at  points  xJ# 
i-1,2,  ...,  n, 

V  f(V  +  ' 

where  f  is  the  unknown  function  to  be  estima¬ 
ted,  €-(£1#  is  a  vector  of 

errors  (for  simplicity,  we  will  assume  that 
f  j  arc  independent  and  identically  distribu¬ 
ted  random  variables  with  zero  mean) .  If  the 
distribution  of  is  known  one  can  show 
positive  a  and  £  such  that 
P(-a<€j<0)  >  1-C 
for  any  e>0,  or  in  other  words, 

P  (Yj -£<f  (Xj )  <YJ4tf )  >i-c . 

Assume,  for  simplicity,  that  ^  has  a  symmet¬ 
ric  distribution.  Then  Let  us  introduce 
a  measure  of  roughness  of  any  n-dimensional 
vector  b*(bj,  b2,  bft)  as 

°i“»  -"§| ibrb,.ii 

Consider  the  class  4»  of  all  functions  g  such 
that  1)  Y1-a<g(xl)<Yj4a  and  2)  g  minimizes  Q1 


Vxn . xln1)' 

such  that 

VlSXi ,  i-1,  2,  ...,  and  their  limit 


a 


is  dense  on  the  function  f  domain  of  defini¬ 
tion  which  assumed  to  be  closed  and  without 
isolated  points.  Suppose  that 

P(l?,l>t)*0(c'ct),  t  ■«. 

It  is  proved  that  under  these  conditions 
there  exists  a  sequence  ctj,  ct2,  ...,  such 
that  the  sequence  of  estimators 
P(x,aj),  P(^»a2)/  is  consistent  for  any 

point  of  continuity.  If  ^  are  bounded  then 

the  sequence  of  estimators  is  strongly  con¬ 
sistent  (with  probability  1) . 

3.  CONCLUDING  REHARKS  AND  SOHE 
APPLICATIONS 

In  the  previous  section  we  only  considered 
the  simplest  case.  The  method  permits  some 
extensions.  First  of  all  it  can  be  used  in 
the  multidimensional  case  as  well.  Then  the 
conditions  for  £  (independence  of  their 
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identical  distriictlco)  can  be  essentially 
softened. 

An  advantage  of  the  method  is  that  it  caa  be 
also  used  in  the  situation  when  the  observed 
signal  contains  both  the  additive  aixd  malti- 
plicative  noise: 

V  (WjWV  +  Si  ' 

and  besides  the  resalt  is  usually  as  good  as 
in  the  case  when  there  is  coly  additive  com¬ 
ponent - 

Sote  that  v the  suggested  method  can  be  used 
together  with  scare  other  one  by  the  follow 
scheme:  on  the  first  stage  all  points  of  dis¬ 
continuity  are  searched  by  Lj -method,  cn  the 

second  stage  the  signal  is  filtrated  by 
another  method  separately  on  each  interval  of 
continuity. 

The  method  can  have  manr  applications  in 
signal  and  image  process vvg-  In  this  section 
we  consider  in  brief  some  of  then. 

A  number  of  applications  come  frea  scanning 
electron  microscopy.  In  microelectronics,  for 
example,  investigated  objects  often  have 
structure  which  can  be  described  by  a  piece- 
wise  constant  functions  of  two  or  three  vari¬ 
ables.  Besides,  there  are  both  of  noise  com¬ 
ponents:  additive  and  multiplicative  (multip¬ 
licative  noise  occurs  as  a  result  of  beam 


current  oscillations).  In  two  dimensional 
case  (surface  investigation)  a  typical  image 
usually  consists  of  a  number  of  simple  figu¬ 
res  like  stripes,  rectangles  etc.,  which  re¬ 
present  different  values  of  some  physical 
property  (atomic  number  of  the  material,  sur¬ 
face  flatness  etc.).  The  problem  is  to  find 
boundaries  of  those  figures.  In  three  dimen¬ 
sional  case  images  usually  have  sore  compli¬ 
cated  nature  but- they  are  still  described  by 
discontinuous  functions. 

Another  class  of  applications  arises  in 
spectrums  processing.  For  example,  the  spec¬ 
trum  of  electron  losses  is  discontinuous  (the 
point  of  discontinuity  is  related  with  the 
energy  structure  of  electrons  shells).  Such 
spectrums  are  usually  observed  with  a  high 
noise  so  the  point  of  discontinuity  cannot  be 
found  directly. 

Note  also  that  the  suggested  method  can  be 
used  for  solving  of  so  called  change  point 
problems - 
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Abstract  -.Mathematical  aspects  of  signal 
processing  of  seaming  electron  microscopy 
(SIX)  are  considered-  In  particalar  it  is 
investigated  a  class  of  integral  equations 
describing  the  signal  formation  in  SEX  profi- 
1 cue try.  Computational  problems,  relating 
with  SIX  study  of  multilayer  structures,  are 
also  investigated.  Models  and  algorithms  of 
signal  processing  are  presented  for  these 
cases. 

2.  ISTRODOCTION 

Scanning- electron  microscopy  is  one  of  the 
rain  diagnostic  tools  of  semiconductor  struc¬ 
tures  investigation .  However  there  are  sene 
difficulties  in  its  using, for  studying  of 
subnicrcn  structures.  Although  the  present 
day  nicr ©scopes  can  provide  a  few  nanometer 
resolution  it  is  only  possible  for  specially 
prepared  thin  samples  with  contrast  inclu¬ 
sions.  Examining  the  mass  samples  inherent  in 
microelectronics  a  locality  i?  determined  by 
the  dispersion  zone  size  of  primary  electrons 
in  a  sample,  and  makes  from  fractions  of 
micron  to  a  few  microns  under  different  expe¬ 
rimental  conditions.  Thus  the  signal  has  a 
complex  integral  nature  and  the  validity  of 
its  Interpretation  requires  use  of  special 
methods  of  signal  treatment. 

In  this  paper  some  problems  of  signal  pro¬ 
cessing  of  scanning  electron  microscopy  are 
considered. 

2.  SEM  PRO FI L0ME7RY  SIGNAL  PROCESSING 

A  number  of  SEM  diagnostics  problems  leads 
to  nonlinear  integral  equations.  The  first 
class  of  integral  equations,  we  consider, 
comes  from  the  problem  of  surface  relief  re¬ 
construction  (SEM  prof ilometry) .  As  it  was 
shown  in  (1)  the  surface  relief  reconstruc¬ 
tion  is  equivalent  to  the  solving  of  an  in¬ 
tegral  equation  of  the  form 

to 

JjK(x,x0,y,y0)P(£(x,y)-f(x0,y0))dxdy  - 

(1) 

-  S(x0,y0)  ,  -«<x0,y0«»  . 

where  f(x,y)  is  the  unknown  function  to  be 
found  (f  describes  the  shape  of  the  relief), 

K  and  F  are  two  known  functions  satisfying 
to  the  following  conditions:  F  is  a  continu¬ 
ous  monotonic  function,  K  is  a  positive 
continuous  furction  such  that 
to 

J£K(x,x0,y,y0)dxdy  <  a 

for  any  xQ  and  y0,  and 
to 

■fX|K(x.Xo,y,yo)-K(x,x0,y,y0)  jdxdy  -0 

at  X*  -x0  and  y0‘-y  fl 

S(x0,y0)  is  the  observed  signal.  One  of  the 
main  questions,  related  with  the  equation, 
is  the  uniquty  of  its  solution.  The  follow 


theorem  shows  that  the  solution  of  equation 
(1)  is  unique  up  to  the  addition  of  an  ar¬ 
bitrary  constant. 

Theorem  1.  suppose  that  fj(x,y)  and 
-2(x,y)  are  two  solutions  of  equation  (1). 
Then 

*j(x,y)  -  f2(ac,y)  +  constant. 

Sew  consider  the  problem  of  the  numerical 
solution  of  equation  (1) .  It  can  be  solved 
by  the  next  iterative  procedure: 
o 

J'p:(x,x0,y,y0)P(^I(x/y)-fn(x0,y0))dxdy  - 

(2) 

-S(x0,y0). 

It  takes  place  the  next  statement. 

Theorem  2.  Let  fQ(x,y)  be  an  arbitrary 

initial  estimate  (any  continuous  function). 
Then  the  sequence  fj,  ...  ,  where  fn  is 

determined  from  equation  (2),  converges  in 
the  uniform  norm  to  the  solution  of  equation 
(1). 

3.  SIGNAL  PROCESSING  OF  SEM  DIAGNOSTICS 
OF  MULTILAYER  STRUCTURES 

The  investigation  of  multilayer  structures 
is  a  very  important  problem  of  SEM  diagnos¬ 
tics,  especially  in  nicroelectronics.  How¬ 
ever,  until  recently  the  most  important  types 
of  signal  (secondary  and  backs cattered  elect¬ 
rons,  electron  beam  induced  current  etc.) 
have  been  used  only  for  surface  analysis.  The 
main  reason  of  this  is  the  intricate  depen¬ 
dence  of  the  generated  signal  on  the  object 
structure  in  the  case  of  multilayer  specimens 
investigation.  In  this  section  of  the  paper 
we  discuss  matters  connected  with  models  and 
algorithms  of  signal  processing  arising  in 
multilayer  structure  diagnostics. 

Assume  that  the  object  in  question  has  an 
ideally  smooth  surface  and  inhomogeneous  dis¬ 
tribution  of  the  atomic  number  f*f(x,y,z) 
within  itself,  f  is  an  unknown  function  which 
should  be  recons tructed.  Let  the  electron 
beam  be  incident  to  a  point  (xQ,y0).  Denote 

by  E  the  electron  bean  energy  and  by 
P(x0,y0,E)  the  secondary  electron  emission 
coefficient.  Then  the  signal  formation  can  be 
described  by  the  following  equation 

X/J-K(x-x0,y-y0,z,E)R(2/Er)f(x,y,z)dxdydi  - 

(3) 

"  P<VVE>  '  <  x0,y0  <  "  •  *>0' 

where  K  and  R  arc  known  functions.  If  the 
electron  beam  energy  E  is  fixed,  then  equa¬ 
tion  (3)  is  urderdefined:  the  unknown  func¬ 
tion  depends  on  three  variables  whereas  the 
rihgt-hand  side  only  on  two.  Hence,  f  cannot 
be  unanbiguosly  reconstructed  by  P.  To  make 
equation  (3)  unanbiguosly  solvable  for  f,  the 
energy  E  should  be  made  variable.  Equation 
(3)  is  a  special  case  of  the  Fredholm  equa- 
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tiers- of  the  first  kind"and  represents  an  Ill- 
posed  problem-  If  the  energy  interval  is.  vide 
enoogh  then  it  can  be  solved  numerically  by 
a  special  regularization  method  based  cn  use 
of  the  Fourier  or  Kell  in  transfora. 

Another  class  of  numerical  problems  arises 
in  electron  bean  induced  concentration 
methods  (2] - 

The  methods  of  induced  concentration  (KIC) 
include  the  following  methods:  EBIC,  TRE3IC, 
IRSIC,  EBXV,  SHF-absorption,  cathodoluxinis- 
cence,  therroa caustics,  SDLTS-  The  measured 
contrast  c  in  all  these  methods  is  spatial- 
dependent  if  tinelife  T  of  induced  carriers 
is  not  unifora,  T(m,y,z) 
c(r0)-£  (I0-I(r0,  (TJ)  1/I0,  V(x0,z0> 

Of  course  the  information  (two-dimensional 
function  c(xQtzQ})  is  not  sufficient  for  re¬ 
construction  three-dimensional  function  T 
(solving  the  inverse  prcblea) .  We  suggest  to 
use  as  a  third  coordinate  energy  of  incident 
beax  Zg.  The  equation  of  signal  is 

c<r0'EB>“  I  Qtr)  F<r>  p<r'r0'V'  (<) 

where  concentration  of  minority  charge 
carriers,  P(r),  is  a  solution  of  the  diffu¬ 
sion  equation 

(-  DSV+(?#))+l/T0+r)P«g, 

r(r)-l/T(r)-l/T0 

The  signal  equation  is  valid  for  any  method 
mentioned  above  because  of  the  factor  Q  is 
known  function  for  each  case.  The  dependence 
of  P  on  life  tine  T  makes  equation  (4)  nonli¬ 


near.  We  proved  the.nonl inear  prcblea  (4)  is 
equivalent  to  linear  equation  for  renormali¬ 
zed  function  rGff  and  simple  relation  between 

r  aad  reff 

<=«VV  ^  P0(r,r0,iy, 

r(Ej^j(Dr(Ej/[ccf)- 

Jd  E'Q(E')r(r,>)C(i'.E)] 

Here  PQ  and  G  are  solution  and  "Green-function 
of  dif fusion  equation  for  uniform  case.  New 
we  can  say  that  yell  known .  Donolatc  approxi¬ 
mation  gives  namely  ^but  not  the  real 

time-life.  Of  course,  if  non-uniformity  of 
T  (the  value  of  contrast),  is  small  the  func¬ 
tions  T  and  reff  are  very  close.  The  computer 
simulation  has  shown  that  it  is ; necessary  to 
measure  contrast  with  high  accuracy • (relative 
error  lO^-lO--5) . 

In  a  number  of  situations  an  analytical  mo¬ 
del  of  the  signal  generation  is  not  availab^ 
le.  For  such  cases  we  suggest  the  use  of  Koii- 
te  Carlo  simulation  as  a  tool  to  solve- the' 
inverse  problem. 
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Abstract:  Unlike  the  vocal  loop  control  strategies 
which -are  now  usually  performed  by  numerical 
algorithms  on  many  process  plants,  the  supervisory 
level  may,-  still  be -devolved  to'  human  operators. 
Recently  developed  techniques  '  In  Qualitative 
Reasoning  may  be  useful  to  aid  operators  within  some 
of  their  tasks.  Inltlal  results  In  Qualitative  Physics 
showed  encouraging  potential.  However,  such  an 
approach  may  not  be  the  most  appropriate  basis  from 
which  to  start'ln  addressing  process  supervision 
problems.  Control  engineering  principles,  encoded 
In  Qualitative  Transfer  Function  fotm,  seem  to 
provide  a  more  widely  applicable  technique. 

1  Introduction 

One  major  Irony  of  Increasing  process  plant 
automation  Is  the  demand  for'  improvements  In 
operators  skills  and  the  need  for  more  effective 
training  In  order  to  enable  them  to  better  carry  out 
their  supervisory  tasks  IBalnbridgeS3).  Supervisory 
control  Implies  a  global  understanding  of  Ihe  process 
evolution  which  is  be  difficult  to  captuie  In 
mechanistic  model,  especially  In  complex  plants. 
Moreover  the  results  provided  by  numerical 
simulations  may  not  be  at  the  desired  level  of 
abstraction  for  supervisory  tasks. 

There  are  therefore  motivations  to  develop  a 
qualitative  model-based  reasoning  technique  and 
recent  work  on  qualitative  modelling  and  simulation 
have  shown  the  validity  and  the  relevancy  of  such 
approaches.  Initial  results  have  been  encouraging  and 
steps  are  now  being  taken  to  apply  Qualitative 
Reasoning  methods  to  real  world  problems.  Whist  a 
deep  knowledge  of  the  physical  characteristics  of  a 
process,  providing  they  can  be  expressed,  might  be 
useful  for  control  purposes.  Qualitative  Physics 
(deiaeer841  (Kulpers891  may  not  provide  the  most 
useful  level  of  abstraction  to  deal  with  process 
supervisory  Issues.  Mapping  the  problem  Into  control 
engineering  transfer  function  form  and  adopting 
qualitative  modelling  concepts  could  lead  to  a 
"qualt.atlve  control  engineering"  approach  obviating 
the  well  known  ambiguity  problems  In  Qualitative 
Physics  modelling. 


It  is  now  broadly  agreed  that  Qualitative  Reasoning 
has  a  major  role  to  play  In  high  level  supervisory 
control  on  process  plant  lKu!pers891  iF6ray91) 
ILeltch89).  However,  the  requirements  for 
supervisory  control  are  different  from  those  for 
closed  loop  automatic  control.  Loop  control, 
optimisation,  etc..  Is  based  around  numerical  models 
of  appropriate  structure  and  detail.  In  many 
situations  overall  process  supervisory  ron'rol  tends 
to  be  based  around  human  operators  who  make  use  of 
both  qualitative  descriptions  of  approximate  relations 
between  variables  and  direct  quantitative 
measurements.  Different  goals,  different  techniques, 
different  conditions  of  application  demand  different 
tools.  In  order  to  drive  supervisory  control  closer  to 
automatic  control  philosophies,  Qualitative  Reasoning 

•  To  whom  all  correspondence  should  be  sent. 


developers  have  attempted  to  mimic  physicists" 
reasoning  and  mathematics  -  creating  so  called  deep 
knowledge.  Control  engineering  philosophies  seem 
to  be  a  more  appropriate  basis  upon  which  to  build 
process  plant  supervision  strategies. 

Some  Initial  steps  have  already  been  made. 
Qualitative  techniques  hare  been  used  to  reveal  some 
structural  properties  of  physical  systems  - 
controllability,  stability,  etc.  tTrav£S6].  Work  Is  still 
being  carried  out.  however,  on  qualitative  algebras 
[Missier89]  In  order  to  provide  Qualitative’ Reasoning 
with  a  systems  related  formalism.  Concepts  such  as 
qualitative  matrix,  qualitative  rank  of  a  matrix, 
qualitative  eigenvalues,  etc.,  have  been  defined  with  a 
view  to  transposing  the  results  of  ■classical"  control 
engineering  Into  their  qualitative  counterparts  (e.g. 
pole  assignment). 

Another  approach  consists  of  providing  a  view  of  the 
relationship  between  two  variables,  in  terms  of  Input 
and  output,  without  knowing  exactly  the  physical 
relationship  between  the  two  variables.  This 
corresponds  to  the  methods  adopted  in  Transfer 
Function  theory.  Thus,  a  qualitative  counterpart 
would  be  to  describe  the  evolution  of  related 
variables  through  behaviour  constraints  qualitatively 
described. 


A  major  advantage  of  Transfer  Function 
methodologies  Is  the  simplicity  of  representation  A 
similar  expressive  form  is  achieved  by  using  a  causal 
graph  model  of  the  process  IFeray89).  Instead  of  a  set 
of  differential  equations  (numerical  or  qualitative 
[deKIeer84D  to  describe  physical  phenomena  Each 
node  of  the  graph  represents  a  variable  relevant  to 
that  required  for  process  supervision  purposes.  The 
arcs  are  described  as  the  behaviour  constraints 
(called  Qualitative  Transfer  Functions  QTF)  between 
the  variables.  The  QTF-based  model  encodes  the 
concepts  used  within  the  Transfer  Function  (TF) 
framework.  The  resulting  qualitative  model  Is  then  to 
be  used  to  simulate  process  behaviour  In  order  to 
provide  a  means  of  early  detection  of  process 
malopcratlon,  with  subsequent  fault  diagnosis. 

The  evolution  of  a  variable  (called  history)  is  a  piece 
wise  linear  function  linking  the  Important  steps  in 
the  variable  behaviour.  These  "steps'  consist  of 
remarkable  points  on  tire  evolution  of  the  process 
variables  (e.g.  alarm  threshold  overshoot)  Between 
any  two  points  (called  eusn(s).  the  approximation  Is 
linear  for  the  purposes  of  simplicity.  QTFs  are 
defined  as  the  way  of  relating  two  piece-wise  linear 
evolutions.  They  are  approximations  of  the  a-prtori 
well-known  responses  of  numerical  TFs  to  standard 
Inputs  (steps  and  ramps)  For  consistency,  these 
responses  are  transformed  into  piece-wise  linear 
functions  which  can  be  seen  as  sets  of  Intervals 
parameterised  by  the  values  of  the  temporal  variables 
describing  the  QTF  (time-delay,  settling  time  or 
period  of  oscillation). 

In  order  to  describe  the  different  forms  of  behaviour 
constraints  between  continuous  variables,  a  QTF 
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library  Is  proposed  [FdraySS].  The  QTF  library  has  an 
open-ended  structure.  Behaviour  constraints  can 
always  be  added  If  needed  (e-g.  Integrating  and"  nbn- 
minimum  phase  characteristics,  etc.).  Some  other 
kinds  of  propagation  constraints  Including  fuzzy 
functions  are  proposed  In  [Vesccvl90). 

3.2  Simulation  Algorithm. 

The  qualitative  simulation  is  based  on  an  event-based 
propagation  algorithm  [LeyvalSOJ.  "This  algorithm  Is 
asynchronous  arid  fed  by  data  measured  from  the 
process  (or  from  any  other  source).  The  propagation 
through  the  graph  Is  achieved  by  transforming  all 
events,  sorted  according  to  the  time.,  into  a  set  of 
events  using  the'  QTrs  and  merging  these  newly 
created  events  Into  the  existing  histories  of  the 
variables.  The  number  of  new  events.  In  a  set  their 
"value*  and  "time*,  depend  on  the  Input  event  and 
on  the  information  contained  In  the  arc  (Le  the  kind 
of  behaviour  constraint,  and  the  different 
-parameters).  It  is  terminated  when  there  Is  no  event 
left  to  be  propagated  or  when  the  propagation  has 
been  completed  within  a  pre-determlried  time 
interval  (in  case  of  loops  in  the  graph). 

The  relevant  information  produced  by  a 'QTF-based 
model.  Is  the  shape  of  the  behaviours,  the  orders  of 
magnitude,  etc.  Such  Information  Is  sometimes  more 
useful  for  process  supervision,  fault  diagnosis,  etc. 
than  precise  numerical  values  ILeltch89). 

4  Applications. 

The  first  application  of  the  QTF  approach  was  for  the 
modelling  of  a  steam  generator  of  a  nuclear  power 
plant  IMQ&D91).  The  aim  of  the  system  was  to 
provide  the  pro  ess  operators  with  some  useful 
information  to  help  them  prevent  unwanted  and 
costly  shut-downs  of  the  steam  generator. 

Another  application- of  the  QTF  approach  to  a 
different  kind  of  plants,  pulsed  columns.  Is  described 
In  |Fdray89).  Work  is  now  being  carried  out  to 
provide  an  analysis  of  both  qualitative  simulation  and 
the  measurements  on  the  process  in  order  to  achieve 
a  better  understanding  of  the  process  evolution 
IMontmaln90].  It  Is  reported  that  another  application 
Involving  the  same  system  will  start  in  1991  on  a 
nuclear  fuel  treatment  plant  (MQ&D91). 

The  general  modelling  tools  arid  the  simulation 
algorithm  have  also  been  used  to  build  a  qualitative 
model  of  a  distillation  process  I  FerayD  1 1 .  Using  a 
complex  numerical  physical-chemical  model  as  a 
reference,  the  results  produced  by  the  QTF-based 
model  arc  compared  with  those  from  the  detailed 
numerical  model.  The  relevancy  of  the  Information 
piovidcd  Is  then  able  to  be  assessed.  Comparisons 
have  been  made  on  both  the  top  and  bottom  product 
flows  and  both  the  top  product  and  product 
compositions,  and  demonstrates  the  validity  of  the 
approach  (Feray911. 

g-PfoUMloni 

The  motivation  for  our  work  is  to  build  simple 
models  encoding  both  approximate  "behavioural* 
knowledge  and  more  numerically  accurate  data, 
when  available,  within  the  structured  framework  of  a 
QTF-based  model.  The  qualitative  simulation  results 
are  mainly  composed  of.evolutlon  tendencies  of  the 
process  variables  and  their  orders  of  magnitude. 

It  is  stressed  that  qualitative  modelling  does  not  aim 
at  replacing  nr  ncrical  modelling  in  control  Ioop3, 
but  at  comple-  renting  the  existing  loop  control  level 


with  a  high  level  decision  making  loop  providing 
Information  that  Is  closer  to  the  human  operators' 
way  of  reasoning. 

Qualitative  Control  Engineering  provides  a  set  of 
tools  for  process  supervision  built  on  the  well  known 
classical  and  modem  theories  for*  process  control. 
Such  an  approach  appears  to  provide  the  prospects 
for  a  major  step  forward  tn  Qualitative  Reasoning  If 
some  of  the  statements  made  by  MacDerinott  In  his 
article  "Artificial  Intelligence  meets  Natural 
Stupidity"  bear  fruit.  then'-' Qualitative  Control 
Engineering  could.  In  the  short  term,  aid  operators 
In  their  supervision,  task  whilst  ultimately 
contributing  to  the  goal  of  a  totally  computer  based 
control  strategy. 
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Abstract  -  This  paper  deals  with  Qualhaave  Reasoning  concepts  for 
^•nrAfnn^  rhe  i.T>tfTpr»minn  of  measuremena  and  cfcsgvaiiocs^pcr- 

Ibe  definifioas  of  xasopretaiioa,  explanation  and  causality,  theawintt- 
ity  existing  between  these  three  basic  notions  is  shown,  aad  a  (prelim¬ 
inary)  theory  of  uuopretadori  is  proposed.  Then,  the  metbaJoJo|y 
used  for  designing  a  model  of  integration  applied  to  the  field  of  bio¬ 
logical  processes  is  described. 

L  INTRODUCTION 

In  the  domain  of  system  supervision  (Caloud,  1988;  Feray 
Beaumont  et  aL,  1989),  the  concept  of  interpretation  seems  to  be  par¬ 
ticularly  important,  as  continuous  processes  internal  so  complex  sys¬ 
tems  aro  not  cHrecdy  observable.  They  can  only  be  apprehendal  by  the 
use  of  sensors,  analyses  or  observations,  Le.  in  a  discrete  and  extemal 
way.  Data  that  are  collected  on  diem  constitute  the  set  of  signs  dis- 
*irned  by  the  observer.  If  eventually  they  can  have  a  direct  meaning 
(are  not  they  themselves  an  interpretation  of  real  phenomena  7),  they 
get  only  their  full  signification  through  interpretation,  itself  related  to 
the  main  goal  to  be  reached. 


figure  1  -  The  cootfol  loop  for  system  managemeae 


Interpretation  is  obviously  based  on  subjective  estimations,  because 
external  world  signals,  received  by  our  senses,  are  inteipreted  through 
our  perceptive  categories,  molded  by  self-experience  (Ninio,  1989), 
Interpretation  is  essential,  both  in  the  constitution  of  operator’s  know¬ 
ledge  and  in  its  ability  to  operate.  Interpretation  prepares  decision¬ 
making,  and  thus  tppeare  to  be  the  first  step  of  Cognition,  central  pro¬ 
cess  in  the  control  loop,  just  between  Perception  (i  e.,  data  collection) 
and  Action  (fig.  1). 

In  fact,  a  human  operator  behaves  as  an  Expert  (Vogel,I988), 
Judgment  and  action  rules,  progressively  constructed,  are  applied  to 
data  (eventually  preliminary  processed),  usually  translated  into  quali¬ 
tative  terms  to  become  the  basis  of  reasoning  (De  Kleer  and  Brown, 
1983).  This  high  level  approach  is  by  nature  action-oriented,  as  it  inte¬ 
grates  a  global  knowledge  simplifying  the  real  world  complexity  to 
make  it  understandable  and  controllable  (Rasmussen,  1985).  Hence, 
as  representation  can  to  some  extent  fill  the  gaps  in  mathematical  mod 
eling,  when  no  acceptable  model  is  available  or  when  solutions  geper- 
atea  must  themselves  be  interpreted  before  being  used  in  decision 
making  (Caloud,  1988).  We  believe  therefore  that  it  does  exist  a  space 
for  a  complementary  approach,  aimed  at  computing  all  available  infor¬ 
mation  on  the  system,  including  the  pieces  which  would  be  too  diffi¬ 
cult  to  formalize  in  terms  of  classical  mathematical  equations. 


IL  A  THEORY  OF  INTERPRETATION 

Bat  wfcu  is  interpretation  ?  According  to  Fttfida  (1980),  the  “art  of 
interpretation”  is  based  on  the  recognition  of  a  sense,  hidden  under  the 
apparent  meaning  taken  by  god’s  word,  the  manifestation  of  a  sign, 
the  expression  of  a  gescre  or  a  worL.  Id  agreement  with  that  defini¬ 
tion,  giving  an  interpretation  is  an  action  of  sense  production  by. 
means  of  a  discourse  applied  to  signs/Dris  genial  concept  is  applied 
to  such  various  fields  as  theological  exegesis,  art,  psychoanalysis, 
human  sciences,  medicine-. 

.How  about  explanation  ?  According  to  Varela  (1989),  explaining  is 
reformulating  phenomena  in  such  a  way. that  their  components  be 
causally  related  to  each  others.  Thus,  if  interpretation  is  sees  as  a  dis¬ 
course  about  things,  explanation  can  be  seen  also  as  a  pan  of  interpre¬ 
tation. 

How  causality  takes  place  within  these  concepts  ?  In  agreement  with 
Saint-Serum  (1980),  we  will  start  from  the  idea  that  the  term  causality 
does  Dot  mean  direaly  a  property  cf  the  relationships  between  objerts 
(ontological  meaning),  but  a  characteristic  of  the  statements  constitut¬ 
ing  an  explicative  theory  of  observed  facts.  Therefore,  if  explanation 
is  considered  like  the  way  to  establish  causality,  and  causality  like  the 
way  to  explain  facts,  there  is  a  clear  equivalence  between  these  two 
notions. 

MEASUREMENTS  ANALYSES  OBSERVATIONS 


Fijcrc  2  -  A  decry  of  mtnpretttion :  tnrukiion,  cdcolos  &  explanations. 


finally,  our  “theory”  of  interpretation,  considered  as  meaning  as¬ 
signment  to  observed  signs,  includes  three  basic  levels  necessary  for 
the  comprehension  of  the  system  under  observation  (fig.  2) : 

-  Translation :  assignment  of  individual  meaning  to  the  model  input- 
values  (e.g.,  the  measured  values  x,  y,  z...  of  variables  A,  B,  C. 
will  be  qualified  as  low,  bad,  slow...,  according  to  their  nature); 

-  Calculus :  assignment  of  individual  meaning  to  all  the  model  inter¬ 
nal  variables  (e.g.,  combining  several  Qualitative  values  A  *  low,  B  » 
bad ,  C  *»  slow...,  will  allow  one  to  deduce  the  qualitative  values  K  « 
good,  L  *  high...,  of  unmeasurable  variables  K,  L...); 

-  Explanation :  assignment  of  meaning  to  the  system,  by  enunciating 
all  the  causal  relationships  between  variables  accounted  for  translation 
and  calculus  (e.g ,  A  *  tow  and  B  «  bad  imply  K  *»  good...). 

Thus,  giving  an  interpretation  is  deducing,  from  any  subset  of 
input-values,  qualitative  values  of  as  many  unknown  variables  as  pos¬ 
sible,  and  explaining  the  reasoning  to  provide  the  user  with  an  overall 
comprehension  of  the  phenomena. 

HL  METHODOLOGY  OF  INTERPRETATION 

This  approach  was  applied  to  data  interpretation  in  hydro-ecology, 
lc,  interpretation  of  measurements,  analyses  and  observations  per¬ 
formed  on  an  aquatic  ecosystem  for  management  purposes  (Guemn, 
1990;  Guemn,  1991)  It  involves  four  methodological  steps  for  de 
signing  an  interpretation  model  based  on  expert  knowledge  (stalled 
process  engineer’s): 

1  -  Identification  of  relevant  variables  for  system  supervision  pro¬ 
poses  ,  especially  unmeasurable  or  unobservable  ones; 

2  -  Representation  of  influences  between  the  variables  as  a  causc-ef 
feet  graph,  by  starting  from  the  identification  of  elementary  causal 
chains; 


1736 


(1)TR/Vr*»-X- unQ . 

^<nSr^*>Vir-Q-}q-va^ 

{2}  TR^obs-Y  -  o.*£:'n;}.{o1_  on])  ; 

o  -  {_&ro^}  ->  Q-Var  -  {qvaf) 

P)Tft<tfq^t_  c^v^n) : 


qva?j  m  {q-vaT}.  q-varj  -  {<*vaT%~  •>  Q-Var  .  {q*aTJ 
Q-Var  -  — q-wcjj  fop]  qvirj... 

WAR; 

RiM'X  TR-c<'n«»-X)  ->  O- V* 
cb*-Y  TH-o(obvY)  ->  Q-Var 
q-var,  q-varj_  TR-q<q^ar^  qva/p)  ->  O- V* 
qvarfc  q-varj_  TR-q{q-vark,  q-vaq_)  ->  Q-V*r 
fijw  3  -  Specjfieauoo  framework  of  (pafiuiirc  transfer  roles  ( TR-o .  translation 
of  mearoraneott;  TR-o  !  translated  of  observations,  TR-q ;  eaten  las,  AR .  setk* 
roles). 


3  -  Knowledge  specification,  according  to  a  standardized  frame¬ 
work  which  distinguishes  (Fig.  3) : 

•  Translation  Rules  ( TR-m  and  TR-o)  of  numerical  data  (measure¬ 
ments,  analyses)  and  linguistic  terms  (observations)  respectively,  into 
a  relevant  vocabulary  for  the  application  !  e.g.,  symbols  like  {pp,  p, 
m,  f,  ff]  whose  semantics  corresponds  to  orders  of  magnitude  (e.g., 
very  low,  low,' medium,  high,  very  high,  respectively); 

•  Calculus  Rules  &R-q)  operating  on  this  set  of  symbols  (called 
Quantity  Space)  to  deduce  the  qualitative  value  of  a  variable  from 
qualitative  values  of  its  causes,  by  using  6  formal  calculus  operators 
(definitions  given  Fig.  4  below); 

•  Action  Rules  (AR)  for  controlling  the  use  of  above-cited  rules. 

4  -  Software  programming  (that  was  made  with  PROLOG);  at  this 
level,  giving  an  interpretation  consists  essentially  in  producing,  from 
input-values  (measurements,  observations,  given  by  the  user  or  deliv¬ 
ered  by  sensors) : 

•  the  system  State .  vc ,  the  set  of  all  the  generated  estimations  in 
the  form  of  <qualitadve- value,  last  of  qualitative  variables>, 

•  Explanations  ;  i.e.,  in  agreement  with  De  Kleer  and  Brown 
(1983),  the  execution  trace  of  reasoning  (at  least)  having  led  to  the  cal¬ 
culus  of  variables  throughout  all  the  paths  available  m  the  causal 
graph,  displayed  in  a  language  that  can  be  understood  by  the  user. 
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Hjwre  4  -  Definition  of  six  operators  for  qualitative  cakulai  on  a  five  symbol* 
space  QS»{pp,p,m.f,ff), 


IV -DISCUSSION 

We  most  point  out  that  our  system  performs  a  static  reasoning,  as  it 
gives  an  interpretation  of  a  set  of  measurements,  analyses  and  obser¬ 
vations  made  at  apartkular  instant.  As  it  works  in  a  deductive  sense, 
it  is  predictioo-onented  (given  the  causes  determine  the  effect).  Bet 
these  principles  of  imerpxtaaon  could  also  be  applied  to  the  diagnostic 
sense  (given 'the  effect  determine  the  causes).  Beyond  this  aspect,  we 
believe  that  taking  into  account  the  past  evolution  of  variables  (l e^  in¬ 
terpretation  of  the  succession  of  memorized  states)  would  be  of  major 
interest  Integrarioo  in  qualitative  reasoning  of  information  extracted 
from  time  evolution  curves,  seems  therefore  fundamental 
(Willoughby,  1989).  We  are  beginning  to  address  this  topic  which 
highlights  some  interesting  problems :  recognition  of  patterns,  intelli¬ 
gent  smoothing,  kinetic  comparison— 

Our  formal  calculus  system  has  a  certain  lack  of  generality,  for  ex¬ 
ample  :  m  (+)  p  [+J  p  [+J  p...  will  always  result  in  (m).  Calculus  are 
therefore  restricted  to  relatively  shallow  reasoning.  On  the  other  hand. 
Quantity  Space  symbols  represent  actually  adjacent  intervals.  Fuzzy 
intervals  would  be  perhaps,  conceptually  better.  This  was  thought  un¬ 
necessary  because  in  our  application,  adjacent  intervals  fit  the  expert’s 
way  of  thinking,  without  generating  spurious  results.  Two  other  prob¬ 
lems  must  be  emphasized,  although  they  appear  more  specific  to  liv¬ 
ing-system  s  (Varela,'  1989) : 

-  first,'  is  the  problem  of  “circular  causality”,  when  an  effect  and  its 
cause  are  confounded;  we  dealt  with  it  by  choosing  an  unidirectional 
sense  of  causality  according  to  our  purposes  (ite ,  by  breaking  the 
loop) ;  we  hope  that  better  solutions  can  be  found, 

-  then  is  the  problem  of  designing  a  real  dynamic  model,  because  >f 
a  lack  of  knowledge  about  temporal  relationships  between  variable  ^ 
such  as  delay  or  settling  time. 

Our  actual  perspective  (outside  the  scope  of  tune  evolution  curve 
analysts)  is  mainly  to  apply  these  concepts  to  other  biological  systems 
We  arc  beginning  a  work  on  fermentation  reactors  Our  preliminary 
experience  m  that  field  is  encouraging.  Beyond  this,  a  longer-term  per¬ 
spective  would  be  frying  to  build  something  like  a  “General  Theory  of 
Interpretation”  valid  at  least  in  the  field  of  biological  processes... 
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ABSTRACT 

This  paper  deals  with  a  qualitative  interpretation  of  an  on  line 
simulation  for  supervision.  The  simulation  algorithm  relies  on  the 
propagation  of  events  through  a  causal  graph  :  each  signal  is 
decomposed  as  a  series  of  significant  variations.  The  fault  detection 
rests  on  a  qualitative  interpretation  of  the  difference  bc'ween  the 
process  and  the  simulation.  This  comparison  has  to  be  robust 
enough  to  avoid  the  classical  drawbacks  of  the  threshold  methods 
and  to  take  into  account  the  disturbances  inherent  to  the  modelling  or 
to  the  measurement  system. 

Firstly  modelling  techniques  are  briefly  reported  and  the  notion  of 
significant  event  in  a  process  is  presented  Next,  several  specific 
terms  are  introduced  to  provide  a  definition  of  qualitatively  similar 
behaviour  Finally,  the  algorithm  describing  the  qualitative 
interpretation  of  the  error  is  presented .  it  is  based  on  fuzzy  analysis. 


KEYWORDS 

Qualitative  interpretation  -  Supervision  -  Fault  detection  -  Diagnosis 


I.  INTRODUCTION 

DIAPASON  is  a  system  which  aims  to  assist  operator,  in  supervising 
industrial  continuous  processes.  It  provides  operators  with  the 
simulation  of  the  normal  behaviour  of  the  process  and  with  a  help 
for  high  level  diagnosis  (1).  It  is  a  modular  system .  PROTEE  is  the 
qualitative  simulator  whose  model  is  a  causal  graph  {2J ,  MINOS 
analyses  the  simulation  results  by  comparing  them  on  line  with  the 
process  measurements,  and  starts  a  fault  diagnosis  when  necessary 
{3J ,  SPHYNX  diagnoses  by  providing  the  causes  of  the  detected 
defect  thanks  to  expert  rules  that  use  a  structural  knowledge  of  the 
process  [4). 

This  paper  deals  with  the  control  and  decision  unit,  MINOS,  and 
describes  more  especially  the  comparison  algorithm.  The  aim  is  to 
detect  complete  and  drift  failures  of  the  process  as  soon  as  possible 
thanks  to  model-process  comparison. 

An  image  of  the  normal  behaviour  of  the  process  is  obtained  owing 
to  the  simulator  PROTEE.  The  model  of  the  process  is  expressed  as  a 
graph  whose  nodes  arc  the  variables  relevant  to  the  operator  and 
whose  arcs  represent  the  causal  relations  between  the  variables.  This 
model  is  not  based  on  a  sound  physical  analysis  of  the  various 
phenomena  occurring  in  the  process.  The  evolution  of  a  variable  is 
represented  by  a  piecewise  linear  function.  The  simulation  algorithm 
takes  into  account  step  or  ramp  evolutions  according  to  dynamics  of 
signals.  An  evolution  is  propagated  through  an  arc  piece  after  piece ; 
the  response  is  still  a  piecewise  linear  time  function  with 
discontinuities,  named  Qualitative  Response 
The  signals  to  be  compared  are  the  measured  signal  which  is 
sampled  and  the  simulated  one,  a  Qualitative  Response,  In  order  to 
have  regard  to  homogeneity,  the  sampled  measured  signal  has  to  be 
replaced  by  a  segmented  representation,  it  means  by  a  piecewise 
linear  time  function.  This  approximation  is  not  very  restrictive 
concerning  the  calculated  error .  the  use  of  the  sampled  signal  of  the 
process  would  have  provided  an  illusory  precision  with  regard  to  the 
point  of  view  of  the  modelling  and  the  quality  of  industrial 
measurements.  Moreover,  this  substitution  is  beneficial  to  the 
storage  of  the  histones  of  vanables ,  it  is  far  less  costly  to  store 
senes  of  significant  events  than  periodically  sampled  values. 
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The  simulation  results  correspond  to  a  synthetic  representation  of  the 
process  and  to  the  level  of  abstraction  relevant  to  supervision.  From 
this  point  of  view,  the  shape  of  an  evolution  is  as  important  as  the 
precise  values  of  its  points.  So  it  is  necessary  to  use  integral  criteria 
on  temporal  windows  rather  than  instantaneous  criteria  in  order  to 
compare  the  simulated  signals  and  the  real  ones. 

The  causal  model  also  used  for  diagnosis  has  a  numerical 
parameterization :  gains,  delays,  settling  times ...  But  the  nature  of 
the  simulated  signals  is  rather  qualitative .  approximation  of  integral 
signals  by  piecewise  linear  functions. 

The  model-process  comparison  is  the  basis  of  the  decision  unit 
MINOS  which  has  to  start  a  finer  diagnosis  when  necessary.  Starling 
a  diagnosis  on  a  simple  comparison  of  the  simulation  error  with  a 
threshold  makes  a  system  generally  efficient  in  the  detection  of 
defects  but  too  sensitive  to  the  various  disturbances  and  imprecision 
(it  leads  to  bad  detections).  To  avoid  this  problem,  qualitative 
notions  which  are  more  consistent  with  the  modelling  principles  are 
introduced  in  MINOS ;  the  algorithm  rests  on  a  local  treatment  on 
each  simulated  variable  and  takes  into  account  the  various  sources  of 
disturbances ;  this  provides  therefore  an  ’algorithm  robust  in 
dynamic  phases. 


2.  DEFINITIONS  AND  TERMINOLOGY 

An  event  e,  is  a  significant  change  in  the  behaviour  of  a  variable,  it 
means  a  variation  able  to  modify  soundly  the  global  behaviour  of  the 
process ,  it  is  parameterized  by  its  initial  time,  its  amplitude  variation 
and  us  slope  variation.  It  is  noted .  (t„  Ap,  Aa)  When  the  next  event 
(i  +  1)  occurs,  the  duration  of  event  i  can  be  calculated ,  t,*-i  - 1! 

An  evolution  is  a  sequence  of  events  whose  durations  arc  not 
equal  The  response  to  an  evolution  through  an  arc  of  the  causal 
graph  is  still  an  evolution  (stability  of  the  representation  for  the 
simulation  algorithm).  An  evolution  is  represented  by  a  piecewise 
lintar  discontinuous  time  function  which  is  deduced  by  an 
elementary  bijeetton.  From  evolution  [  (00,  1  0, 0  0) ,  (1.0,  T  0, 
0  0) ,  (3.0,  0  5,  0.5)  ],  the  piecewise  linear  time  function  can  be 
easily  deduced  in  figure  1 : 


Figure  1 :  An  evolution 

The  simulation  is  started  by  significant  events  detected  on  the  uxitrol 
vanables  or  on  measurable  disturbances  of  the  process.  A  a  event 
detection  aigonthm  is  needed.  After  filtenng  noise,  the  detection 
must  filter  measured  evolutions  so  as  to  exclude  too  small  amplitude 
or  too  short  duration  events.  This  aigonthm  which  runs  on  line  is 
called  segmentation .  it  enables  to  transform  pcnod’cally  sampled 
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signals  into  evolutions.  Two  main  parameters  are  necessary  for 
adjusting  the  segmentation  according  to  dynamics  of  the  evolutions : 
in  amplitude  threshold  and  a  temporal  window  width. 

A“  temporal  window  TEMPO  is  defined  for  each  variable  so  as  to 
analyse  the  behaviour  on  a  significant  deration. 

A  basic  error  criterion  e(t)  is  defined  for  the  comparison :  it  is  a 
signed  integral  error  between  the  simulated  evolution'  x*(t)  and  the 
segmental  measured  one  x(t)'; 

n+TEMPO 

e ©  =  TV4PO'  I  WO  " x*(0) tit-  (1). 

This  allows  fluctuations  of  the  simulated  evolution  around  the 
measured  evolution  on  the  window  TEMPO.  The  cost  of  the  calculus 
e(t)  is  very  low  for  it  corresponds  only  to  an  integration  of  piecewise, 
linear  time  function. 

Describing  qualitatively  a  behaviour  has  lead  us  to  extend  the  notion 
of  similar  evolutions :  the  use  of  a  moving  temporal  window 
enables  to  define  a  temporal  average,  a  standard  deviation,  and  more 
generally  it  gives  access  to  a  local  analysis  of  the  compared 
evolutions  over  the  selected  window  (tendencies  and  shapes...). 

A  tendency  is  a  symbolic  scale  relative  to  the  derivative  of  a 
function ;  three  levels  are  distinguished:  INCREASING, 
DECREASING  and  STEADY.  A  tendency  is  the  result  of  the  analysis 
on  a  temporal  window. 

The  evolution  of  a  variable  is  available  on  a  temporal  window 
TEMPO  at  every  moment  Then  it  is  possible  to  get  the  corresponding 
simulated  history  of  the  variable  on  a  second  temporal  window 
whose  width  would  be:  TEMPO  +  2.0  •  MARGIN.  MARGIN  is 
named  the  temporal  margin  and  it  enables  to  have  the  simulated 
evolution  on  a  larger  duration  on  both  sides  of  the  measured  one  at 
one’s  disposal  (figure  2) : 


Considered  time  interval  ^  ^Futur. 


Figure  2 :  Defined  temporal  windows 

So  translations  in  the  plane  (lime,  value)  make  it  possible  to  search 
for  the  pan  of  the  simulated  evolution,  on  a  duration  TEMPO, 
extracted  from  the  complete  simulated  evolution,  defined  on 
jti  -  MARGIN  ;  ti  +  MARGIN  +  TEMPO],  which  approximates  as 
well  as  possible  the  corresponding  measured  evolution  on 
[ti ,  ti  +  TEMPO].  The  translation  in  the  plane  that  provides  the 
minimization  of  the  quadratic  deviation  between  the  simulated 
evolution  and  the  measured  one  on  [ti ;  ti  +  TEMPO]  corresponds 
to  the  optimal  proximity  (OP)  of  the  real  behaviour  (5).  If  the 
quadratic  deviation  between  both  evolutions  provided  by  the  optimal 
proximity  is  below  a  given  threshold,  both  evolutions  have  similar 
shapes. 

The  associated  criterion  to  be  evaluated  is  the  quadratic  deviation : 
tstTEMPO 

c(tO,  a)  =  }(jc(t)-x*(l-  tO)  ■  a) 2  dt  (2) 

<> 

The  OP  calculus  cost  is  low,  for  x(t)  and  x*(t)  are  piecewise  linear 
functions. 

The  minimization  of  e  (tO,a)  is  simplified  by  writing  (2)  as : 
a2  *  TEMPO  *  2  *  a  *  e(tO,  0)  f  e(tO,  0),  where  e(t0,  z)  is  defined 
utTEMfO 

by:  e(tO, z)  -  ( (x(t)  -  x*(t  -  tO)  -  z) 

ti 

It  enables  to  separate  tO  and  a,  and  to  get  a  quadratic  equation  for  a 
parameterized  by  tO.  AU  the  evaluations  are  processed  at  little  cost 
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for  the  formal  calculus  is  developed  as  far  as  possible  thanks  to  the 
piecewise  linear  functions.  Otherwise  the  SCHWARTZ  inequality 
enables  to  link  the  thresholds  of  e(t)  and  e(t0,  a) :  if  we  assume  that 
I  e(t)  1  <  c,  the  threshold  of  e(i0,  a)  will  be  a  percentage  of 
TEMPO  •  c2. 

We  can  still  notice  that  this  calculus  can  be  used  to  study,  the 
parallelism  of  the  signals :  if  the  result  of  the  optimal  proximity  zs  a 
constant  vector  in  time,  it  means  that  the  evolutions  arc  locally 
parallel.  Two  'evolutions  are  said  to  have  similar  shapes  if  a- 
translation  (tO ;  a)  for  which  the  cri  tenon  e  (tO,  a)  is  reduced  below 
the  given  threshold  can  be  found.  It  corresponds  :o  e  (tO ;  a)  ->  0 
qualitatively  after  minimization  by  OP. 

Two  evolutions  are  said  to  be  qualitatively  similar  if  at  least  one 
of  the  next  conditions  is  verified : 

-  there  is  no  error  in  the  sense  of  (1), 

-  the  evolutions  have  similar  shapes  in  the  sense  of  (2), 
every  combination  of  these  two  conditions  is  interpreted. 

The  aim  of  the  comparison  algorithm  is  to  detect  as  precociously  as 
possible  complete  failures  and  dnft  failures.  The  first  ones  are 
generally  characterized  by  significant  drops  in  the  evolution  of  the 
error  functions  and  the  second  ones  by  smoothly  increasing  error 
functions.  Other  causes  of  divergences  such  as  imprecision  inherent 
to  the  modelling  or  disturbances  of  the  measurement  system  must 
not  entail  a  defect  detection. 


3.  INTRODUCTION  OF  A  FUZZY  ANALYSIS 
3.1.  Vague  facts  and  fuzzy  set  theory 

For  our  purpose,  we  use  the  numerical  values  computed  by  (1)  and 
(2)  but  they  have  to  be  interpreted  symbolically  to  lead  to  a 
decision : 

-  the  simulation  is  identical  with  the  process  evolution,  there  is  no 
problem ; 

-  the  simulation  is  very  different  from  the  process  evolution,  a 
diagnosis  has  to  be  started ; 

-the  simulation  does  not  exactly  correspond  to  the  process 
evolution,  a  more  precise  study  has  to  be  carried  out  before  a 
decision  is  taken. 

For  instance,  the  error  c(t)  could  be  23%  whereas  the  corresponding 
decision  rule  could  be :  "if  the  error  is  high,  a  diagnosis  is  started". 
If  the  attribute  "HIGH"  is  based  on  a  single  threshold,  the  system 
using  this  decision  rule  may  react  very  differently  if 
e(t)  *  THRESHOLD.VALUE  +  8  or 

e(i) »  THRESHOLDS ALUE  -  5,  regardless  of  the  magnitude  of  8. 
The  difficulty  here  docs  not  come  from  the  improper  choice  of  the 
value  of  the  threshold  THRESHOLDS  ALUE  but  from  the  lack  of 
existence  of  a  well  defined  threshold  that  might  separate  error  values 
compatible  with  ’’HIGH"  from  values  that  arc  incompatible.  "HIGH"  is 
a  vague  predicate,  and  should  be  modelled  as  such  [6] 

ZADEH’s  fuzzy  set  theory  offers  a  very  simple  and  elegant  tool  for 
dealing  with  vagueness  of  terms,  especially  when  they  clearly  refer 
to  one  or  several  numerical  scales  [7]. 

Using  this  approach,  a  vague  condition  such  as  "error  €  (t)  is  high" 
is  represented  by  means  of  fuzzy  intervals  as  represented  in 
figure  3 : 


Figure  3 .  Fuzzy  intervals 


The  relevance  of' the,  terms  F:  HIGH  NEGATIVE  (HN), 
NEGATIVE  (N),  ZERO  (0).  POSITIVE  (P)  and  HIGH  POSITIVE  (HP), 
with.jegard  to  the  numerical  value  of  e(t),  is  expressed  as  a 
compatibility  function.  It  means  that  for  any  value  e(t)  €  S,„a 
degree  jip(e)  e  [0 , 11  can  be  obtained  that  evaluates  the  relevance 
of  the  term  F  to  the-situation  described  by  the  value  6 -(or 
equivalen  tly,  the  degree  of  compatibility  of  e  with  F)-  Note  that  Pf<£) 
is  not  a  degree  of  uncertainty.  Indeed  the  situation  is  completely 
.known  (  e(t)  =>  23%  ). 

3.2.  Degrees  of  relevance 

The  various  degrees  of  relevance  or  degrees  of  compatibility  used  in 
our  comparison  algorithm  are  now  presented.  Each  calculated  or 
measured  vaiue  e(t),  or  c(t0,  a), ...  is  characterized  by  a  degree  of 
relevance .  ha(£)*  °r  Ra(c).  ...»  which  associates  it  with  its 
compatibility  with  the  vague  predicate  A. 

Pa(£)  is  associated  with  the  criterion  (1),, where  A  may.  be  the 
symbolic  value  HIGH  POSITIVE  (HP),' POSITIVE  (P),  ZERO  (0), 
NEGATIVE  (N)  or  HIGH  NEGATIVE  (HN). 

Two  other  degrees  are  affected  to  the  criterion  e(t0,  a),  (2).  The 
optimal  proximity  algorithm  provides  not  only  the  evaluation  of 
e(t0,  aXnm.  the  minimal  quadratic  deviation  between  the  segmented 
evolution  and  the  simulated  one,  but  also  the  translation  vector  that 
gives  this  minimum.  The  norm  of  this  vector  corresponds  to  the  cost 
of  the  optimal  proximity,  the  importance  of  the  translation  to  be 

earned  ou.  is  g.ven  by  (3) :  ±  ^  ^  ^  (3). 

where  margin  and  8max  correspond  to  the  components  of  the  largest 
allowed  translation.  Consequently  the  minimal  quadratic  deviation 
and  the  corresponding  cost  are  associated  with  the  interest  of  the 
optimal  proximity  calculus. 

Ha(c)  corresponds  to  the  compatibility, of  e(tO,  a)  with  the  symbolic 
value  A,  which  may  be  HIGH  (H)  or  ZERO  (0). 

Ha(c)  corresponds  to  the  compatibility  of  the  translation  cost  with 
the  symbolic  value  A,  which  may  be  HIGH  (H)  or  zero  (0). 

The  interest  of  the  optimal  proximity  can  be  represented  through  a 
symbolic  scale,  that  takes  into  account  at  one  and  the  same  time  the 
minimal  quadratic  deviation  e  and  the  cost  c  of  the  operation.  The 
degree  (OP)  is  a  function  of  Pa’(c)  and  PA"(c).  A  may  be  "BAD" 
(if  the  optimal  translation  has  not  resulted  in  similar  shapes)  or  "OK" 
in  other  case.  Let  us  set : 

-  when  the  error  (I)  is  P  or  N : 

Fbad  (op)  *  Mh  (c)  +  Mh  (c)  -  Ph  (c)  *  Mil  (c) 

POK  (OP) n  Mo  («)  •  Po  (c) 

-  when  the  error  (1)  is  HP  or  HN :  Pbad  (OP)  “  1  and  Pok  (OP)  “  0. 
The  decision  to  start  a  diagnosis  is  represented  as  a  symbolic  scale 
that  expresses  the  risk  that  the  situation  is  incidental  whereas  it 
corresponds  to  a  knowledge  expressed  by  terms  in  the  natural 
language.  Pa  (decision)  is  the  weight  attributed  to  a  decision :  A 
may  be  DIAGNOSIS  or  OK.  Let  us  set : 

^diagnosis  (DECISION)  *  Pop  (bad)  •  pc  (P  or  N), 
with  regard  to  the  definition  of  two  similar  evolutions,  and 
p0K  (DECISION)®  1  -PDIAGNOS1S  (DECISION). 

3.3.  Results 

To  detect  complete  failures,  the  deviation  between  the  simulated 
evolution  and  the  measured  one  is  generally  so  large  that  the  optimal 
proximity  algorithm  is  unable  to  reduce  enough  the  quadratic 
deviation  m  the  permissible  part  of  the  plane.  So  the  cost  c  or  the 
minimal  quadratic  deviation  are  usually  very  high. 

With  regard  to  the  drift  failures,  the  result  of  the  optimal  proximity 
algorithm  is  generally  successful  but  when  the  test  is  repeated,  the 
cost  of  the  calculus  is  increasing  in  time  (increasing  norm  of  the 
optimal  proximity  translation  vector). 

Other  error  sources  and  noises  generally  correspond  to  low  cost  or 
low  e(t0,  a)min. 


5.  DISCUSSION 

The  comparison  or  defect  detection  algorithm  is  a  local  treatment 
where  each  variable  is  individually, studied  as  in  classical  alarm 
systems.  Wc  arc  now  working  on  an  over  refinement  of  the  previous 
step  .  a  pre-diagnosis.  When  each  variable  has  been  individually 
examined  by  the  comparison  task,  the  pre-diagnosis  is  started  when 
simulated  and  real  evolutions  of  at  least  one  variable  arc  considered 
to  be  significantly  different  by  the  defect  detection  The  idea  is 
originally  that  the  defect' proposed  by  the  comparison  is  not 
necessarily  a  primary  defect :  it  may  not  correspond  to  the  genuine 
source  of  the  malfunction.  The  aim  of  the  pre-diagnosis  is  to  find  the 
root  variable  from  the  detection  variable.  The  analysis  rests  on  the 
.  causal  graph  supporting  the  simulation.  It  allows  the  localization  on 
a  subgraph  which  is  suspected  to  be  in  fault  This  subgraph  is  a 
consistent  path  whose  root  is  a  candidate  for  the  origin -of- the 
malfunction.' 

The  aim  of  the  modelling  by  Qualitative  Responses  is  to  describe  the 
global  behaviour  of  a  process.  This  model  is  not  based  on  a  sound 
physical  analysis  of  the  various  phenomena  occurring  in  the 
process  ■  it  provides' an  approximation  of  the  behaviour  of  each 
variable  in  the  causal  graph,  whose  abstraction  level  is  more  adapted 
to  supervision  tasks.  A  specific  representation  of  signals  has  been 
created  thanks  to  the  notion  of  event,  and  the  shape  of  a  signal  is  as 
important  as  its  values,  that  is  why  a  specific- defect  detection 
algorithm  had  to  be  developed  to  check  consistency  of  the  complete 
jyslem.  It  is  based  on  fuzzy  calculus  for  various  quantities,  such  as 
insta.t»ancous  or  integral  errors. 

Furthermore,  the  strength  of  the  algorithm  is  due  to  us- weak 
constrain  s  (qualitatively  similar  evolutions)  that  do  not  lead 
necessanb*  to  defect  suspicion  even  when  a  classical  simulation  error 
is  over  a’  %ivcn  threshold.  The  choice  of  thresholds  is  then  less 
drastic :  t.tcy  may  be  set  without  any  accurate  quantitative 
information.’ 

However  it  i.  clear  that  the  robustness  of  the  algorithm  is  more 
important  than  .precociousness  for  it  has  to  work  on  line  by  taking 
into  account  all  the  disturbances  of  the  measurement  system  and  the 
modelling  approximations. 
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Attract.  Order  of  QaaLutJre  *r«  bo3t  on  *  p*rtilko  of  rhe  real. 

■2**  »nd  ute  Qopeiator*  which  *2ow  iomt  Qu*!iUt!ye  ca’-tJu*  »»Hkh  U  mjoired  lo  be 
conus  lent  with  r«J  number*  and  operations.  Kowerer  the  table* of  the  Qoperatce*  depend 
on  the  position  of  the  boundary  naanber*  detemuninj  the  partition.  In  thi*  paper,  a  study 
of  all  the  possible  symbolic  table*  for  the  Q-rum  is  performed.  Robutlness  and  preouon 
of  U.U  tablee  ere  analized.  Moreover,  it  is  shown  that  the  boundaries  can  vary  np  to  some 
limit  stifl  keeping  tbe  Q-snm  table  invanatt  Finally,  •  particular  attention  is  payed  to 
the  much  snupJer  symmetric  partition  case,  which  is  useful  in  many  practical  applications 

I.  Introduction 

Qualitative  models  ate  characterized  by  a  middle  position  between  heuris¬ 
tic  models,  .requiring  solely  symbolic  manipulations,  and  pure  numerical  mod¬ 
els,  using  exclusively  calculations.  As  Song  as  engineering  problems  are  eon. 
sidered,  one  deals  with  physical  quantities.  The  qualitative  algebraic  struc¬ 
ture  should  thus  remains  consistent  with  real  numbers  and  operations.  The 
specific  class  of  Order  «/  ifognituJt  QtMt fire  AfycAras,  which  are  built 
from  partitions  of  the  real  line,  have  been  defined  this  way  ((4j,(7))<  This 
class  includes  the  usual  Signs  Algebra  based  on  the  roughest  partition  of  SR. 
+  *  0  *  *. 

However,  the  tables  of  Q-operators  are  not  independent  of  the  specific 
partition  of  the  real  line  we  are  working  with.  This  means  that  if  the  position 
of  the  numbers  determining  the  partition  are  changed,  the  table  for  the  Q- 
operator  might  be  different.  In  this  paper,  a  whole  study  is  performed  for  the 
Q-$um  However,  it  is  important  to  mention  that  the  procedure  remains  valid 
for  any  other  Q-operator,  the  Q-product  in  particular.  A  study  of  rvlusfiuss 
tables  is  made,  and  it  is  shown  that  a  radius  around  the  boundary  numbers 
can  be  found  so  that  the  variations  of  the  boundaries  up  to  this  radius  still 
preserve  the  same  symbolic  table. 

II.  Preliminaries:  Q-pairs  defined  on  the  real  lino 

Definition  IJ.l  Consider  a  nonempty  set  S  ( (fairer*  of  dtstnpUon)  and  an 
order  relation  $  defined  on  5.  Q«a/itafirc  tqtaliff  ts  {Q- equality)  is  defined 
as  the  folowing  binary  relation  induced  by  $  on  5:  a  fcs  6  if  there  exists  *  €  S 
such  that  *  <  a  and  z  <  A. 

A  Q**hUtn<  pair  (S,«)  is  defined  as  a  set  S  and  a  Q-equality  defined 
©a  S. 

In  practice,  the  problem  is  generally  to  represent  physical  quantities  which 
are  known  with  poor  precision  so  that  real  numbers  cannot  be  used  Qual¬ 
itative  models  provide  a  solution  as  long  as  they  remain  consistent  whith 
real  numbers  and  real  operators.  One  way  to  do  so  is  to  define  the  Q- 
pairs  from  a  partition  of  the  real  line.  This  paper  considers  the  Q-pair 
obtained  from  partitioning  the  real  line  X  into  seven  classes  with  associ¬ 
ated  labels'  negative  large  (N V),  negative  medium  (NM),  negative  small 
(NS),  *ert  ,\0),  positive  small  (PS),  positive  medium  (PM)  and  positive 
large  (PL)  (see  Fig.  I).  The  set  Si  a  [NL,NM,  NS, 0,  PS,  PM,  PL),  or- 
o 

NL  |  NM  .  NM  t  PH  ,  l‘M  ,  PL 

“  *»  a  <1 

Figure  I:  The  partition  of  X 


III.  The  Qualitative  sum 

From  now  on,  focus  is  put  on  the  particular  case  of  the  Q-sum  ©  'However  it 
is  important  to  mertion  that  a  similar  study  can  be  done  for  any  Q-operator 
(tbe  Q-product  for  instance).  The  important  concept  of  consistency  of  Q- 
operators  does  not  go  without  problems.  In  particular,  the  qualitative  result 
may  differ  when  the  boundaries  of  the  intervals  are  changed  For  example, 
if  10  is  the  upp«  boundary  of  PS,  then  PS®  PS  =  (PS,  PM]  if  the  upper 
boundary  of  PAf  is  greater  than  or  equal  to  20,'and  if  it  is  less  than  20  then 
PS®  PS  =  +.  Aitough  there  are  some  invariant  results,  like  for  example 
NL®NLn  NL,  the  symbolic  table  for  the  operation  ©  depends  on  the  real 
numbers  a,b,e,d  defining  the  partition  Si  All  possible  symbolic  tables  for 
the  Q-sum  ©  are  included  in  Table  1  (half  of  it  only  has  been  filled  because 
©  is  strictly  commutative). 
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Table  1:  AH  the  Q-sum  tables 


From  tbe  qualitative  point  of  view,  two  symbolic  Q-sum  tables  are  said 
to  be  Q-equals  if  every  couple  ©f  corresponding  squares  are  Q-equaU.  Taking 
into  account  this  definition,  it  is  easy  to  see  that  any  ©  table  is  Q-equal  at 
least  lo  one  of  the  two  tables  included  in  Thble  2 «  So,  it  can  be  concluded 
that  there  only  exist  two  tables  qualitetively  differents 

Nevertheless,  not  all  tbe  combinations  in  Table  1  are  possible,  for  example 
PS®PS  =  +  is  incompatible  with  PM®  PM  -  [PAf,  PL],  since  PS®PS  = 
+  if  and  only  if  d  <  2c  and  PM  ®  PM  =  (PAf,  Pl\  if  and  only  if  d  >  2c 


Title  2:  Qualitetively  different  ©-tables 


In  the  following,  our  first  problem  «  to  discern  how  many  tables  ate 
possible  and  to  de‘ermine  them  Given  (a.t.c.d)  €  R*  with  a  c.  6  <  0  < 
c  <  d,  then  the  symbolic  table  of  ©  is  completely  determined.  Whence  if 
we  consider  D  =  {(<t,A,c,<f)  €  R*,a  <.  A  <  0  <  c  <  d)  w«  can  establish  the 
following  equivalence  relation' 

Definition  III.1  Two  elements  of  D  are  said  to  be  ©c{*iWcaf«  if  they 
induce  the  same  table  of  ©. 


dered  by  NL  <  NM  <  NS  <  0  <  PS  <  PM  <  PL,  constitutes  the 
highest  level  of  specification  of  our  Q-modcl,  The  lowest  one  is  total  unde- 
termination  (?).  Between  the  highest  level  of  specification  and  the  lowest 
one  given  by  ?,  the  actual  partitioning  induces  four  ordered  levels  of  spec¬ 
ification.  Interpreting  the  labels  as  intervals,  the  four  levels  correspond  lo 
the  union  of  two,  thr*e,  four,  and  five  adjacent  intervals.  Labels  within 
these  levels  are  denoted  (a,0\,  where  a,0  €  S%  —  {0}  and  <*  <  0.  (<*,$ 
represents  the  interval  obtained  from  the  union  of  the  intervals  associated 
with  a  and  0  and  the  ones  in  between  Our  universe  of  description  is  thus 
5  a  Si  U NS\,  ,  (NL,  PM],[NM ,  PL),*}  and  tbe  order 
relation  <  considered  in  S  is  the  inclusion. 

Definition  II.2  Let  ©  be  a  binary  operation  on  5  and  let  x  be  a  binary 
operation  on  ®  is  said  to  be  const jfeaf  vtth  x  if  for  any  a,0  €  5,  <*  ©  0 
is  the  minimum  set  of  S  (with  respect  to  the  inclusion)  that  contains  the 
interior  of  a  x  0. 


IV.  Tho  conditions  related  to  Q-sum  table 

Taking  into  account  Fig.  1,  the  results  in  non  fixed  squares  of  Table  i  depend 
on  the  relations  of  a,  b,  c,  and  d  For  instance,  the  result  of  NL  ©  PS  is 
conditioned  in  tbe  following  way. 

{(ATL.tfAfJ,  if  e  <  A-a, 

if  A— a  <e<  -a, 

(tfL.PS),  if  -a<<. 

Proceeding  in  a  similar  way  for  the  eleven  undetermined  squares,  the 
conditions  determining  the  table  can  be  summarized  in  an  algorithmic  way 
as  follows* 

1,  Compare  A  with  f. 
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2;  Compare  e  with  i — a,  — £  and  —a:  Depending  on  the  result  of  applying 
the  first  rule,  the  situation  of  these  three  cumbers  b-a,  -b  and  -a  is 
different  We  have  the  three  following  possibilities  in  the  positive  half 
real  line: 

i)  If b <  5, then  0<6-a<-£< -a; 

ii)  If  6  —  then  0<  t — a  —  —4  <  -a; 

111)  If  t  >  ||  then  0  <  -•£<£  —  a  <  —a. 

Then,  depending  on  the  situation  the  possibilities  for  c  are  different 

d  Compare  d  with  those  of  these  numbers  b-a,  -b  and  -a  which  are 
greater  than  e  and  with  c  -  b,  c  -  a  and  2c. 

The  arboreseenee  of  the  above  algorithm  provides  all  the  possible  Q-sum 
tabid:' The  number  of  possible  different  tables  is  201. 

V.  The  ©-equivalence  classes.  Robustness  and  Precision 

From  the  outlined  conditions  it  is  evident  that  any  ©-equivalence  class  cor¬ 
responds  to  a  connez  domain  of  D.  So,  given  a  point  p  in  D,  there  always 
exists  a  convenient  direction  of  move  from  p  such  that  the'i  associated  ta¬ 
ble  remains  invariant  On  the  other  hand,  there  exist  ©-equivalence  classes 
with  hypervolum  0,  and  another  kind  of  classes  with  infinite  hypervolum.  Of 
course,  if  the  ©-equivalence  class  of  a  point  p  ©  D  has  hypervolum  0,  1  e 
it  is  a  strengh  line,  only  one  direction  of  move  from  p  preserves  the  Q-sum 
table.  If  the  ©-equivalence  class  of  p  has  infinite  hypervolum,  either  a  sector 
of  directions  of  move  from  p  preserve  the  table  if  p  »  on  the  frontier  of  the 
class,  ox  all  directions  if  p  is  an  interior  point  of  its  tlass. 

V.l  Robustness.  Given  two  tables  Tt  and  T},  the  question  is  when  is 
Tt  more  robust  than  T/?  That  is,  if  Ct  and  C/  denote  their  associated  ©- 
equivalence  classes,  when  is  the  domain  Ci  "greater*  than  the  domain  Cy  7  In 
order  to  compare  the  "size"  of  these  domains,  we  compare  first  the  possible 
results  of  the  sum  a  00  for  any  pair  (or,/?)  €  $?,  that  is  for  every  square  of 
the  table.  Ihe  robustness  ol  the  whole  table  is  a  consequence  of  the  stability 
of  the  result  of  the  sum  within  every  square.  Of  course,  fixed  squares  are 
not  considered  Since  some  eondilions  give  rise  to  infinite  hypervolumes,  the 
comparison  is  performed  by  using  a  geometrical  entenuro. 

Given  (a,0)  ©  $},  let  x  and  y  be  two  possible  results  ot  a  00,  then  x 
»  more  stable  than  y  if  the  domain  D»  C  O  corresponding  to  the  result  x 
contains  the  image  of  the  domain  Dr  C  D  by  a  rigid  movement  of  R*  A 
table  is  said  to  be  more  stable  than  another*  if  all  the  results  of  the  pairs  a©/? 
corresponding  to  the  first  are  more  stable  than  those  that  corresponding  to 
the  second  one. 

The  relation  to  be  more  ilsMe  <Aan  between  tables  is  an  order  relation, 
but  is  not  a  total  order.  Table  3  shows  the  most  stable  Q-sum  table  and 
Cm  =t  ©  D;  $  <  6,c  >  -o,d  >  2c)  is  it  associated  ©equivalence 


Table  3:  The  most  stable  Table 


V.2  Precision,  Another  aspect  to  consider  is  the  precision  of  the  Q-sum  ta¬ 
bles.  This  concept  is  related  to  the  fact  that  the  result  of  aQ0  for  (or.jd)  €  Sf 
may  he  an  element  of  Si  or  not  In  the  first  case,  it  has  the  maximal  preci¬ 
sion  allowed  by  our  algebra.  Hence,  the  more  information  a  table  provides, 
lb.  more  precise  it  can  be  said  As  robustness,  the  precision  of  the  tables  « 
studied  "square  by  square*  by  comparing  the  results  of  the  sum  a  00  for 
any  of  the  pairs  (cr,0)  €  5?  which  do  not  correspond  to  fixed  squares.  The 
precision  of  the  tables  is  a  consequence  of  precision  of  each  of  the  possible 
results  in  the  squares.  However  a  measure  for  the  degree  of  precision  of  the 
result  x©Sofa©j?canbe  provided  in  the  following  way.  a)  if  x  ©  S», 
then  x  has  degree  of  precision  l;  b)  otherwise,  the  decree  of  precision  of  x  is 

From  these  morymaf  degrees,  the  d'gree  of  precision  of  the  vhote  table 
is  obtained  by  taking  the  minimum  of  the  degrees  of  precision  of  all  the  no 
fixed  squares  of  the  table.  A  table  is  said  to  be  more  precise  than  another,  if 
the  degree  of  precision  of  the  first  is  greater  than  the  degree  of  precision  of 
the  second.  Ibis  relation  »  a  pre^otder  that  has  neither  a  maximum  nor  a 
minimum. 


VI.  Sensitivity  analysis 

Thw  section  deals  with  the  problem  of  finding  how  the  boundaries  of  the 
intervals  of  the  partition  of  35  can  fredy  move,'  in  a  continuous  way,  keeping 
the  symbolic  Q-?um  table  invariant.  From  the  topological  point  of  view,  two 
kinds  of  ©-equivalence  classes  must  be  distinguished,  the  open  classes  and  the 
•  ones  that  have  the  property  that  all  their  points  are  frontier  points  First,  the 
study  is  performed  for  the  Q-sum  tables  with  open  associated  ©equivalence 
class. 

Let  p  =  (ao.to.co.da)  be  a  point  of  D  such  that  its  ©  -equivalence  class 
Cf  is  an  open  set  in  Jt*.  Moving  the  four  coordinates  of  p  in  a  continuous 
way  and  still  remaining  inside  of  Cy,  can  be  interpreted  as  the  research  of 
a  common  radius  r  for  this  four  coordinates,  such  that  every  coordinate  can 
vary  inside  its  correspond^  interval  (See  Figure  2)  This  is  equivalent  to 


*0  do 

Figure  2:  Moving  the  boundaries 


searching  an  hypercube  with  center  p  and  side  2r  totally  contained  in  the 
class  C„  thas  is: 

S“r(p)  =)do  -  r,  a0  -f  r(x]£o  -  r,  50  +  rjxjc©  -  r,  co  +  r(x)do  -  r,  d0  4-  r(. 

Moreover,  it  is  possible  to  find  the  maximum  r  such  that  S,(p)  C  Cf  To 
this  end,  it  is  sufficient  to  impose  that  the  eigth  boundaries  of  the  intervals 
of  variation  in  Figure  2  satisfy  the  results  of  the  conditions  corresponding  to 
Cf. 

Finally,  we  analyze  the  case  in  which  C,  is  a  set  such  that  all  their  points 
are  frontier  points.  Then  at  least  one  of  the  conditions  that  determine  C,  is 
an  equality.  Searching  for  a  radius  r  for  free  movement  of  o,  6,  c,  d,  implies 
that  all  the  coordinates  that  take  part  in  any  equality  condition  must  remain 
fixed,  and  the  radius  can  only  be  found  for  the  other  coordinates 
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Appendix:  Tho  symmetric  ease 

Numerous  practical  problems  can  be  modelled  by  using  a  symmetric  parti¬ 
tion  which  provides  enormous  advantages  in  terms  of  complexity  This  case 
corresponds  to  the  boundaries  of  the  intervals  satisfying  -a  =■  d  -  0  and 
4cc  =  o.  There  arc  only  three  different  tables  for  ©,  instead  of  201  in 
the  general  case’,  and  they  are  qualitetively  equals  These  tables  are  in  corre- 
spondance  with  the  three  domains  in  the  plane  Oa0  represented  in  Figure  3 
Since  $2  >0i,  the  first  two  domains  are  injectable  m  the  third,  and  therefore 
the  table  that  corresponds  to  the  case  0  >  2a  is  the  most  robust  one 


Figure  3 
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The  purpose  of  our  research  is  to  assess  the  iJidamestal 
limitations  of  qualitative  simulation.  la  the  past  decade,  sev¬ 
eral  research  groups  (lj,  [2],  (3)  developed  simulation  techniques 
based  on  a  common  principle  which  we  call  Qualitative  Simula¬ 
tion.  Based  oa  lb«r  empirical  results,  we  deeded  to  xavesrigrte 
whether  gradual  further  developments  of  these  techniques  could 
ever  lead  to  acceptable  results. 

We  identified  three  combinations  of  fundamental  design  de¬ 
cisions  in  Qualitative  Simulation,  each  of  which  is  characteristic 
for  the  approach,  is  hard  to  revise,  and  has  important  negative 
consequences.  In  our  opinion,  those  consequences  are  hardly 
ever  acceptable  in  practice.  The  least  radical  revisions  required 
to  overcome  these  limitations  amount  to  a  complete  redesign. 

In  the  following  sections,  these  (combinations  of)  design  de¬ 
cisions  and  their  consequences  will  be  disussed  briefly.  For  more 
information  on  this  topic  we  refer  to  the  first  part  of  our  Ph.D. 
Thcsis  (4]. 

1  Only  Signs  of  Derivatives 

A  state  of  a  system  with  n  state  variables  can  be  visualized  by 
a  point  in  an  n -dimensional  state  space.  The  solution  £{f)  of  a 
set  of  the  differential  equations  ^  =*  «p(z)  can  be  visualized  by 
a  so-called  trajectory. 

The  initial  set  of  ordinary  differential  equations  is  satisfied 
iff  both  the  signs  and  the  absolute  r clues  of  the  left  and  right 
terms  are  equal,  i.e.,  Sign(^)  —  Stgn{P(x) )  and  Abstfi)  = 
Abs[b\x)).  In  Qualitative  Simulation,  the  equality  of  the  abso¬ 
lute  values  is  not  verified,  hence  the  direction  of  each  piece  of  a 
trajectory  can  be  known  only  up  to  an  n-duncnsiona!  generali¬ 
sation  of  a  quadrant  accurately.  A  quickly  diverging  bundle  of 
“possible"  trajccones  is  thus  predicted.  The  predicted  trajecto¬ 
ries  seldom  have  interesting  properties  in  common,  hence  little 
is  actually  predicted  about  the  exact  solutions. 


long  term  d 
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A  consequence  is  that  ordinal  relationships  of  a  stele  variable 
as  a  function  of  time  can  only  be  determined  within  the  same 
period  of  soaotonic  behavior.  The  ordinal  relationships  between 
nc n- adjacent  local  extremes,  like  two  adjacent  iocal  maxxmmss, 
thus  can  not  be  determined.  Another  consequence  is  that  a 
revision  of  the  first  combination  of  design  derisions  (only  signs  of 
derivatives)  can  not  have  any  favorable  effect  unless  this  second 
combination  of  design  decisions  is  revised. 


3  Prediction  Step-Size  &  Orthogonal 
Quantification  of  State  Space 

In  the  traditional  approaches,  the  state  space  rs  quantified  by 
a  grid,  the  resolution  of  which  is  determined  by  the  number  of 
digits  used.  In  Qualitative  Simulation,  the  state  space  is  »mi 
lardy  quantified  by  an  orthogonal  grid  into  so-called  qualitative 
states.  For  example,  m  the  3-dimensional  state  space,  a  quali 
tativc  state  is  a  cube,  bounded  by  26  qualitative  boundaries  (6 
sides,  12  edges  and  8  comers). 

The  prediction  step-size  is  the  length  of  a  displacement  of  a 
short  term  prediction.  |x(f,)  -  x(/,_,)I  In  Qualitative  Simula 
non,  the  prediction  step  size  has  been  chosen  so  small  that  short 
term  predictions  correspond  with  transitions  between  adjacent 
qualitative  states. 

This  third  combination  of  design  derisions  has  three  cons©- 


2  State  Space  on  Ordinal  Scale  &  Markov  ^  revwon  of  the  first  combination  of  design  decisions  (only 


Sequence  of  Short  Term  Predictions 

In  Qualitative  Simulation,  long  term  predictions  arc  formed  by 
independently  chaining  short  term  predictions,  just  like  in  the 
traditional  numeric  aproachcs.  The  state  variables  are  repre¬ 
sented  on  an  ordinal  scale.  By  definition,  on  an  ordinal  scale 
only  the  order  of  the  values  is  represented,  the  lengths  of  the 
intermediate  intervals  are  unknown. 

A  problem  with  this  combination  of  design  dcasions  is  that 
the  lenghts  of  the  short  term  displacements  can  not  be  repre¬ 
sented  as  an  intermediate  result.  However,  in  general,  this  in¬ 
formation  is  required  for  determining  the  signs  of  the  composed 


signs  of  derivatives)  can  not  have  any  favoraWa  effect  unless  this 
third  combination  of  design  derisions  is  revised.  A  state  tran¬ 
sition  between  two  adjacent  qualitative  states  is  possible  iff  a 
trajectory  exists  that  successively  visits  those  qualitative  states 
This  is  determined  by  the  direction  of  the  derivative  vector  at 
the  intermediate  qualitative  boundary  Because  of  the  orthog¬ 
onal  quantification  of  the  state  space,  t!>at  direction  must  be 
known  only  up  to  an  n-dimcnrional  generalization  of  a  quadrant 
accurately,  only  the  signs  of  the  derivatives  can  thus  be  used 
effectively. 

The  predictions  arc  d  rowning  is  rounding-off  errors  Rou  ndmg 
off  errors  arc  of  the  same  magnitude  as  the  diameters  of  the  qua! 


1743 


iuisve  sizus.  Beexssc lit pct£e&asUp^reha beta tlxsa 
tqz3ltotbt£asxUTcithe£xstUac£tbeqs^hihesUi<st 
the  ihra  term  day^emcsca  art  dri? xzssgarasa^sg-cSericex. 

Toe  xrcagg  (avenged  over  cdnl  qratorirc  suits)  rmmbe: 

pcssik  qsaSawt  aaccessog  states  pc  qa^uure  suu  a 
BF  ~  2*  -  1,  vice  sstbt  number  c£  stale  a-uHe.  If  we 
2SSSZX  till  die  suit  space  has  act  beta  quantified  along  tra yxr 
tones,  then  tins  nemer  es  adepesdezi  cf  lie  acted  vector  fidd 
{ke,&»acf<££etgal  equations).  Tie Branching  Fader  BF 
» defined  as  tie  qnedera  cf  tic  total  number  cf  Pcssihk  Suit 
TBssao  #PST,  and  tie  total  cumber  cf  QeaSiarrvt  Suits 
i~QS. 


£PST 

Breadaay  Fedor  —  ZQS  ' 


SPST  tQB 
£QB  '  SQS 


(2) 


la  bw  cf  QuaEtttfve  Simda&sj,  let  us  assume  tiat  each  qwJ- 
rtatrre  boundary  is  rmeoected  by  trajectories  a  only  oae  direc¬ 
tion,  hence  ~J§£  ~  1-  Tie  number  cf  qualitative  boundaries 
pc  cjcaEuivc  state  depends  only  upon  the  dimension  of  lie 
stale  space  ns  ~  2"  —  1.  Tie  average  cumber  of  predicted 

qwEtetht  behaviors  is  lies 


^PredictedBehcviors  ~  BF"  ^  (2"  —  1)’  (3) 

where  s  is  tie  comer  of  prediction  steps. 

4  Misconceptions 

The  article  of  Fouchd  &  Mefin  [5]  seems  to  contradict  with  our 
results  twice  with  respect  to  determining  behavior  over  periods 
of  non-rnonotonic  behavior,  and  with  respect  to  the  practical 
usefnllncss  of  Qualitative  Simulation. 


4.1  The  first  misconception 

Focche&  Mffin  have  shown  that  Qualitative  Simulation  in  com- 
binatwn  with  an  appropriaie  model  can  determine  the  decreasing 
amplitude  of  the  the  oscillations  of  a  non-linear  damped  oscilla¬ 
tor. 

We  claim  that  as  a  consequence  of  the  second  combination 
of  fundamental  design  decisions,  ordinal  relationships  of  a  stole 
variable  as  a  function  of  time  can  only  be  determined  within 
the  same  period  of  monotonic  behavior.  Consequently,  if  “posi¬ 
tion"  and  “velocity"  arc  state  variables  of  that  oscillator,  then 
their  decreasing  amplitude  can  not  be  determined  by  Qualitative 
Simulation. 

A  state  variable  is  a  variable  only  the  derivative  of  w  hich  is 
determined  by  the  (ordinary)  differential  equations  ±SaleVar  = 
F(SlateVcrs,InjmtVars)i  no  instantaneous  relationships  be¬ 
tween  state  variables  can  exist.  Instantaneous  relationships  de¬ 
termine  the  values  of  non-stalc  variables  Non  -  StateVar  = 
G  (Slated  ars,  fnputVars). 

Ordinal  relationships  of  n on-stole  variables  as  a  function  of 
time  can  be  specified,  even  over  periods  of  non-monolonic  be¬ 
havior.  Let  us  assume  that  there  is  only  one  state  variable 
called  ‘'time" »  That  time  Is  specified  to  increase  monotonically 
2 l*m&  s  Constant  >  0.  The  function  G(timc)  can  specify  any 
behavior. 

Although  the  proposal  of  Fouchfi  Is  M£!in  is  more  sophisti¬ 
cated,  it  is  nevertheless  based  on  the  same  principle  of  specifying 
a  monotonically  decreasing  state  variable  called  “energy".  The 
variables  “position"  and  “velocity"  arc  instantaneously  related 
with  the  variable  “energy",  and  thus  can  not  be  state  variables 
any  more.  Our  claim  thus  docs  not  contradict  with  the  experi¬ 
mental  result  of  Fouchc  is  Mehn 


4-2  The  second  misconception 

ficcteLMdotasathi  rib es  o/r.pt  dui  Q.rFtativt 

Ssstorca  can  lo-rt  a  cstf=3  ico-tnml  problem.  We  rjrte 
with  then  statement  that  the  sjfccaua  a.  cscfuS  and  ooo  tririal 
However,  «  disagree  with  the  sutocat  ihl  the  solution  has 
been  found  by  Qualitative  Sgrhtice. 

The  cays  costribsuoa  let  finding  that  jofslscc  cacscsU  cf 
the  isqnnlwj  manual  analysis  of  iht  sys.  :m.  We  encourage 
the  authors  u>  costume  automating  that  part.  However, we think 
that  the  subsequent  oat  of  Qualitative  Sicshun  as  jest  a  de¬ 
tour. 


5  Conclusion 

We  identified  three  combination*  of  fundamental  design  dcrisSocs 
in  Qcabt-'aive  Simulation,  each  of  which  is  characteristic  for  the 
approach,  «s  hard  to  revise,  and  has  important  negative  cos 
sequences.  In  our  opinion,  those  consequences  are  hardly  ever 
acceptable  in  practice.  The  least  radical  revirions  required  to 
overcome  these  limitations  amount  to  a  complete  redesign. 

Qualitative  Simulation  is  practically  useless  and  irreparable, 
and  should  therefore  be  abandoned. 
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Abstract:  Thri  paper  pmcti  scree  probSan  ococsccd 
derirg  ^aEcive  simulation  a?  dynastic  syces *.  review* 
existing  soJcticos  tad  SJsstrates  then  ca  a  ccc-wnal 
example. 

Introduction 

Qualitative  simulation  is  an.  approach  for  deriving  behavioral 
inferaation  about  pfcyxieel  system*  that  arc  nsaafly.  modeled  by 
syttesu  of  ordinary  differential  equations.  Several  researchers  (fet 
instance  Calood  |I|%  Foehns  (3)i  de  Kieex  4b  Brown  [2J.  Koipexs  (6). 
Sbcn  &  Latch  IX3J.  Sacks  (12),  Wiltiams  (UJ)  have  dmreloppcd 
qualitative  shculators.  From  oar  poim  of  view,  the  most  domrin- 
independent,  mathematically  sophisticated  algorithm  is  Kmpen'  QSIM 
16).  \9]-  However  some  objections  have  been  fomalttcd  against  the 
foundations  of  qualitative  simulation  (11)  and  it  bas  been  claimed  that 
it  would  always  be  impossible  to  simulate  any  dog -trivial  system.  The 
goal  of  this  paper  is  to  analyze  the  main  problems  encountered  in 
qualitative  simulation,  to  briefly  review  the  techniques  that  address 
these  problems  and  to  show  on  a  Doo-trivial  example  that  QSIM  can 
derive  useful  conclusions  a  boot  xl 

Qualitative  Simulation:  Problems,  Causes 
and  Solutions 

Very  often  first  runs  of  qualitative  simulation  produce  a  lot  of 
behaviors,  from  which  it  is  difficult  to  extract  behavioral  features  of  a 
system.  Behaviors  may  proliferate  either  because  they  are  not 
representid  at  the  appropriate  level  of  description  (in  which  case  they 
differ  very  slightly)  or  because  spurious  behaviors  are  predicted  (Le. 
behaviors  that  do  not  correspond  to  any  solution  of  any  system  of 
differential  equations  consistent  with  the  qc-ditative  model).  In 
practice,  when  one  analyses  an  intractable  tree  of  behaviors,  one  can 
observe  two  phenomena,  referred  to  as  chatter  and. occurrence 
brandling,  that  are  the  major  symptoms  of  the  two  problems: 

•  A  variable  may  exhibit  chatter  if  Us  derivative  is  unconstrained. 
Basically,  if  at  some  time  point  a  variable  transitions  to  a 
critical  point  (that  is,  iu  derivative  becomes  zero)  then  its 
qualitative  value  m  the  next  open  interval  of  time  is  determined 
by  iu  second  derivative.  If  no  information  is  provided  abort  this 
second  derivative  then  simulation  will  branch  on  each  possible 
futuri 

•  Occurrence  branching  happens  when  the  temporal  ordering  of 
two  or  more  events  (for  instance,  a  variable  crossing  a  landmark 
or  reaching  a  critical  point)  cannot  be  determined.  Qualitative 
simulation  then  branches  on  all  the  possible  orderings. 

Causes  of  behavior  proliferation  are  found  in  the  main  assumptions 
which  qualitative  simulation  is  based  on,  and  can  be  dasiified  as 
follows: 

1 .  Representation  of  a  variable's  values: 

1 .  Discretization,  the  domain  of  a  variable  is  partitioned  into 
'landmark"  points  and  intervals  between  them. 

2.  Ordinal  scale:  only  the  ordering  of  landmarks  is  represented. 

2.  Representation  of  a  variable's  derivatives,  in  basic  algorithms, 
only  the  s’gn  of  the  first  derivative  is  represented,  and  higher 
order  derivatives  ate  ignored. 

3.  Local  reasoning:  successors  of  a  given  state  arc  determined  using 
only  that  state,  without  considering  any  preceding  state. 

4.  Mofel  specification;  simulation  results  are  very  sensitive  to  the 
choice  of  .variables  and  landmarks,  as  well  as  constraints  and 
corresponding  values. 

In  the  following  section  we  explain  how  symptoms  and  causes  arc 
related,  and  present  existing  solutions. 

Chatter 

As  mentioned  above  (cause  #2),  only  the  sign  of  the  first  denvative 
is  explicitly  represented.  As  long  as  no  information  about  the  second 
derivative  of  a  variable  is  provided,  simulation  will  branch  whenever  a 
variable  reaches  a  critical  point. 

Possible  solutions  are: 

•  add  a  new  variable  to  represent  the  derivative  of  a  chattering 


variable.  raJ  e.  This  wocjd  ccly  cruier  ie  pcofcJem. 

becacse  the  derivative  of  the  newly  gKodxtd  variable  woeld 
cihlbct  daag  as  well: 

•  find  aa  expression  for  the  second  derivative  of  a  variable,  and  esc 
ii  when  is  Cm  derivative  becomes  zao*(7); 

•  ignore  the  direction  of  change  of  a  variable  so  critical  poems  can 
do  longer  be  feeaed*[7); 

•  partition  the  csrisiortmnat.  graph  into  equivalence,  claries  to 
gather  rimilar  «aCes*[4J. 

Occurrence  Braaehlag 

Indesrnniniijoo  in  temporal  ordering  of  events  may  be  caused  by 
limited  knowledge  about  landmarks  (cause  f  1.2  in  conjunction  with 
cause  #2),  an  inappropriate  model  specification  (cause  14)  or 
sometimes  by  the  local  nature  of  reasoning  (cause  13).  Some  solutions 
exist: 

••  disable  the  possibility  to  create  new  landmarks  during  simulation 
for  some  variables*: 

•  remove  some  landmarks  from  quantity  spaces  so  events  defined 
by  a  variable  reaching  or  crossing  that  landmark  can  no  longer 
happen; 

•  ignore  ihe  direction  of  change  of  a  variable  so  events  defined  by 
critical  points  can  no  longer  happen*(7); 

•  more  specify  constraints  by  adding  corresponding  values; 

•  analyze  similar  behaviors  and  group  those  which  differ  only  by 
temporal  orderings  of  events* (4); 

•  add  partial  quantitative  information  (8). 

Other  symptoms 

Behaviors  may  also  proliferate  for  reasons  that  arc  not  easy  to 
understand  and  which  are  not  associated  with  particular  symptoms.  This 
is  mainly  due  to  the  local  nature  of  reasoning  (cause  13)  in  conjunction 
with  the  discretization  of  a  variable’s  domain  (cause  #1.1;  it  happens 
that  the  validity  of  a  transition  depends  on  the  behavior  in  which  it 
takes  place),  or  an  inappropriate  model  specification  (cause  #4) 
Possible  remedies  include: 

•  disable  the  possibility  to  create  new  landmarks  during  simulation 
for  some  variables  ; 

•  reason  in  the  phase  space  representation  and  eliminate  behaviors 
that  correspond  to  self-intersecting  trajectories  in  the  phase 
plane  of  two  independent  variables  (for  second-order 
systans)*(10). 

•  reason  in  terms  of  energy,  decompose  processes  into 
conservative  and  non -conservative  ones,  and  check  if  the  law  of 
conservation  of  energy  is  satisfied  (5). 

•  reconsider  the  model,  add  new  vanables  and/or  redundant 
constraints. 

•  add  partial  quantitative  information*(8). 

Using  Qsim  in  Practice 

In  this  section  we  will  consider  an  example  of  a  second-order,  non¬ 
linear  system  for  which  analytic  solutions  are  not  known,  but  for 
which  one  can  denve  useful  information  about  us  solutions,  using 
qualitative  simulation.  Our  goal  is  to  show  incrementally  how  the 
techniques  presented  above  can  be  used  to  come  up  with  a  tractable  tree 

of  behaviors.  The  system  (E)  is  jp  *  -  x3  -  y  and  ^  »  x  -  x3y 


Eliminating  y  in  (£)  yields  the  following  equation: 

2}=.4x*§.(iU  1)*. 


As  -  4  x2  ti  0  for  all  x  and  x4  +  1  depends  only  on  x,  one  can 
make  an  analogy  with  a  mechanical  system,  x  representing  the 

position  of  an  object,  its  velocity  and  — j  its  acceleration 


■4x2^  represents  a  friction  force  and  -  (x4  +  1)  x  a  conservative 
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force.  Seda  i  sy£ea  is  <5 supzsrt  cd  jbdet  u^tockiSy  *a 
smc.  We  via  shorn  tiui  this  accshre  docrjtke  cf  (£)”* 
lobaoci  is  ronfimvd  ty  gmlrferr  nactfcoo  pefooed  by  QSIM. 

We  begia  wih  b«3<£ac  a  qsJsiachre  modd  Mi  of  (X).  The  variables 
are  x,  y,  dxJUt.  dyAfe,  32. x3.  -y  aod  x2y,  and  die  cocssuxs  are  dkk  (a 
dx/dQ.  dAk  (y  dy/dt).  snkos  (y  -y).  (x  x  *2).  cel:  (z  x2  t3\  oci 
(x2  y  x2yX  **d  («Ais x3  -j)  and  add  (dyAfc x2y *X 

The  way  we  approached  the  probiaa  was  the  fbOowipgrwc  traced 
the  model.  jealyzasd  why  we  came  op  wish  «oo  many  bduvko,  cbosc 
to  appropriate  techniqoe  to  solve  die  problem  and  then  reran  a 
sinclaocc.  What  fbBows  sums  cp  what  we  obtained: 

Scnboon  resufcr  IS  behavion  ac  time  t2  (see  figure  a).  Analysis: 
6x]£t  danm.  Solstice:  igoorc  dish's  directioa  of  cbinge- 

Sisslekn  resells:  7  behaviors  at  time  13.  Analysis:  x2y  chatters. 
Soldloc:  igacre  x2/s  direction  of  change. 

Saaclarico  reschs:  12  behaviors  at  tone  tj.  Analysis:  dyAk  charters. 
Sofcmoo:  ignore  dy/dfs  directioo  of  chaste. 

Simulation  resells:  3  behaviors  at  time  I2-  Analysis:  1  spcriocs 
behavior  in  winch  /O2)  =  0  and  3t(t^)  *  0  (impossible!).  Solution: 
Modify  the  model. 

Pntm&iicaHy  with  QSIM.  it  is  better  to  qualitatively  translate  a 
factorized  equation  than  a  developpcd  one,  because  two  equations 
algebraically  equivalent,  but  expressed  differently,  may  have  non- 

equivalent  qualitative  translations.  Ia  ocr  case.  ^  =  X  -  x^y  is 

equivalent  10  = x  (1  -xy).  box  the  set  of  constraints  {mult  (x  x 

x2).  mult  (x2  y  x2y).  add  (dyAk  sly  x)]  is  not  equivalent  to  the  set  of 
constraints  (mult  (x  y  xy).  add  (xy  i-xy  1).  mult  (x  1-xy  dy/dt)). 
Ccnsequently  we  chan  ged  the  model  Mj  to  a  model  M2  whose  variables 
are  x,  y.  dx/dt.  dy/di,  x2.  x3,  -y,  xy  and  1-xy,  and  the  constraints  are 
dAk  (x  dx/dt),  dAk  (y  dy/dt).  minus  (y  -y).  mult  (x  x  x2).  mull  (x  x2  x3), 
mull  (x  y  xyX  add  (xy  1-xy  1).  mull  (x  1-xy  dy/dt)  and  add  (dx/dt  x3  -y). 
We  obtained  the  results  summarized  below: 

Simulation  results:  4  behaviors  at  lime  13.  Analysis:  xy  and  1-xy 
chatter.  Solution:  ignore  x/s  and  1-xy’s  directions  of  change. 

Simulation  results:  4  behaviors  at  time  13.  Analysis:  x2  starts  from 
x2(to).  decreases  and  reaches  0.  then  reaches  a  new  maximum,  which 
can  be  greater  than,  equal  to  or  lower  than  x2(t^).  Uninteresting 
distinction.  Solution:  Disable  landmark  creaaon  for  x2. 

Simulation  results:  9  behaviors  at  time  t$.  Analysis:  x  oscillates, 
but  QSIM  is  unable  to  determine  the  nature  of  x’s  oscillations. 
Solution:  Derive  an  energy  constraint  for  x:  decompose  d2x/dt2  into 
conservative  and  non-conservative  terms. 

Simulation  remits:  5  behaviors  at  lime  t$.  Analysis:  Unable  to 
determine  the  nature  of  x3’s  oscillations:  increasing,  decreasing  or 
steady.  x3  should  have  a  behavior  similar  to  x.  But  constraint 
propagation  through  the  means  of  corresponding  values  does  not 
happen:  no  new  landmarks  are  created  for  x2.  thus  no  new 
corresponding  values  for  mult  (x2  x  x3)  are  created,  and  x3  is  not 
constrained  enough.  Solution:  Disable  landmark  creation  for  x3. 

Simulation  results:  6  behaviors  at  time  lg.  Analysis,  y  oscillates, 
but  QSIM  is  unable  to  determine  the  nature  of  y't  oscillations. 
Solution:  Use  the  phase  plane  (x  y)  and  eliminate  behaviors  that  self- 
intersect. 

Simulation  results:  The  uee  grows  lincanly:  4  behaviors  at  lime  tg, 
S  at  t20<  Analysis:  all  the  behaviors  are  possible,  x  and  y  exhibit 
decreasing  oscillations.  The  system  can  reach  an  equilibrium  slate  (x  = 
0,  y  a  0)  after  an  arbitrary  number  of  oscillations  (see  figure  b). 

Conclusion 

In  this  paper,  we  have  shown  that  QSIM  can  draw  useful  conclusions 
about  a  system  that  were  not  obvious  to  derive  with  analytical 
methods.  When  the  simulation  steps  we  described  above  will  be 
sufficiently  automated,  we  think  that  QSIM  will  be  an  even  more 
efficient  system  analysis  tool. 


«  .  <S>) 

(a)  Bridal  behavior  tree 
(b)  Final  behavior  tree  and  x's  and  y't  behavior 
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Abstract  :  Daring  the  Uii  line  years  an  example  application  of 
a  knowledge  baaid  scheduling  system  has  been  developed  for  use  in 
«  IIOECHST  AG  fibre  plant.  The  design  of  the  system  follows  tie 
assumption  that  plant -lev  el  scheduling  problems  can  successfully  be 
modelled  using  a  rule-based  approach.  In  contrast  to  a  mathematical 
solution,  the  schedule  generated  is  saboptimal  with  regard  to  some 
(  hypothetical )  cost  function,  nevertheless  the  user  acceptance  of  the 
system  is  much  more  promising  than  with  the  previously  used  “linear 
programnring“-slyle  solution. 

L  INTRODUCTION! 

Starting  in  the  late  60’s,  the  fibre  plant  under  discussion  was  sup¬ 
posed  to  use  a  linear-programming-based  system  for  its  scheduling 
problems.  Although  the  program  worked  and  proposed  an  “optimum" 
solution,  it  turned  out  to  be  unsatisfactory  with  regard  to  several  cru¬ 
cial  points : 

•  Although  the  initial  scheduling  proposal  was  accepted  and  belie¬ 
ved  to  be  optimal,  it  was  almost  totally  incomprehensible. 

•  The  plant  manager  was  not  able  to  influence  the  programs  be¬ 
haviour  other  than  through  *x>me  very  rough  parameters. 

•  Inevitable  “manual*  changes  to  the  proposed  schedule  usually 
led  to  more  or  less  drastically  worse  solutions. 

•  The  sy.tem  had  a  bad  response  lime  (several  hours). 

•  The  quality  of  the  schedule  computed  was  questionable,  since  the 
plant  manager  usually  had  several  (conflicting)  goal  functions  in 
mind  —  there  was  no  way  of  introducing  conflicting  goals  into 
the  system. 

As  a  result  of  the  unsatisfactory  program  behaviour,  it  was  only 
rarely  used  in  the  scheduling  process  —  most  of  the  scheduling  task 
was  consequently  done  with  pencil  and  paper.  Regarding  this  situa¬ 
tion  as  being  even  more  unsatisfactory,  the  plant  manager  agreed  with 
a  proposal  to  solve  the  scheduling  problem  using  a  knowledge-based 
system.  Main  goals  of  this  system  were  defined  as  follows . 

•  The  system  should  follow  the  same  line  of  reasoning  the  plant 
manager  uses  when  generating  a  schedule. 

•  Each  individual  schedulmg  decision  should  be  as  transparent  as 
possible  to  the  user. 

•  The  system  should  only  make  suggestions,  “manual"  changes 
to  the  proposed  schedule  should  always  be  possible,  the  system 
should  take  are  of  incorporating  these  changes  in  an  “optimal" 
way  in  the  schedule. 

•  The  response  time  should  be  as  abort  as  possible,  a  maximum  of 
one  hour  was  agreed  upon. 

In  what  follows,  the  basic  idea,  the  implementation  and  first  expe¬ 
riences  with  the  system  are  described, 

II.  THE  PROBLEM  ENVIRONMENT 

The  fibre  plant  regarded  here  is  an  example  of  a  single-step  produc¬ 
tion  plant :  raw  material  enters  one  end  of  the  production  line,  on  the 
other  end  the  finished  product  (an  intermediate  for  textile  products) 
laves.  The  plant  itself  consists  of  several  (  5  to  40  )  such  production 
lines  with  more  or  less  different  characteristics. 


Planning  and  scheduling  in  this  plant  a  completely  based  on  custo¬ 
mer  orders  :  there  is  no  need  to  regard  storage  of  products  nor  are 
there  problems  of  lot  rise  determination.  The  scheduling  period  va¬ 
ries  from  three  weeks  to  two  months,  typically  thcre  are  from  300  to 
1000  orders  to  schedule  within  the  period.  “The  spectrum  of  possible 
products  ranges  over  about  12000  different  combinations  of  product 
parameters,  where  the  most  prominent  parameters  are : 

•  The  basic  type, 

«  Outer  (and  possibly  inner)  diameter, 

•  Colour, 

•  Bobbin  type  and  size. 

Due  to  different  characteristics,  the  set  of  possible  production  lines 
for  famishing  as  individual  product  is  limited.  A  typical  product  may 
be  produced  on  4  to  12  distinct  production  lines.  The  exact  time 
needed  for  the  production  of  an  individual  product  *1y>  varies  with 
the  production  line  used,  typical  variation  is  a  factor  of  two. 

When  production  proceeds  from  one  product  to  another  the  pro¬ 
duction  line  has  to  be  reconfigured  to  some  extent  depending  on  the 
parameters  of  the  predecessor  and  the  successor  product.  A  main  goal 
(from  the  plant  managers  point  of  view)  is  to  keep  the  overall  reconfi- 
f  -ation  effort  as  email  as  possible.  The  orders  due  date  is  a  secondary 
criterion  from  the  point  of  view  of  the  plant  manager  —  as  long  as  all 
orders  are  scheduled  within  the  scheduling  period,  due  dates  may  be 
neglected. 

Apart  from  the  main  scheduling-goal, several  other  restrictions  have 
to  be  satisfied  by  a  schedule,  for  example  : 

•  Regular  maintenance  intervals  have  to  be  scheduled  for  each  pro¬ 
duction  line. 

•  Randomly,  production  lines  are  unavailable  for  productive  al- 
Ioction  (  R  k.  D-aUocation,  failures  ...  ). 

•  Differently  coloured  versions  of  certain  fibre  types  must  not  be 
produced  during  the  same  lime. 

III.  THE  IMPLEMENTATION 

Since  the  system  under  construction  was  to  follow  a  knowledge- 
based  approach,  we  did  not  intend  to  propose  an  algorithmic  solution, 
but  mstad  decided  to  model  the  plant  manager’s  approach  to  schedu¬ 
ling.  After  an  initial  knowledge  engineering  phase  the  following  simple 
scheduling  strategy  turned  out  to  be  in  use  within  the  plant ; 

•  Step  I .  Determine  the  possible  production  lines  for  ach  of  the 
orders. 

•  Step  II .  Starting  with  the  initial  (given)  allocation  of  production 
lines,  look  for  the  “most  promising"  successor  order  regarding  all 
of  the  Una. 

•  Step  HI :  If  there  are  more  than  one  equally  “good*  orders  left 
from  Step  II,  choose  one  of  them  based  on  secondary  restrictions 
(due  date,  customer, ...). 

•  Step  IV  .  Allocate  the  order  from  Step  III  and  go  back  to  Step  II 

•  Step  V  :  After  all  orders  have  been  scheduled  (or  the  capacity 
limit  is  reached),  review  and  eventually  manipulate  the  schedule 
with  regard  to  “special  cases". 
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Although  tic  procedure  sketched  above  b  simple  and 'disputable 
with  regard  to  cfUaiRy,  It  was  a  procedure  accepted  by  the  plant 
manager  and  It  helped  to  prodoce  •good’  acted  ale*.  Nevertheless 
—  as  it  was  do nt  with  paper  and  pend!  —  it  was  a  tirae-consojan g 
process,  so  it  was  done  no  mere  than  once  a  month  and  took  a  couple  of 
days  of  the  plant  managers  lime.  Since  the  orders  scheduled  this  way 
csoally  are  subject  to  changes  (  in  amount,  type,  colour  _  )  literally 
op  to  the  last  minute,  the  “good*  schedule  was  changed  on  the  fly  and 
tended  to  get  ‘worse"  with  erery  such  change  made. 

The  system  constructed  in  response  to  the  plant  managers  request 
in  principle  follows  the  procedure  outlined  above.  As  can  easily  be 
deduced  from  the  above  description,' Steps  II,  III  and  IV  of  the  pro¬ 
cedure  can  be  modeled  by  using  a  non-backtracking,  forward-chaining 
rule-based  paradigm.  Step  I  (at  least  in  the  simplistic  form  it  is  slated) 
can  be  implemented  by  a  database  request,  the  central  point  of  inter¬ 
est  was  Step  V  :  If  many  and  frequent  changes  were  really  made  to  a 
proposed  schedule,  Step  V  would  bet  ter  be  viewed  as  being  interleaved 
with  Steps  II,  III  and  IV — resulting  in  a  backtracking  or  re-schedaling 
strategy  instead  of  early  commitment  through  forward  application.  As 
our  observations  resulted  in  figures  of  about  5  to  10  changes  per  sche¬ 
dule,  we  decided  to  use  the  forward-chaining  paradigm.  This 'decision 
not  only  led  to  a  very  efficient  implementation  of  the  scheduling  pro¬ 
cess  —  explanation  of  scheduling  decisions  has  been  much  easier  to 
incorporate  than  it  would  have  been  using  a  backtracking  version. 

The  schedule  is  presented  to  (he  user  on  a  graphical  workstation 
in  form  of  a  Ganlt-chart  —  which  is  accessible  throughout  the  whole 
scheduling  process.  Whenever  an  order  b  scheduled,  it  immediately 
becomes  visible  on  the  screen  and  while  (he  process  b  continuing  in 
the  background  the  user  has  the  ability  to  select  every  individual  al¬ 
location  (using  a  pointing  device)  and  may  request  an  explanation  for 
the  allocation  or  a  detailed  set  of  informations  on  the  selected  order. 

Once  the  proposed  schedule  is  complete,  the  user  may  change  each 
individual  allocation  in  one  of  the  following  ways : 

•  Request,  (hat  this  order  should  be  allocated  to  a  certain  produc¬ 
tion  line, 

•  Request,  that  ihu  order  is  to  be  the  direct  succersor  of  another 
allocated  order, 

•  Request,  that  a  non-scheduled  order  is  to  be  the  direc*  successor 
of  an  allocated  order, 

•  Cancel  an  allocation. 

Instead  of  directly  obeying  the  user's  request,  the  system  gathers 
all  of  those  requested  changes  and  eventually  regards  them  as  n;w 
scheduling  restrictions  as  soon  as  the  user  decides  to  start  scheduling 
anew.  This  feature  helps  the  plant  manager  to  avoid  schedules  beco¬ 
ming  worse,  each  time  he  has  to  do  some  changes.  Since  the  whole 
schedule  is  re-done,  a  “good*  result  will  again  be  achieved. 

The  most  crucial  decisions  to  be  made  during  the  scheduling  process 
concentrate  on  the  order  selection  in  Steps  II  and  III.  In  order  to  give 
the  user  a  maximum  of  flexibility,  he  is  allowed  to  edit  the  criteria 
to  be  used  in  those  steps  by  himsetf.  The  system  to  thb  end  keeps  a 
list  of  “user  visible  rules*,  that  allow  the  plant  manager  to  define  the 
“most  prondsing*  successor  in  terms  of  parameters  of  any  predecessor  - 
successor  -  pair.  This  list  of  “rules*  b  subdivided  into  priorized  packa¬ 
ges,  the  packages  themselves  are  sorted  by  user-defined  priorities  —  so 
the  plant  manager  has  complete  control  over  the  successor-selection. 
Within  thb  rule-set  the  selection  criteria  of  Step  III  are  also  incorpo¬ 
rated,  thb  allows  the  user  to  state  the  relative  importance  of  due  dates 
and  certain  other  restrictions. 

The  system  is  today  running  on  a  VAX-Station  using  VMS,  the 
graphical  interface  uses  VWS  (VAX  Window  System).  The  total 
work  sums  up  to  about  3  manyears,  where  about  60  to  70  percent 
of  the  effort  were  spent  on  integration  aspects  —  the  system  is  in  close 
connection  with  existing  databases  and  standard  software  components. 
Another  10  percent  were  spent  for  the  user  interface.  The  “Al"-part  of 


the  system  has  been  implemented  using  O PS-5,  the  more  traditional 
parts  were  done  in  FORTRAN,  today  the  system  consists  of  about 
60.000  lines  of  code.  A  typical  schedule  proposal  b  generated  in  about 
6  to  10  minutes —  the  time  needed  to  complete  all  necessary  changes 
b  less  than  one  hour  including  several  rccomputalions  of  the  schcdale. 

The  system  b  used  prod  actively  today  in  one  plant,  the  rate  of  “ma- 
cca]*  changes  made  to  the  proposed  schedsle  b  constantly  decreasing. 
Two  years  ago  we  started  with  20  percent  accepted  proposals,  a  few 
weeks  ago  we  arrived  at  a  ratio  of  70  to  30,  a  realistic  goal  lies  at  about 
80  to  20. 

IV.  CONCLUSIONS 

As  of  today,  we  believe,  that  production  planning  and  scheduling 
are  typical  candidates  for  the  use  of  AI-tecbniques.  In  our  opinion, 
there  are  several  reasons,  why  algorithmic  solutions  to  such  problems 
seem  unrealistic : 

•  Conflicting  goals  and  an  enonrous  amount  of  parameters  are  to 
'  be  take  into  account. 

•  Goals  as  well  as  parameters  often  are  not  easily  converted  into 
numeric  descriptions. 

•  Mathematical  solutions  are  hard  to  explain. 

•  ‘On  the  fly*  changes  to  schedules  are  hardly  manageable  in  al¬ 
gorithmic  attempts. 

•  If  the  environment  changes,  it  is  often  hard  to  adopt  an  algo¬ 
rithm. 

Thb  does  not  mean  that  we  blindly  believe  Al-techniques  to  be 
the  “one  and  only"  solution  to  scheduling  problems,  as  we  had  to 
learn,  a  moderate  mixture  of  AI  and  standard  approaches  seems  to 
be  a  feasible  way.  Only  if  we  manage  to  integrate  small  Al-parti.  into 
bigger  exbling  organizational,  hardware,  and  software-environments 
will  problems  successfully  be  solved;  a  purely  Al-based  solution  would 
have  been  as  much  of  a  failure  as  a  purely  algorithmic  one. 

We  are  today  continuing  are  work  in  scheduling  in  sereral  directions : 

•  We  take  part  in  the  EUREKA-project  PROTOS  II,  where  rese¬ 
arch  in  the  direction  of  global,  distributed  and  local  production 
planning  and  scheduling  Using  Al-technology  is  the  main  goal. 

•  We  are  constructing  the  equivalent  of  a  “scheduling-shell”  ab¬ 
stracting  from  the  fibre-experiences. 

•  A  second  experiment  —  now  regarding  a  more  complex  multiple 
step  production  —  is  in  the  queue. 
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Abstract 

Within  the  EUREKA-projcct  PROTOS  (Prolog  Tools  for  Building 
Expert  Systems)  (Appclrath  871  a  prototype  of  a  knowlegde  based 
production  planning  system  is  proposal  consisting  of  an  user- 
interface  enabling  the  user  to  plan  'by  hand",  a  heuristic  planning 
system  providing  facilities  to  create  and  maintain  schedules  using 
heuristic  knowledge,  and  a  factual  knowledge  base  of  the  planning 
area. 

This  paper  describes  a  heuristic  planning  algorithm  implemented  in 
Prolog  winch  aims  in  creating  a  "good"  schedule  of  production  using 
the  experience  of  an  expert  planner.  The  algorithm  tries  to  model  the 
planning  behaviour  of  the  expert  planner  in  using  a  specific  planning 
strategy  (how  to  create  a  plan  step  by  step)  and  some  of  the  heuristics 
applicable  in  guiding  the  search  fora  good  solution. 


I.  The  Scheduling  Task 

The  task  of  the  planning  personnel  is  to  create  and  maintain  a 
schedule  of  production  using  different  kinds  of  input  informau'on, 
e.g.  a  master  schedule  with  given  orders,  information  about  products 
and  resources  and  several  conditions  and  constraints  as  well  as  goals 
to  be  satisfied  (Fox  87,  Sauer  90J.  1  c.c  desired  solution  is  a  plan  that 
tries  to  satisfy  all  of  the  given  goals  and  conditions.  In  most  of  the 
cases  this  will  not  be  pos$;ble,  so  the  planners  task  is  to  find  a 
solution  which  fits  best.  For  this  he  uses  his  knowledge  and 
experience  in  guiding  the  search  for  a  "good"  solution. 

Creating  an  optimal  production  schedule  is  a  np-hard  task. 
Scheduling  approaches  from  Operations  Research  trying  to  find  the 
optimal  solution  have  some  weak  points  which  limit  their  applicability 
in  real  planning  environments: 

-  due  to  the  combinatory  complexity  of  the  problem  area  they 
usually  work  on  a  reduced  problem  model  which  does  not  fit  to 
the  real  planning  situation 

-  the  algorithms  are  complex  and  ’iie  solutions  and  the  way  they  arc 
produced  are  difficult  to  jndersunJ 

-  most  of  them  lack  flexibility  in  the  case  of  changing  situations  or 
parameters. 

Intention  of  the  knowledge  based  approach  is  to  use  the  knowledge 
and  experience  of  a  production  planner  within  a  heuristic  planning 
algorithm  to  build  a  production  schedule.  Several  heuristic  scheduling 
approaches  have  been  proposal,  e.g:  (Fox  87,  Keng  88,  Liu  88), 
most  of  them  try  to  model  a  specific  planning  behaviour  using  certain 
objectives  only  implicit.  The  first  PROTOS-approach  is  similar  to  that 
as  the  heuristics/  strategies  of  the  expert  planner  shall  be  used  to 
create  an  algorithm  which  simulates  his  planning  behaviour. 

The  planner  uses  an  order  based  planning  strategy  where  he  never  has 
to  look  for  the  whole  problem  area.  He  selects  an  order  to  be  planned 
next  and  tl«en  he  tries  to  plan  the  whole  order  at  once.  Orders  that 
have  been  planned  before  usually  will  not  be  changed  and  so  they  act 
as  new  constraints  for  the  orders  to  be  planned  afterwards. 

The  planning  heuristic  can  be  described  schematically  by  the 
following  procedure: 

a  master  schedule  with  n  orders  is  given 

execute  the  following  steps,  until  all  orders  are  scheduled. 

-  select  an  order  to  be  scheduled 

-  Select  an  interval  for  the  execution  of  this  order 

*  select  a  production  variant  for  the  manufacturing  of  the  product 

-  execute  the  following  steps,  until  all  steps  of  the  variant  are 
scheduled 

*  select  a  step  to  be  scheduled 

-  select  a  possible  apparatus  for  manufacturing  *he  step 
concerned 

-  if  there  is  a  conflict,  try  to  resolve  iL 


For  every  selection  the  planner  has  to  apply  his  knowledge,  c.g. 
selecting  'crincaT  products  before  others  (see  below). 

The  main  conflict  the  planner  has  to  resolve  in  creating  the  schedule  is 
that  of  ore riappin  gs  (an  appropriate  apparatus  is  already  occupied). 
One’  of  the  possible  strategies  to  solve  this  conflict  is  land  this 
strategy  i*  also  used  in  the  algorithm):  first  look  for  an  alternative 
apparatus,  if  there  is  no  one  available,  look  for  2n  alternative  variant, 
if  it  is  not  possible  to  use  the  alternative  variant  within  (he  given 
interval,  look  for  another  interval,  this  means  shift  the  interval  until  it 
is  possible  to  plan  tie  order. 


Fig.  1:  AND/OR  Tree  -  Problem  Space  of  Scheduling 


The  problem  area  of  this  planning  methodology  can  be  visualized  by 
an  AND/OR -tree.  The  node-  of  the  tree  contain  a  statement  about  the 
contribution  of  the  childre*.  nodes  to  the  solution  and,  additionally,  a 
value  that  can  be  of  different  domains  at  each  level. 

A  solution  for  the  AND/OR  tree  is  a  sub-tree  with: 

-  the  root  belongs  to  the  solution 

-  if  an  AND-nodc  belongs  to  the  solution  then  all  its  children-nodes 
too 

-  if  an  OR-node  belongs  to  the  solution  then  exactly  one  of  its 
children  nodes  too 

Finding  a  solution  for  the  schooling  problem  is  equivalent  to  finding 
a  solution  far  the  AND/OR  tree  where  all  the  conditions  are  regarded 
In  figure  1  a  solution  is  indicated  by  the  solid  lines. 


2.  The  Scheduling  Approach 

The  so  called  Basic-Algorithm  is  part  of  the  PROTOS-planmng 
approach  which  consists  of 

-  a  user-interface  IMichaux  90],  allowing  planning  "by  hand"  and 
used  as  man-machine-interface 

-  a  heuristic  planr.jig  system  (consisting  of  the  Basic-Algonthm  and 
a  Control-Algorithm) 

-  a  factual  knowledge  base  used  by  both  of  them. 

Main  intention  of  the  Basic-Algonthm  is  the  creation  of  a  schedule  of 
the  production  using  the  expert  knowledge  of  the  production  planner. 
Other  ones  arc  that  this  creation  shall  be  efficient  and  shall  use  the 
basic  problem  solving  strategy  of  Prolog. 

The  Basic-Algonthm  tnes  to  model  the  planning  behaviour  of  the 
planner,  but  as  it  is  impossible  to  acquire  all  the  knowledge  and 
’feelings'  of  the  planner,  only  a  part  of  his  knowledge  and  some  rules 
from  literature  arc  used  in  the  planning  process. 

Result  of  the  Basic-Algorithm  is  a  schedule  regarding  the  objectives. 

-  all  orders  (of  the  master  schedule)  are  planned 
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-  doe  dales  shooM  be  met  -  .. 

-  already  planned  orders  which  arc  e-g.  planned  oy  hand  are  re¬ 
garded  as  hard  constraints  . 

-  the  production  requirements  (e.g.  relative  start  dates,  durations  ox 
slept,  predefined  apparatuses)  have  to  be  met 

-  no  overlappings  exist  .  .  _  f 

-  if  possible  the  'stem-apparatus  and  -variants  are  used  Cstem 
means  those  variants  and  apparatuses  which  preferably  should  be 
used) 

-  the  whole  production  interval  for  ooc  step  is  occupied  at  once. 


Meeting  the  due  dates  shall  be  achieved  by  planning 'critkaTprodocts 

first.  A  'crincaT  product  here  means  a  product  critical  in  time.  A  value 
describing  the  criticality  is  determined  for  every  product  to.be 
planned:  a  product  is  critkal  if  it  has 

-  less  alternative  variants  than  others 

this  is  evaluated  by  dividing  the  number  of  alternative  apparatuses 
by  the  number  of  steps 

-  to  be  finished  before  othas  (earliest  due  date) 

-  a  smaller  slack-interval  than  others 

where  slack-interval  means  the  distance  between  the  earliest 
possible  finishing  date  (the  given  start  dare  plus  the  duration  of  the 
production)  and  the  given  due  date 

-  a  higher  user  priority  than  others 

this  is  a  user  given  priority  to  prefer  products. 


Additionally  user  given  weights  for  the  priorities  arc  used  to  provide 
the  possibility  to  prefer  one  of  the  rules.  In  the  current  version  higher 
weights  for  rules  one  and  three  art  used. 

The  scheme  of  the  Basic-Algorithm  (fig.  2)  looks  much  like  the 
planning  methodology  presented  above  Every  time  a  choice  is 
possible,  heuristics  are  used. 


The  results  of  the  Basic-Algorithm  are  presented  by  the  user-interface 
or  as  listings. 


H  check  preconditions 

look  for  orders  to  be  merged  or  splitted 

2.  create  a  list  of  orders  sorted  by  'criticality1 

heuristic,  weighted  combination  of  the  four  rules  listed  above 

3.  choose  order 

heuristic,  use  the  most  critical  product  first  (first  of  the  list) 
if  all  orders  planned  •>  FINISH 

4.  choose  production  interval 

heuristic :  Gy  the  given  start-date  first,  if  this  leads  to  no 
solution  then  shift  adding  the  minimal  necessary  delay  (this 
rule  makes  sure  that  a  solution  will  be  reached) 

If  order  planned ->  3 

5.  choose  variant 

heuristic,  try  variant  0  first,  then  variants  with  increasing 
number  (variant  0  is  the  'stem'  variant) 
if  order  planned  or  not  plannable  •>  4 

(not  plannable:  all  variants  failed  for  the  actual  interval) 

6. 

heuristic,  try  steps  in  reverse  order  (the  last  step  first) 
if  all  steps  planned  or  one  step  not  plannable  •>  5 

(not  plannable.  all  possible  apparatuses  failed  for  the  actual 
interval) 

7.  fiheese  apparatus 

heuristic:  Gy  'stem-apparatus  first,  if  not  possible  then  try 
alternative  apparatuses 

_ if  apparatus  free  or  no  alternative  left  ->  6 _ 

Fig.  2:  Scheme  of  Basic-Algorithm 

The  Basic-Algorithm  is  implemented  in  BIM-Prolog  and  Quintus 
Prolog  on  sun-workstadons  and  is  being  tested  in  several  selected 


plants.  First  results  showing  quantities  of  test  data  and  runtime 
performance  of  the  Basic- Algorithm  are  summarized  in  the  table  of 
figure  3. 


Parameter/ Plant 

Plant  A 

Plant  B 

Products 

33 

54 

Steps  per  product 

10 

3 

Altera,  apparatuses  per  sep 

4 

3 

Variants  per  product 

2 

1 

Apparanaes 

54 

32 

Orders 

44 

50 

Planmng-period  (month) 

6 

6 

Runtime  (BIM-Prolog)  (sec) 

165 

23 

Fig.  3  First  Results 


3.  Further  Research 

Some  weak  points  of  the  prototype  version  of  the  Basic-Algorithm 
are: 

-  Not  all  of  the  necessary  conditions  are  checked.  Among  them  are 
the  availability  checks  of  personnel  and  raw  material. 

-  A  main  point  which  limits  the  usability  of  the  Basic-Algorithm  is 
Aat  it  is  designed  for  the  special  problem  case  of  creating  a 
schedule  regarding  the  objectives  and  assumptions  listed  above. 
The  planning  heuristic  (strategy)  used  is  implemented  fix.  Tire 
algorithm  is  therefore  useful  only  in  similar  problem  areas. 

-  Another  crucial  point  is  the  combination  of  several  heuristic  rules, 

e.g.when  selecting  orders.  One  can  think  of  situations  where  the 
default-weights  will  not  lead  to  an  acceptable  solution,  so  the 
weights  have  to  be  changed  or  in  some  cases  new  rules  (e.g.  re¬ 
garding  costs)  should  be  added  to  take  into  account  all  conditions. 

These  topics  lead  to  demands  for  a  'Contror-Algorithm  which  is  now 
under  development  Some  of  the  characteristics  of  such  a  'Contror- 
Algorithm  arc: 

-  Selection  of  heuristics  for  planning 

Several  heuristics  (strategics)  for  planning  should  be  integrated  in 
-the  system.  The  planner  may  choose  appropriate  ones  or  the 
system  should  propose  appropriate  ones. 

-  Using  planning  skeletons,  adapting  strategies 

Planning  heuristics  may  be  represented  as  skeletons  using  abstract 
operators  for  rules  and  parameters  c.g.  for  the  selection  of  orders. 
With  these  skeletons  and  appropriate  rules  or  parameters  the 
planner  is  able  to  build  an  "own"  planning  algorithm  for  a  given 
situation. 

-  Integration  of  the  Basic-Algorithm 

The  Basic-Algorithm  is  used  as  ore  of  the  planning  strategies. 

-  Selection  of  heuristics  for  replanning  in  specific  situations 

Not  only  planning  heuristics  but  also  heuristics  for  replanning 
should  be  integrated.  This  leads  to  the  possibility  of  a  flexible  and 
efficient  reaction  to  unforeseen  events,  lhc  selection  may  be  done 
interactively  or  after  the  system  proposed  appropriate  heuristics. 

-  Integration  of  "new"  heuristics  by  the  user 

The  user  may  change  existing  or  define  not  yet  integrated 
heuristics  using  a  provided  definition  language. 
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Abstract  -  In  this  paper  we  present  BUT,  a  knowledge  based  system 
for  the  planning  of  underground  Summation  in  coal  mines.  The  task 
is  to  find  lighting  configurations  which  are  optimal  from  the  ergo¬ 
nomic  and  economic  point  of  view.  The  problem  is  that  there  is  a  huge 
number  of  possible  configurations  for  Kghting  satisfying  the  ergo* 
noraic  guidelines  on  underground  illumination  of  coal  mines  devel¬ 
oped  by  the  European  Coal  and  Steel  Community.  We  present  the  ar¬ 
chitecture  and  mam  features  of  BUT  which  is  designed  to  restrict  the 
set  of  all  possible  solutions  to  a  manageable  set  of  ergonomically  and 
economically  acceptable  or  even  optimal  ones. 

I.  INTRODUCTION 

Appropriate  lighting  minimizes  the  risk  of  accidents  in  hard  coal  min¬ 
ing.  In  order  to  identify  objects  sufficient  light  intensities  arc  required, 
the  value  of  which depends  on  the  special  task  to  be  performed.  At  the 
same  time  gradual  changes  in  light  intensity  arc  desired  In  order  to 
case  the  accommodation  of  the  eyes.  Recently  the  European  Coal  and 
Sled  Community  has  adopted  guidelines  which  arc  an  abstract  of  dif¬ 
ferent  research  projects  and  define  the  requirements  of  underground 
illumination  In  axJ .mines  (IJ,  For  several  reasons,  however,  it  is  not 
a  trivial  task  to  fulfill  these  requirements,  and  it  b  even  more  difficult 
at  minimal  or  at  least  reasonable  costs. 

Until  now,  there  has  been  no  computer  support  for  tins  kind  of  plan¬ 
ning.  The  difficulties  to  be  solved  arc  manifold.  Different  tasks  have 
different  requirements  on  light  intensity.  The  patterns  defining  these 
tasks  cannot  be  characterized  by  simple  schemes.  With  regard  to  the 
mine  layout  a  lot  of  different  sccncncs  have  to  be  taken  into  aocount, 
which  agam  cannot  be  characterized  by  simple  schemes.  Each  see- 
nano  depends  on  the  umbering,  the  length  of  the  nunc  layout,  the  in¬ 
stalled  objects  and  equipment  including  their  dimensions,  to  mention 
only  a  few  parameters.  For  example,  a  stone  dnft  may  contain  explo¬ 
sion  barriers,  a  monorail,  rails,  a  manway,  ramps,  conduits  and  a 
band  conveyor  as  equipment.  Finally,  a  large  variety  of  types  of  ex- 
plosaon-proot  lights  is  available  which  can  be  equipped  with  different 
lamps  and  installed  in  many  different  configurations  (one  or  two  lines 
etc.). 

H.  PLANNING  UNDERGROUND  ILLUMINATION 

The  problem  to  be  solved  can  be  described  as  follows.  Ch'en  a  par¬ 
ticular  mine  layout  including  its  dimensions,  installed  objects  and 
equipment,  identify  the  tasks  to  be  performed,  determine  which  mini 

This  research  tamed  out  by  die  Ruhrkohlc  AG.  Essen,  is  supported 
by  the  European  Coal  and  Steal  Community  in  the  context  of  the  fifth 
ergonomics  program. 


mal  light  intensities  are  required  in  different  areas  of  that  mine  layout 
and  fold  out  which  lights  with  which  lamps  can  be  used  and  where, 
how  and  in  which  distances  they  have  to  be  fixed.  An  acceptable  so¬ 
lution  b  one  which  satisfies  the  ergonomic  requirements  in  each  area, 
and  an  optimal  solution  docs  this  for  minimal  costs 

This  is  a  typical  planning  problem.  A  subproblcm  b  the  calculation  of 
the  light  intensity  m  the  mine  layout,  given  the  specification  of  the 
lights,  the  mine  layout  and  its  equipment  This  subproblem  can  at  least 
be  efficiently  approximated  with  algorithmic  approaches.  For  die 
planning  problem,  no  algorithmic  solution  b  at  hand.  The  special 
characteristics  of  this  problem  suggest  a  knowledge  based  approach, 
since  database  retrieval  (which  lighting  fulfills  given  specifications) 
combined  with  numerical  computations  (calculation  of  bght  intensity 
caused  by  a  given  configuration)  would  be  too  inefficient  A  certain 
amount  of  reasoning  is  necessary  to  identify  the  areas  in  which  tasks 
requiring  a  sufficient  amount  of  illumination  arc  performed  and  to  de¬ 
cide  where  the  lights  should  be  fixed.  Whereas  the  knowledge  to 
identify  the  rmnunum  requirements  can  be  dialed  outof  the  guide¬ 
lines  on  underground  illumination,  the  knowledge  suggesting  appro¬ 
priate  lights  and  mounting  places  has  to  be  acquired  from  experts 
planning  the  lighting. 

III.  THE  PLANNING  SYSTEM 

The  requirements  to  a  planning  system  for  underground  illumination 
are  the  ability  to  represent  all  necessary  informations  about  nunc  lay 
out  and  to  implement  the  guidelines  and  the  heuristics  to  determine 
optimal  mounting  places  Such  a  system  also  serves  as  a  tool  for  the 
acquisition  of  the  heuristics  used  by  the  planning  experts  to  find  cr 
gonoraically  and  economically  almost  optimal  solutions,  it  must  pro¬ 
vide  a  great  degree  of  flexibility  and  user-friendliness,  which  is  a  ba¬ 
sic  precondition  for  the  tool  to  be  accepted  by  the  planning  people 
Furthermore,  it  must  be  able  to  represent  complex  objects,  to  perform 
inferences  with  the  rules  stored  in  its  knowledge  base,  and  must  pro¬ 
vide  a  graphics  interface  to  simulate  different  lighting  configurations 
so  that  the  planning  people  directly  can  test  their  heuristics  To  faoli 
late  the  implementing  and  test  phase,  an  explanation  tool  is  required 
allowing  the  access  to  technical  documents  concerning; the  domain. 
Finally  it  must  be  possible  to  integrate  this  planning  system  into  the 
existing  environment  containing  personal  computer,  workstations  and 
company’s  data  bases  and  software.  The  architecture  of  BUT  and  its 
interfaces  to  internal  software  systems  is  shown  in  Figure  1. 

To  guarantee  a  maximum  amount  of  flexibility  we  deaded  to  imple¬ 
ment  BUT  In  Prolog,  since  logic  programming  provides  both  a  clear 
semantics  and  a  practical  language  so  that  dear,  efficient  problem 
solving  programs  can  be  written  within  logic  programming  [2],  [3] 

{ 4j ,  Furthermore,  every  piece  of  knowledge  Is  represented  explicitly, 
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Fig.  1,  The  architecture  of  BUT 


nothing  remains  implicit  ‘in  fancy  data  or  control  structures'.1  Finally, 
n.cta-programming  can  be  used  for  different  extensions  of  logic  pro¬ 
grams  providing  uncertain  reasoning  or  explanations  ($). 

In  order  to  fill  the  knowledge  base  of  BUT  we  hayc  to  represent 
complex  underground  situations  and  knowledge  about  lighting.  First 
we  have  to  interpret  the  ergonomic  guidelines  respecting  the  particular 
requirements  of  the  situation  To  implement  them,  we  apply  tech¬ 
niques  described  in  (6).  Second  we  have  to  acquire  the  heuristics  of 
the  planning  people.  It  turned  out  that  these  hcunstics  can  easily  be 
expressed  by  rules  stating  declarative  knowledge  which  themselves 
can  be  translated  into  a  torn  directly  executable  by  a  Prolog  Inter¬ 
preter. 

The  representation  of  underground  situations,  both  logically  and 
graphically,  is  done  in  an  object-oriented  style,  which  provides  ab¬ 
stract  data  types  •  especially  for  graphic  objects.  Following  (7)  the 
object-oriented  extensions  arc  still  completely  realized  in  standard 
logic  programming.  So  we  have  a  uniform  declarative  description  in 
first-order  logic  yielding  a  neat  integration  of  the  user  interface  Into  the 
problem  solving  process. 

The  objects  arc  graphically  described  by  vector  graphics.  In  the  actual 
implementation  BUT  allows  a  top  view  including  the  lighting  intensity 
andasectiuial  view.  A  special  interpreter  draws  all  objects  in  a  par¬ 
ticular  nunc  layout  according  to  the  actual  view.  However,  there  is  a 
strong  relationship  between  the  shape  of  an  object  and  the  installation 
of  the  lights.  Some  kind  of  spatial  knowledge  is  used  to  choose  opti¬ 
mal  places  where  the  lights  have  to  be  fixed.  For  example,  if  the  lights 
have  to  be  mounted  to  the  band  conveyor  then  ns  shape  determines  the 
concrete  place  |S]. 

The  current  prototype  of  BUT  performs  the  following  tasks. 

Following  the  guidelines  on  underground  illumination  in  coal  mines  it 
identifies  tasks  to  be  performed  and  determines  minimum  light  inten¬ 
sities  for  each  mine  layout  Then  it  tests  different  reasonable  configu¬ 
rations  to  find  out  acceptable  ones /An  algorithmic  approach  is  applied 
to  optimize  the  distances  of  thelights.  Subsequently  all  solutions  arc 
judged  with  respect  to  their  costs.  Figure  2  shows  a  screen  dump 
displaying  the  result  of  a  session  in  which  the  illumination  of  a  stone 


drift  containing  a  ramp,  a  rail  a  band  conveyor,  explosion  barriers 
and  conduits  is  planned. 

IV.  CONCLUSIONS 

In  this  paper  we  presented  a  planning  system  called  BUT  for  under¬ 
ground  illumination  in  coal  mines  To  implement  BUT,  we  integrated 
logic  programming,  object  oriented  programming,  algorithmic  ap¬ 
proaches  and  graphics  interfaces  BUT  applies  the  guidelines  oh  un¬ 
derground  illumination  in  coal  mines  to  constrain  the  set  of  possible 
lighting  configurations.  Furthermore,  heuristics  acquired  from  plan¬ 
ning  experts  arc  used  to  select  reasonable  lights  and  lamps  and  ways 
to  mount  them  in  the  mine  layout. 
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Abstracts  PROTOS-L  u  *  logic  programming  language  will*  types,  modules,  de¬ 
ductive  database  access  and  ■■  object-oriented  wiadow  interface,  A  brief  overview 
of  wise  planning  applications  developed  m  PROTOS-Li»  given. 

1  Introduction 

Tie  EUREKA  project  PROTOS  (Appelrath  <t  at,  1990] is  concerned  with  (be  de¬ 
velopment  of  logic  programming  tools  tad  tVeir  application*  ia  building  expert  ij- 
sleras  for  planing  oppEcatrons  Carreat  pattaers  ia  PROTOS  arc  BIM,  Hoeckst, 

IBM  Germany,  Sandoi,  and  ike  Universities  of  Bona  and  Oldenburg. 

Wiiki*  PROTOS  the  logic  programnuag  Uagaage  PROTOS-L  ({Bererle,  1969), 
(Bottcler,  1990]}  kti  been  developed,  PROTOS-L  kas  types,  modules,  ded ac¬ 
tive  database  access  ((Meyer,  19S9])  aad  an  object-onented  interface  to  OSF/Motif 
((Jasper,  1991),  (Schenk,  1991])  The  purposeoftkispapeustogiveab.nef  overview 
oa  some  planning  application*  tkat  kave  been  developed  in  PROTOS-L 

2  Map  Colouring 

For  illustration  pirposet  we  kave  implemented  a  version  of  tke  wcQ  known  map 
edouring  problem.  Given  a  set  of  coaatries  tog'tker  witk  their  neigkboanng  rela¬ 
tions,  each  of  tkese  countries  is  palated  with  one  of  fotr  colons  (say;  greeo,  red, 
y?Uow,  bine)  suck  that  no  neighbour  countries  kave  the  same  colour  For  this  pr©: 
blem  domain  tke  facilities  of  PROTOS-L  of  constraining  variables  to  i  iblypei  were 
exploited.  For  Instance,  given  a  map  of  Earope,  assame  that  Switserland  gets  tke 
coloar  red  Immediately,  all  nrighbours'of  Switserland  are  constrained  to  be  not  red, 
i  e.  tkeir  eoJoar  mist  be  in  {green,  yellow ,  blae)  only.  If  Germany  then  is  assigned 
Ike  colonr  gteea,  •  g,  Fraace’a  colons  valne  mast  be  in  {yellow,  blae}.  Tk‘«  prop*' 
gallon  of  constraints  significantly  redaces  tke  search  space.  When  tke  constraints 
yield  an  empty  set  of  possible  colonrs  for  any  country  this  failne  is  detected  as 
early  as  possible,  and  not  only  when  tie  coloariag  of  tkat  coaatry  is  toed.  To  give 
an  example,  tke  standard  backtracking  solttioft  in  Prolog  nsed  116  backtracking 
steps  for  couloring  a  map  of  Enrope,  whereas  tke  PROTOS-L  solatlon  nslag  type 
constraints  as  sketched  above  fontd  the  first  soUtion  withont  a  single  backtracking 
step.  This  example  illnslrates  that  a  similar  benefit  can  be  gained  by  constraint 
propagation  techniques  it  prediction  plaining  (tee  e  g.  (Dincbai  <t  «l,  1969]). 

3  IC  Routing 

Fiadmg  *  roste  ia  a  graph  nader  certain  conditions  It  another  classical  planing 
sit  nation  A  particular  iaitaaca  of  this  problem  domain  Is  solved  in  the  1C  ronts 
planning  system  tkat  has  been  developed  in  PROTOS-L 

A  jnajor  part  of  tka  German  railway  system  is  tke  InteiCity  net  that  connects 
over  JO  cities  in  Germany  la  one  system,  tke  direct  city.to-city  1C  connections  are 
stored  in  a  relational  database,  and  the  problem  is  to  find  a  good  IC  connection 
between  two  cities  given  by  tke  nser,  possibly  with  changing  trains.  Besides  tke  two 
Cities,  n  time  T  mmt  be  given  and  optimisation  criteria  Sker 

*  arrival  as  soon  as  possible,  dtpatme  at  T  or  after  T 
»  departnrtai  late  as  possible,  arrival  at  T  or  before T 

A  high  level  nser  Interface  has  been  developed  on  an  IBM  RS/6000  workstation 
nsmg  PROTOS  L’s  objft-onented  interface  to  ,X  Windows  and  to  OSF/Motif 
({Schenk,  1991)).  Tie  departnre  tad  arrival  cities  can  be  selected  by  mouse  clicks, 
and  them  ar*  means  for  th*  different  option  The  system  finds  tke  epftmnl  connec¬ 
tion  ander  the  given  option. 

Some  of  tke  e xperienecn  we  made  with  the  IC  ronte  planning  system  were- 

*  An  efficient  and  mteUigeat  dor*  connection  to  an  external  relational  database 
Is  vital  In  sick  application.  A  deductive  database  component  allowing  also 
recattive  queries  as  In  PROTOS-L  it  meftl  for  easy  formnla lions  of  path  fin¬ 
ding  problems  and  can  oven  »r«d  termination  problems  In  case  of  cycles  in 
tke  data  ([Meyer,  1969]),  The  database  access  strategic*  kave  to  be  accomo¬ 
dated  to  tke  ict-oncnted  evalaalson  in  databases,  which  it  opposite  to  tke 
backtracking  method  in  logic  programming. 

•  Tkn  part  scalar  transparent  database  access  realised  ia  PROTOS-L  made  it 
very  easy  to  more  from  a  prototypical  test  Implementation  with  oaly  a  few 
atie*  and  some  connection  in  main  memory  to  tka  fall  database  version.  This 
rapid  prototyping  aspect  of  tk*  database  access  has  alto  been  exploited  in  tke 
PPS  application  to  be  discnsed  below. 

•  The  system  I*  very  flexible  w  r  *  wkat  is  an  optimal  connection  For  instance, 
instead  of  the  failtil  connection  one  coaid  easily  tale  the  connection  with  the 
smallest  member  of  changes  as  tke  optimal  one 
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•  High-level  «ser  interfaces  are  not  only  aiefvl  for  application  repairing  a  great 
a  moist  of  mer  interaction,  became  applications  as  this  one  can  be  nsed  with 
virtnally  no  training  effort  at  nil, 

4  Multi-step  production  planning 

The  main  application  domain  of  the  PROTOS  project  is  production  planning  and 
sckedaliag  A  PPS  problem  description  provided  by  Ike  PROTOS  partner  Saados 
((Saner  it  *1,  1969))  occurs  ia  the  area  of  pharmaceutical  production  planning 

•  There  is  n  set  of  orders  to  be  produced.  Each  order  specifies  a  product,  an 
amonnt,  aad  a  dae  date.  Orders  are  also  assigned  priorities 

•  Each  product  can  be  prodicrd  in  a  number  of  variants  Each  vanant  eomi.sti 
of  a  number  of  production  steps,  and  each  production  step  can  be  earned 
out  on  a  number  of  differreat  machines. 

•  Machines  have  different  characteristic  features,  not  every  step  of  a  variant  can 
be  produced  on  every  machine.  But  tke  machines  are  multi-purpose  machines 
at  they  can  do  many  different  jobs. 

•  Production  constraints  have  to  be  obeyed  For  instance,  the  individual 
production  steps  of  a  given  vanant  must  be  executed  in  a  predefined  sequence, 
without  any  time  delay  between  them. 

Fot  this  multi-step  production  planning  system  we  have  developed  different  stra 
tegies' 

!  Guided  by  Expert  As  soon  as  an  order  with  highest  priority  can  not  be 
produced  before  its  due  date  the  planning  process  stops  in  order  to  allow  tke 
planning  expert  to  deads  what  to  do  about  this  situation. 

J  Maximal  Planning.  If  higher  priority  orders  can  not  be  produced  before  tkeir 
dun  date  any  mors,  the  planning  process  still  plans  a  maximal  number  of  next 
level  priority  orders  tkat  can  be  produced  before  their  due  da*t, 

J  Delay  Orders)  Kan  order  with  highest  priority  can  not  be  produced  before  its 
due  dote  it  is  nevertheless  put  into  the  plan  An  information  is  given  to  tke 
user  how  much  such  order*  will  be  delayed  after  their  doe  date 

4  Combination  of  ())  and  (6)  As  many  orders  as  possible  are  planned,  stalling 
with  highest  priority  orders  first  and  allowing  exceeding  of  due  dates  where 
necessary. 

According  to  experiences  made  by  Saados  St  was  tried  to  solve  this  multi  step 
PPS  problem  with  classical  OR  methods,  bat  without  success.  Tie  four  different 
planuiug  strategies  sketched  above  do  not  guarantee  an  optimal  plan  in  n  matke 
matieai  stnse,  but  kumsn  planning  expert  can  use  tkera  and  experiment  with  them 
to  see  what  effects  they  have  for  a  particular  set  of  orders  Among  tke  experiences 
drawn  from  this  application  was  tkat  it  was  very  kelpful  to  kave  tke  system  imple¬ 
mented  in  a  high  level  logic  programming  language,  foe  instance,  the  changing  of 
any  parameters  and  itrntrpr*  in  the  planning  system  was  much  easier. 

Also  in  this  appbcation  there  was  a  smooth  transition  front  a  small  test  version 
using  only  main  memory  to  the  full  version  with  the  external  database  relations 
containing  the  orders  information,  the  production  requuemoeti  with  their  variants, 
steps  and  machine  alternatives,  etc.  The  typing  concept  helped  to  discover  many 
programming  errors  already  in  an  early  phase  of  program  development  which  might 
have  otherwise  earned  much  more  difficulties  in  locating  and  eliminating  them  Ho¬ 
wever,  the  first  versions  of  this  planning  system  were  developed  while  tke  OSF/Motif 
interface  of  FROTOS  L  was  not  yet  available  Thus,  to  work  with  the  system  with 
out  a  window-baled  user  interface  is  by  far  not  as  comfortable  as  wi»h  such  an 
interface  as  it  Vai  been  developed  for  the  BPS  application  presented  in  the  next 
section. 

5  Single  Step  Production  Planning 

The  following  PPS  appbcation  deals  with  fiber  production  as  it  occurs  at  »k«  PRO¬ 
TOS  partner  Hoechit  AG  The  problem  is  to  assign*  set  of  orders  (typically  between 
100  aid  100  per  month)  to  n  number  of  production  bats  (say,  for  instance,  1 J  lines) 
Oace  the  production  of  an  order  h»*  been  started  on  a  production  hne  it  will  also 
be  finished  on  that  bat  before  anythin*  else  is  produced  on  the  same  bne  Thus, 
this  problem  is  an  instance  of  a  single  a  ep  prodnction  planning  problem 
Tke  main  characteristics  of  the  problem  situation  are. 

•  Each  order  specifies  a  certain  ype  of  fiber,  tka  amount  to  be  produced,  a 
bobbin  type,  whether  tke  fiber  should  kave  n  profile  or  not,  a  diameter,  aad 
a  die  date  Furthermore,  it  may  specify  a  certain  colour  or  it  may  specify 
colourless. 


Figure  Is  Gantt  Chart  poduced  by  the  HoPlei  system 


•  The  production  line*  may  differ  in  their  chinctemtia  There  are  alow  and 
fait  production  fines,  each  hue  can  prodacc  only  a  subset  of  all  typer  of  fibril, 
etc.  From  the  production  line  characteristics  it  is  pomble  to  determine 
which  order  can  be  prodoced  on  which  prodaction  fine.  Thu  rruy  depend  on 
all  parameters  of  an  order  She  »U  fiber  type,  Sts  bobbm  type*  it*  diameter,  etc. 

«  Producing  an  order  S  ifter  an  order  P  on  the  ramt  Eat  will  came  a  certain 
a  moult  of  resetting  costs,  The  amount  depends  on  the  line  and  P'a  and  $’» 
fiber  type,  colour,  diameter,  and  bobbin  type.  This,  these  costs  can  only  be 
determined  osco  a  prodaction  (in?  has  been  selected  for  these  Orders 

•  Apart  from  the  prodnction  restrictions  already  mentioned,  there  is  one  major 
restriction  which  is  called  the  colour  constraint.  There  are  special  fiber 
types  which  form  a  sabiet  of  all  possibla  fiber  types.  The  constraint  is  that  no 
two  orders  for  these  sped*!  types  can  bl  produced  at  the  same  time  on  any 
of  the  prodaction  fines  if  (Vt  orders  specify  two  different  colons.  Therefore, 
orders  for  Coloared  Special  fiber  types  (ailed  CS  orders,  for  short)  mast  be 
hand’-^d  especially  carefal  in  the  planning  process.  For  instance,  a  green  CS' 
omr  (inning  on  fine  4  from  time  Tl  to  time  T2  will  prohibit  the  prodaction 
of  any  blae,  yellow,  red  etc.  CS-order  on  any  other  line  between  Tl  and  T2 

AH  data  mentioned  so  far  is  stored  in  an  external  database.  The  goal  of  the 
pUinsg  process  is  to  fiad  a  schedule  inch  that  all  orders  are  prodaced  in  time  and 
aH  restriction  and  constraints  are  satisfied-  Since  the  resetting  costs  can  b#  rather 
ligV  (e  g,  when  cVaiging  from  oae  colour  t*  another)  n  important  objective  is  to 
minimise  the  overall  resetting  costs  of  a  plan. 

According  to  tha  experiences  of  Hoechil,  also  this  PPS  could  not  be  solecd 
successfully  with  Enear  programming  lechniqaes.  The  computation  lime  was  too 
long  (leeeiul  hauri),  the  resulting  plan  could  not  be  modified  (which  had  to  be  done 
frequently  da«  to  modified  orders),  end  the  plan  often  did  not  refiect  additional 
constraints  the  human  experts  knew  end  obeyed  but  tket  were  not  reflected  in  the 
program  (wh«h  would  also  require  a  plan  modification)  Therefore,  a  knowledge 
based  approach  was  taken  at  Hoechst  ((Jtchenuch,  1990))  where  the  planumg  know, 
legda  of  the  expert*  eould  be  expreited  directly.  Certain  heuristic*  to  select  a  "best 
next"  order  eul  down  the  search  space  slgaificently.  For  instance,  there  are  rules 
like 

♦  A  best  next  order  should  haee  the  same  parameters  a*  its  predecessor. 

*  If  one  has  to  switch  colours  e»  diameters  rather  take  the  same  colour  and 
switch  the  diameter. 

*  If  one  has  to  iwit*  colons,  it  u  better  to  switch  from  a  bghtei  colour  to  a 
darker  one* 

#  If  one  has  to  awitch  diameters,  it  u  better  to  switch  from  a  smaller  to  a  greater 
one. 

Bach  of  these  ales  expresses  a  planner’s  knowledge  Vow  to  keep  the  resetting  costs 
low  A  system  Used  on  such  rules  has  been  developed  at  Hoechst  using  OPS5,  and 
„  ’  k*‘  *U!  ,fc*  b»*«  tk*  PROTOS-L  planning  system  eailed 

HoPl*  (Hoechst  Planning  System).  In  addition  to  the  data  Ested  above,  the  planner 
pm  a  time  interval  for  the  planaiag  period,  aid  a  Eat  of  colour  interval*  Each  of 
these  colonr  laterval*  specifies  a  suWatemJ  ,f  (h,  pUneiag  interval  in  which  the 
system  ».y  pfo,  CS-order.  of  that  colour.  The  HoPl*  system  generates  •  Gantt 
chart  ™pl«yi»|  the  produced  plan  that  ia  coloured  according  to  the  order*’  colours 
(Fig.  1),  After  a»  order  has  been  placed  iato  the  plan  it  is  immediately  visualised 
*  ,k*  G**U  Already  dunag  the  still  ongoing  planning  process,  every  order 
that  has  been  planned  so  far  can  be  clicked  with  the  mouse.  A  new  window  appears 
and  show*  tha  details  of  the  etched  order,  tha  resetting  costs  it  has  caused,  and  the 
reason  why  it  has  been  placed  into  this  posit**  in  the  plan.  This,  a*  explanation 
facility  is  available  for  the  planner,  enabling  him  to  modify  or  add,  if  necesttry,  any 
constraints  (like  placing  one  order  directly  after  another  one)  or  any  planning  rules. 


The  experiences  with  the  HoPIu  system  were  similar  as  with  the  Sando>  plan¬ 
ning  system  described  above  at  far  as  the  database  access  and  the  me  of  a'  typed 
logic  programming  language  are  concerned  The  modification  and  addition  of  the 
knowledge  base  is  quite  easy  In  addition,  the  window  based  user  interface  made  it 
much  easier  to  work  with  the  system,  e  g  to  see  what  effects  a  change  of  the  colour 
intervals  or  changes  or  additions  of  planning  rules  have 

The  typical  time  to  produce  a  plan  with  300  orders  is  approximately  10  to  18 
nunutts  in  out  current  implementation.  Optimizing  the  run  time  of  the  system  is 
possible,  bnt  one  has  to  be  artful  not  to  trade  run  time  efficiency  for  flexibility 
while  stifl  in  a  prototype  phase.  A  detailed  comparison  of  the  HopPlu  system  with 
the  system  developed  at  Hoechst  and  implemented  in  OPS8  h*s  still  to  be  earned 
out.  Currently,  wejsre  planning  to  add  further  functionalities  to  the  system  which 
ate  in  the  focus  of  the  second  phase  of  the  PROTOS  project,  namely  capacity 
iaformation  for  prodeution  hues,  ware  houses,  etc. 
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PPS  Views:  Scheduling  and  Replanning 

G.Michaux  . 

BIM /  .  , 

KwUcstnut.4  -  3078  Everberg  (B) 

Formany  years,  planning  and  scheduling  applications  are  the  field 
of  intensive  research  and  development  in  operation  research  as 
well  as  in  Artificial  Intelligence;  in  particular  through  the  EUREKA 
protect  PROTOS.  We  give  main  conclusions  drawn  from  our  experi¬ 
ence  in  this  field.  After  a  brief  description  of  the  production  plan¬ 
ning  and r scheduling,  process/'we  rive-some  outlines  of  the 
integrated  environments  we  developed.  In  spite  of  their  success  in 
real  world  PPS  applications,  such  tools  including  scheduling  algo- 
rithms  and  graphical  facilities  Were  too  restrictive.  People  in  charee 
of  PPS  applications  let'appear  new  requirements,  especially  with 
respect  to  replanning  and  reasoning  issues.  The  scxalled  descrip¬ 
tive  view  We  propose  to  approach  these  requirements  is  presented 
in  the  last  part  of  his  paper.  General  conclusions  deal  with  the  use¬ 
fulness  of  prototypes  within  the  framework  of  PPS  applications. 

1.  Introduction  "  . 

Am firial  Intelligence  techmoues  currently  present  a  growing  inter- 


merous  developments,  have  own  achieved  in  order  Jo  obtain 
knowledge  based  systems  related  to  PPS  applications  [OUPT  88, 
AIESM  9%  PROTOS  88,  PROTOS  9«J.  These  systems  aim  at  «>mirie- 
menting  the  functionalities  and  scope  offered  by  existing  MRFH 
systems.  These  deal  with  longterm  planning  and  manage  informa¬ 
tion  such  as  sales  orders,  work  orders/bills  of  materials,  as  well  as 
-inventory  status.  These  systems  however  present  major  restric¬ 
tions:  they  are  heavy  environments  and  not  well  adapted  to  the  dal¬ 
ly  changing  conditions  of  the  busy  real  world  lire.  Furthermore,  the 
actual  capacity  of  the  plants  is  not  taken  into  account.  There  is  defi¬ 
nitely  a  gap  between  dassical  MRPH  systems  and  the  daily  practice 
of  the  plan  managers.  This  paper  aims  at  presenting  results  of  the 
experience  we  acquired  in  practical  applications  In  the  field  of  job 
shop  scheduling,  discrete  parts  manufacturing  as  well  as  of  scien¬ 
tific  experiments  planning.  Commercial  contracts  and  the  EUREKA 
project  PROTOS2  enabled  us  to  propose  methodological  views  on 
the  concerned  scheduling  problems  as  well  as  to  build  executable 
models  [Mlchaux  90]  Mam  condusions  of  these  developments  first 
relate  to  the  scheduling  process,  i.e.  how  the  resources  of  shop 
floors  are  allocated  to  production  steps.  We  then  mention  the  en¬ 
largement  of  this  view  -  such  as  proposed  in  the  second  phase  of 
PRuTOS-  and  show  how  reasoning  and  replanning  issues  could  be 
integrated.  Genera]  conclusions  .emphasize  the  development  of 
Prolog  based  explorative  prototypes  as  a  powerful  mean  to  work 
out  the  many  facets  of  complex  applications  such  as  PPS  ones. 

2.  Scheduling  Issue 
2.1  Problem  overview 

The  PROTOS  problem  we  refer  to  In  this  paper  concerns  the  sched¬ 
uling  of  a  few  dozen  work  orders  which  might  be  achieved  accord¬ 
ing  to  one  production 'routine  selected  from  a  set  of  routines, 
themselves  composed  of  1  to  20  steps.  The  application  deals  with  a 
multi-purpose  apparatus  environment.  There  exists  no  commercial 
shell  to  manage  or  support  the  following  activities  whi  h  are  per¬ 
formed  m  many  PPS  areas,  even  within  the  restneted  framework  of 
one  single  shop  floor 

-  order  acceptance:  all  the  work  orders  must  not  be  necessarily 

filanned.  Because  of  the  numerous  factors  conditioning  the  PPS  ac- 
ivities,  the  plan  managers  have  the  responsibility  to  select  which 
orders  they  want  to  schedule.  They  can  also  suggest  modifications 
of  these  orders; 

-  industrial  engineering;  .re  organizational  constraints  according  to 
which  the  resources  are  involved  In  the  production  recipes  cannot 
be  considered  as  static  informatioa  They  might  be  regularly  updat¬ 
ed  to  take  into  account  the  continuously  evolving  techniques  of 
production  as  well  as  exceptional  opportunities  or  Impossibilities 
to  use  particular  resources.  Support  is  needed  to  ensure  the  consis¬ 
tency  of  the  knowledge  base  implementing  the  organizational  con¬ 
straints; 

-  scheduling:  classical  operations  research  techniques  failed  to  find 
solutions  for  real  world  applications  because  the  combmatonal  ex¬ 
plosion  phenomena  is  tremendous  in  these  applications.  Order-ori¬ 
ented  algorithms  perform  breadth-first  searcMn  the  deasional  tree 
corresponding  to  the  successive  selections  of  appropriate  routes, 
production  steps,  resources  and  time  windows  [PROTOS  90]  As 
many  scheduling  algorithms,  they  however  rely  on  semi-cmpincal 
heuristics  which  refer  to  some  local  property  of  the  PPS  problem; 

-  decision-making*  beyond  the  order  acceptance,  the  plan  managers 
may  also  decide  to  add  additional  constraints,  whatever  the  rea¬ 
sons  therefore -internal  or  external  to  their  shop  floor  For  example 
they  may  want  to  impose  a  given  production  route  for  one  particu¬ 


lar  product  within  a  given  time  window.  The  constraints  as  well  as 
the  goals  evolve  dynamically  according  to  the  current  PPS  context 
which  is  changing  and  evaluated  on  run  time  by  the  user. 

2.2  Executable  Environment 

Providing  thepian  manager  with  efficient  support  requires  to  inte¬ 
grate  these  different  facets  into  one  integrated  environment  Part  of 
Sie  PROTOS  protect  was  to  build  one  explorative  prototype  reflect- 


plemented  in  Prolog.' It  was  actually  in  use  in  several  dye  stuff 
plants.  It  is  a  multi-windowing  system  capable  of  integrating  some 
scheduling  skills:  /•'  , 

>  work  load  and  application  management:  the  work  orders  can  be* 
retrieved  and  selected;  arid  their  properties  updated  after  some  role 
checking  -e.g.;  the  duration  of  production  has  to  fulfill  some  defaul  t 
rule  based  on  the  recipes  but  this  rule  can  be  overridden  if  the  user 
confirms  a  different  option!  Besides,  the  order-oriented  scheduling 
algorithms  can  be  launched;  ’*  . 

-  shop  floor  model*  the  production  routes  are  displayed;  with  high¬ 
lighting  features  in  order  to  draw  user's  attention  on  the  selected 
resources  or  on  the  conflicts.  Besides,  these  routes  may  be  interac¬ 
tively  created  or  updated  at  run  time; 

-  plan  control:  its  first  purpose  is  to  allow  the  user  making  a  visual 
evaluation  of  the  current  plan.  The  allocations  -for  production  or 
maintenance-  can  be  seen  through  different  filters;  built  interactive¬ 
ly  at  run  time.  Plan  control  enables  to  simulate  the  planning  activi¬ 
ties « of  the  plan  managers:  these  can  select: either  work  orders, 
production  steps  or  allocations.  Scheduling  actions  can  be  applied 

t  to  these  selected  items  in  order  to  modify  the  existing  schedule,  e  g. 
through  the  simple  shift  of  Gantt  charts  or  by  decreasing  the  nor¬ 
mal  duration  of  production.  The  conflicts  are  notified  in  a  message 
window; 

-  planning  survey:  providing  the  plan  mangers  with  one  additional 
view  of  the  plan. 

Many  others  facets  could  have  been  added,  such  as  the  integration 
of  shop  floor  tracking  or  yet  the, generation  of  diagnosis  reports. 


>As  they  felt  very  comfortable  with  this  tool,  this  development  en¬ 
abled  to  face  the  well  known  knowledge  acquisition  bottleneck 
(Prtrau  87], 

3.  Replanning  Issue 
3.1  Hierarchical  PPS  View 

Planning  generally  deals  with  the  sequence  of  tasks  leading  to  the 
achievement  of  orders  while  scheduling  relates  to  time  assignments 
to  such  tasks.  Actually,  the  distinction  between  both  concepts  is 
practically  rather  fuzzy.  Moreover,  plan  mangers  are  concerned  by 
both  replanning  and  re-scheduling  activities.  Taking  into  account, 
new  orders  or  yet  events  occurring  m  the  resources  park  obviously 
involves  to  re-schedule  some  parts  of  the  existing  plan.  Adequacy 


'critical'  scheduling  operation  [Keng88J,  these  algorithms  just  per¬ 
form  a  rough  and  simple  management  of  scheduling  actions*  these 
always  attach  time  frames  to  resources  Moreover  they  must  still  be 
applied  to  a  predefined  set  of  work  orders.  Qualitatively,  the  plan 


assessments  on  the. current  plan^Rathe^  than  focusing  on  the 
chances  for  particular  scheduling  actions  to  succeed  (or  fail),  they 
use  a  mixed  reasoning  At  the  same  time,  they  consider  the  critical 
production  steps  and  also  estimate  the  opportunity  to  plan  which 
orders  could  be  scheduled  partially,  completely  or  not  at  all  They 
play  with  more  flexible  planning  constructions  which  tolerate  both 
incompleteness  or  even  inconsistency.  The  local  scheduling  prob¬ 
lem  is  actually  an  open  one,  in  continuous  exchange  with  the  logis¬ 
tic  departments  ana  the  plan  mangers  who  cooperate  m  order  to 
define  the  master  schedule.  More  than  just  solving  scheduling 
problem  at  the  level  of  individual  plants,  the  planning  and  schedul¬ 
ing  process  continuously  evolves  according  to  an  hierarchical  deci¬ 
sion  schema.  Such  as  proposed  in  PROTOS  [PROTOS  90],  'glob?] 
planning*  would  cope  with  the  information  normally  managed  by 
the  MRFII  systems  but  integrating  moreover  the  current  -changing- 
cap  ad  ty  of  the  plants,  while  'distnbuted  scheduling*  would  sped- 
fy,  for  the  different  production  plants,  the  work  orders,  i.e.  the  in- 

Sut  tc  the  'local  scheduling*  activity  of  these  plants. 

.2  PPS  Descriptive  View 

Mastenng  this  local  scheduling  -and  planning-  activity  ask s  for  be¬ 
ing  able  to  assess  a  few  general  statements  on  the  PPS  problem. 
These  statements  should  help  in  focusing  on  the  targeted  goal  by 
restarting  the  number  of  possible  tasks  and  constraints  to  be  con¬ 
sidered  at  the  most  detailed  level.  Managing  such  statements,  with¬ 
out  hampering  the  solving  process  at  this  level,  is  the  concern  of 
planning  techniques  These  need  desenotors'n  qualify  the  state  of 


the  concemedapplication-'ln'real  .world  PPS  applications,  tl 
working  basis  on  which  the  plan  managers  assess  their  dedsio: 


is,  the 

„  ^  r, decisions 

does  hot  always  take  into  account  all  constraints.  In  buildinga 
plan,  the  plan  managers  may  very  well  focus  on  the  reservation  of 
apparatus  for  some  specific  operations.  On  the  opposite/ for  other 


_ _  , _  _ , .  Ji  managers' 

first  assume  they  will  always  find  a  solution.  In  the  practical  appli¬ 
cations  we  encountered,  we  could  more,  generally  distinguish  es¬ 
sentially  relaxable,  i.e.  very  soft,  constraints  from  hard  constraints. 
Fulfilling  the  latter  should  be  automatically  solved  by  the  system: 
the  soft  ones  could  be  ignored  but  taken  later  into  account  in  an  in¬ 
teractive  process  between  the  user  and  the  system.  The  user  would . 
be  guided  through  dialogs  in  searching  solutions  for  the  soft  con¬ 
straints,  on  basis  of  his  evaluation  of  the  current  context  of  the  shop 
floor  as  well  as  suggestions  made  by  the  system /The  scheduling  ac¬ 
tions  which  can  be  applied  to  solve  a  conflict  depend  on  the  charac¬ 
teristics  of  these  conflicts.  It  is  the  purpose  of  the  descriptors  both 
to  achieve  such  characterization  ana  to  filter  the  appropnate  sched- 
uling  actions.  Integrated  into  a  hierarchical  planning  framework, 
such  descriptors  would  enable  to  avoid  dealing  directly  with  the 
innumerous  detailed  allocations  of  the  PPS  wond  but  well  to  rea¬ 
son  on  some  descriptions  of  the  schedule  or  of  conflicts.  This  is  the 
reason  why  this  approach  is  called  "descriptive  view*. 

The  figure  shown  oelow  presents  a  very  simplified  PPS  case  where 
one  order,  02,  has  to  be  scheduled  on  one  of  two  possible  appara¬ 
tus,  Ml  or  M2;  in  a  context  where  2  single-steps  work  orders/  OO 
and  01,  are  already  scheduled.  In  order  to  schedule  this  order,  one 
is  forced  to  create  conflicts  with  existing  allocations  for  both  avail¬ 
able  machines. 


M2 

Ml 


i::-i 

;arr 


Possible  Tasks: 


• change  machine 
•  move^allocatlon 
•reduce  amount 


JO  time 


even  tor  such  a  simple  problem,  wiil.ng  lo  scneduie  the"order  2~c5n 
give  rise- to  numerous  solving  paths.  Scheduling  actions  could 
modify  the  existing  allocations  -  and  allow  to  Insert  the  new  order 
without  conflict-  but  these  scheduling  actions  may  well  Introduce 
new  conflicts:  for  example,  moving  the  order  1  -using  same  ma¬ 
chine  as  order  2  -  could  violate  a  precedence  constraint  on  the  or¬ 
ders  0  and  1.  Hierarchical  planning  techniques  have  been  proved  to 
constitute  the  basis  to  guide  the  reasoning  of  planning  experiments 
-in  the  field  of  molecular  genetics  ISteflk  88).  The  desenptive  view  we 
propose  aims  at  organizing  the  knowledge  according  to  a  hierar¬ 
chical  representation  enabling  to  reason  on  the  possible  scheduling 
tasks  in  order  to  solve  well  specified  PPS  problems.  Wilting  to 
achieve  one  goal  -here,  to  remove  one  overlapping  conflict-might 
ask  for  considering  a  large  number  of  allocations  (PPS  world).  This 


achieved,  such  as  iff 


overlap 


_ .  change  allocation 

no-overlap  ^  _ 


reduce  amorinW  small  ^ 
uOYcairocatloir  xv 

O  overlap 

change  machinK^ 

—  large 


it  must  be  emphasized  that  the  evaluation  of  some  descriptors  oc- 
airring  in  a  solving  process,  for  example  the  Importance  of  an  over- 
lapping  conflict  -  small  vs,  large-  might  be  prompted  to  the  user.  A 
lot  of  detailed  features  indeed  determine  the  functioning  of  the 
physical  resources;  and  sometimes  may  dramatically  influence  the 
decisions  taken  by  the  plan  managers.  One  should  consider  that  all 
production  requirements  might  not  necessanly  be  known  by  the 
system  for  example,  intermediate  products  might  be  transferred 
into  another  production  center  where  other  resources  are  avail¬ 
able-  but  3ucn  a  transfer  depends  on  several  specific  circumstances 
quantity,  nature  of  the  product  The  goal  is  to  support  dialogs  be¬ 
tween  the  user  and  the  system  and  to  integrate  into  the  solving  pro¬ 
cess  some  judgements  the  user  might  declare  at  run  time. 


4.  Conclusion  • 

Since  the  production  planning  and  scheduling  process  is  a  complex 
and  ill-structured  domain,  many  conclusions  could  be  drawn  from 
the  applications  we  encountered.  We  want  here  to  point  out  two 
major  ones.  First;  the  usefulness  of  explorative^  prototyping  has 
been  proved  in  the  field  of  Artificial  Intelligence.  In  dynamics  con- 
texts  such  as  encountered  in  large  business  companies,  we  again 
confirmed  that  explorative  -and  quick-  prototyping  are  requiredin 
order  to  capture/  and  even  to  create,.  the  domain  knowledge.  It 
must  be  stressed  that  Prolog  a  logic  programming  language  In¬ 
cluding  an  inference  engine  and  defining  a  rule  syntax,  can  play  a 
major  role  in  such  contexts.- It  indeed  provides,  in  one  unified  cor¬ 
pus,  a  .high  level,. conceptually  sound,'  readable  and  executable 
specification  of  the  problem., Finally,  for, many  years,  scheduling 
has  received  a  lot'ofattention  from  operations  research.  However 
because  of.the  NP-hard' nature  of  the  real  wortd,  problems,  and 
though  achieving  perfoimant  algorithms  still  remain  an  intensive, 
research  field;  the  major  concern  of  plan  managers'  appears  more 
and  more  as  being  able  to  reason  on  their  daily  scheduling  practice 
The  experts  now  expect  tools  capable  of  guiding  them  in  the  way  to 
solve  specific  PPS  problems  through  interactive  dialogs  with  the 
system. 
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The  Fisher  Hypothesis  and  Uccertaioty 

The  Fisher  Hypothesis  has  been  the  subject  of 
extensive  empirical  and  theoretical  analyses  since  it 
was  proposed  by  Fisher.2*3  The  Fisher  hypothesis 
assumes  r that  noainal  interest  rates  are  related  to 
anticipated  price  changes,  leaving' the  real  interest 
rate  constant.  In  other  words,  the  nominal  interest 
rate  can  be  expressed  as  the  sia  of  two  cocpccents: 


R 


-  r  ♦  a 


(1) 


where  &  is  the  nominal  interest  rate,  r  is  the  real 
interest  rate,  and  a  ia  anticipated  -inflation. 
Classical  tests  of-the  hypothesis  replace  unobservable 
anticipated  inflation  in  equation  (1)  with  a  weighted 
lag  distribution  of  past  price  increases,  (Ivj  p,/ 
£,-£>,  wh«r«  the  vj  are  the  distributed  lag  weights. 
The  Fisher  hypothesis,  as  a  statistically  testable 
proposition,  takes  the  following .fora: 


(2) 


V  Ft  - 
R  -  o  Sjv.  — -  ♦  t. 

t-i 

where  cj  is  a  randoa  error  tern. 

Equation  (2)  is  a  dysaaie  impulse  response 
equation.  For  paraaeter  estimation,  the  lag  structure 
is  truncated,  and  the  following  equation  is 
considered: 


't  *  °  *  «o*t  *  elVl  ♦•••■*  * ‘c  <’> 

with  ifoei<  ~  *nd  *t  **  the  r4te  °*  chsnge  in  the 
price  level.  One  important  concern  shout  equation  (3) 
is  the  fact  that  this  fora  precludes  the  existence  of 
feedback  between  interest  rates  and  prices. 

A  second  aspect  of  the  hypothesis  that  deserves 
further  attention  is  theoretical  in  nature.  The 
theoretical  fora  of  the  Fisher  hypothesis  can  be 
derived  froaa  sicple  choice-theoretic  model.  Market 
participants  nust  select  between  present  and  future 
consuaption  cl* ins  in  an  intertemporal  aodel.  The 
details^  of  this  approach  are  presented  by 
Hirahleifer.  This  aodel  is  not  discussed  in  this 
short  version  of  the  paper,  but  a  longer  version  is 
available  froa  the  authors  by  request.  We  proceed  in 
this  psper  to  a  direct  extension  of  the  fraaevork. 

If  we  reconsider  the  subject  within  the 
theoretical  fraaevork  of  Debreu1  and  Mirshleifsr4, 
market  agents  must  consider  the  possible  stales  of  the 
world  upon  which  the  receipt  of  future  consuaption 

cWiu  depend,  let  tj,  . cj  be  •  eet  «f 

alternative  consuaption  claias  for  future  tiae  period 
1  and  *^f  ig,  «  •  «f  ijj  the  respective  subjective 
probabilities  assigned  to  the  alternative  claias  at 
tiaa  1.  This  o*y  easily  bt  generalised  to  M  periods 
with  the  same  set  of  respective  probabilities, 
assuaing  the  independcrce  of  the  assigned 
probabilities  with  respect  to  tiae.  The  behavioral 


rule  that  governs  the  decisions  of  market  agents  in 
this  context. is  the  naxiautico  of  expected  utility 


A<cJ . «£  ♦  -  -  -  *  . e?> 


where  v  is  the  utility  ftsctioa  cf  a  representative 
individual.  Aj  sari  eg  caly  esc  future  tiae  period  (1 
versus  present),  and  two  alternative  states  A  and  B; 
the  maximization  of  expected  utility  tsder  a  given 
budget  constraint  yields  the  following  equilibrium 
conditions: 
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where  0|A  and  0jg  are  real  future  prices  associated 
with  alternatives  A  and  B  respectively,  while  and 
r*  are  risky  real  rates  for  alternatives  A  and  B. 

Equations  (A)  and  (5)  ioply  that  real  interest  rates 
sre,  in  a  world  of  uncertainty,  also  functions  of  the 
probabilities  assigned  to  future  events  snd  the  aurket 
participants’  utility  functions.  The  iaportance  of 
this  result,  with  respect  to  the  Fisher  hypothesis,  is 
the  following. 

We  can,  more  realistically,  write  *g(A),  x&(A), 
.  •  in(A)  where  A  is  a  general  set  of  information 

that  is  utilized  by  a  representative  market  agent  in 
the  current  period  to  assess  the  possible  states  of 
the  world.  This  fora  iaplies  that  the  probabilities 
assigned  to  future  events  are  based  on  all  currently 
available  information.  The  incorporation  of  a  new 
signal  can  affect  the  subjective  probabilities  of  a 
representative  agent  and  hence,  through  equations  (4) 
and  (S),  the  real  interest  rate.  It  is  clear  that  the 
real  interest  rate  may  be  subject  to  change,  even  over 
a  short  tiae  period,  given  this  context  iM  a  modem 
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tcca any:  with  vtll*4mle^  his  tedis  mad- relatively 
efficient  mrlrts.  I:  is  criitst  that  ite  fisher 
hype thesis  is  illicitly  restricted  to  the  utilization 
of  ssfomtiea  ceeceraisf  ccljr  the  future  depreciation 
of  the  aecetxr jr  east  ia  t  mosey  tccoacy. 

Tins  the  informatics  set  implied  by  e^utioa  (3) 
is  JU  *  (I) where  (Z)is  the  set  of  prescat  sal  pest 
Uflstiea  rites.  Tie  icfomticn  set  suggested  by 

•4  ait  ices  (4)  zzd  (5)  is  *£  •W'X'i  vterr  1  is  the 
icfomtioQ  sebset  tilt  includes  other  types  of 
icfomttM  (excluding  inflitioa)  £a  the  assessment  of 
future  stores  cf  the  world. 

Furthermore,  these  two  approaches  yield  two 
differ eat  statistical  type theses  that  caa  be  tested. 
Tie  Fisher  hypothesis  implies  eqaitica  (3)  as  a 
statistically  testable  vector-autoregressive  or 
wetor-AWKA  fora.  The  distcrbsace  tern  is  an 
imdepeedeat  identically  distributed  shite  aoise 
process  which  is  assumed  to  be  no correlated  with  both 
the  explanatory  *ad  depeade at  variables. 

Tie  statistical  for*  implied  by  eqiutioat  (4)  and 
(5)  is  more  subtle  and  requires  additional 
explanation.  Acy  new  informatics  1- ,  where  lj  is  an 
«!««3t  of  set  1,  alters  tbe  overall  information 
structure,  A.  Consequently  the  probabilities  *A,  *R, 
.  .  i(  are  altered  since  they  are  functions  of  A. 
A  charge  ia  the  probabilities  changes  the  real  price 
of  future  consumption  claies  in  terns  of  present 
conscript  ion  cIiiks,  asd  therefore  the  real  interest 
rate  changes  through  equations  (4)  and  (S).  Beal,  bond 
prices  (inversely  related  to  interest  rates)  will  tend 
to  increase  since  bonds  are  assets  that  give  claim  on 
future  consumption.  If  the  situation  is  characterised 
by  normal  capacity  utilization,  efficient  processing 
of  information,  continuous-past  increases  in  the  price 
level,  and  a  two-asset  (bond -commodity)  market;  any 
new  indorsation,  ij,  that  increases  the  premium  on 
present  consumption  versus  the  future  consumption 
would  theoretically  have  a  positive  effect  on  the  bond 
supply  and  a  negative  effect  on  the  real  demand  for 
bonds.  This  would  exert  downward  pressure  on  the  real 
interest  rate.  Conversely,  the  increase  in  tbe 
prenjus  cn  present  consumption  claies  will  increase 
current  (real)  aggregate  demand,  thus  putting  upward 
pressure  on  tbe  price  level  and  **fueling-up"  tbe 
inflationary  process  if  tbe  increase  in  real  output 
cannot  totally  absorb  the  increase  in  current 
*ttregate  demand.  Accordingly,  new  information  lj 
incorporated  in  the  set  A  may  simultaneously  lead  to 
an  increase  in  both  the  inflation  and  real  interest 
rates.  The  observable  nominal  interest-rate  will  then 
increase  due  to  the  increase  in  real  rate. 

The  real  causal  factor  in  this  scenario  is  tbe 
incorporation  of  new  information  in  a  representative 
individual's  information  set,  A.  The  Xj  are  usually 
incorporated  an  the  disturbance  term  since  these 
factors  are  usually  nonquantifiablc.  The  implication 
is  clear.  Any  regression  of  nominal  interest  rates  on 
the  inflation  rate  will  be  affected  by  the  correlation 
of  both  variables  with  the  disturbance  term.  This 
renders  efficient  estimation  impossible.  If  the 
adjustment  of  both  the  commodity  and  bond  markets  to 
new  information  as  rapid,  under  the  assumption  of 
efficient  markets;  inflation  and  the  nominal  interest 
rate  will  be  observed  contemporaneously  (or  nearly 
contemporaneously  if  there  is  a  slight  adjustment  lag) 
correlated  with  the  disturbance  term. 

The  remaining  issue  is  the  following.  Can  this 
theoretical^  proposition  be  statistically  verified? 
Toe  conventional  econometric  approach,  using  a  reduced 
form  or  a  simultaneous  equation  model,  can  be 
ineffective  given  the  problems  of  correctly  modeling 
the  information  seL  A  and  specifying  the  correct  a 
priori  restrictions.  There  are,  however,  techniques 
that  are  designed  for  estimating  parameters  in 
wltipls  equations  given  time  series  data. 


Malt i variate  models  are  used  to  represent  two  or 
more  multivariate  stochastic  processes.  The  general 
form  of  the  model  can  be  written  as  follows:* 


4(B)2t  -  tit )It 

(6) 

insert 

4(8)  *  X  -  4jB-  .  . 

V- 

(7) 

0(B)  *  X  -  OjB-  .  . 

-  *vQ- 

(8) 

In  the  above  formulation  4(B)  and  0(B)  are  kxk 
matrices  of  autoregressive  and  coving  average 
operators  respectively.  Both  4(B)  and  0(B)  are  matrix 
polynomials  ia  the  backward  shift  operator  (B)  of 
degree  P  and  Q  respectively.  Tbe  vector  £„  contains 
tbe  variables  to  be  included  in  tbe  moftivariate 
stochastic  analysis.6  Finally,  a£  is  a  vector  of 
random,  independent,  identically  normally  distributed 
shocks. with  mean  zero  and  covariance  matrix,!. 

Parameter  Estimation  in  the  Multivariate  Model 

We  initially  estimated  a  Kiltivariate  model. with 
the  nominal  interest  rate  and  inflation  rat>  as 
variables  for  the  1939-1983  period.  A  total  of  292 
monthly  observations  were  used.  Tbe  data  for  interest 
rates  are  90-day  monthly  treasury-bill  yields  from  the 
Federal  Reserve  Bulletin.  The  inflation  rate  series 
is  constructed  by  taking  the  first-difference  xof  tbe 
.natural  logaritha  of  the  consumer  price. index  (JPI). 

The  identification  and  estimation  stages  suggested 
the  following  model 

<i-«6b6)Ju  »  (I  -  (9) 

as  the  most  adequate  and  parsimonious  model  on  the 
basis  nf  diagnostic  checking.  The  2x1  vector,  2  , 
contains  the  transformed  ^inflation  and  nominal 
interest  rate  sereis,  and  a  is  a,  2  x  1  vector  of 
independently  distributed  random  shocks.  The 
parameters  are  contained  in  the  2x2  matrices  4g,  Oj , 
and  62*  The  residual  diagnostic  checking  confirms 
that  the  model  is  adequate;  that  is,  the  model 
transforms  all  the  sample  cross-correlation  matrices 
into  realizations  of  white  noise  processes. 

The  generating  mechanisms  for  the  data  are  more 
easily  seen  using  the  following  explicit  form: 

<!*.216B6)Xt  -  (l-.763B)a,t«  .004B*2[  (10) 

(I*.I70B^)R  • 

1  ,  (11) 
(5.174B  -  3.53B)alt,.(l*.«128>  «n 

In  equations  (10)  and  (11)  is  the  first  difference 
of  the  logaritha  of  nominal  interest  rates,  Xt  is  the 
first  difference  of  the  logaritha  of  the  inflation 
rate,  a},  is  the  innovation  driving  the  inflation 
rate,  and  a2t  is  the  innovation  driving  nominal 
interest  rates. 

The  model  is  dominated  by  the  moving  average 
terns.  The  inflation  rate  series  is  driven  by  the 
innovations  of  the  interest  rate  with  a  short  lag,  and 
the  interest  rate  is  driven  by  lagged  innovations  of 
the  inflation  rate.  The  series  show  strong  seasonal 
autocorrelation.  The  distributed  lag  of  the  actual 
inflation  rate  does  not  have  a  significant  effect  on 
the  current  nominal  interest  rate,  suggesting  that  the 
information  relating  to  past  inflation  rate  changes 
may  not  have  a  very  signficant  role  in  the  overall 
information  set  A.  The  overall  results  are  strongly 
suggestive  of  the  alternative  hypothesis  put  forward 
in  the  previous  Section  since  it  is  the  inflation  rale 
innovations  that  explain  nominal  interest  rate  rather 
than  the  inflation  rates  itself. 

To  extend  the  analysis,  the  multivariate  model  was 
re-estimated  after  including  a  money  supply  vector. 


TA8L£  1 


The  xoclufioo  of  Che  money  supply  is  suggested  by  the 
following  reasoning.  The  observed  correlations 
between  the  inflation  end  interest  rate  innovation 
my,  to  a  certain  extent,  be  explained  by  the  joint 
correlations  of  both  series-  with  the  money  vector. 
The  aoney  supply  series  is  the  real  Monthly  MlA 
series,  transformed  into  a  stationary  form  by  the 
subsequent  operations  of  a  logarithmic  transformation, 
first  differencing,  and  seasonal  differencing.  The 
re-estimated  Multivariate  aodel  with 'a  three- eleaent 
2  vector  suggested  the  following  form: 

1  (12) 
(I  -  »6B6)it  •  (I  -  OjB1  -  e2»2)  (i-e12B12)It. 

The  paraaeters  in  equation  (12)  were  estimated 
using  the  conditional  likelihood  Method.  The 
estimated  coefficients  are  presented  in  Table  1. 
Equation  (12)  indicates  that  there  is  a  relatively 
strong  correlation  between  the  aor.ey  supply  and 
nominal  interest  rates.  In  fact,  this  correlation  is 
saich  stronger  than  that  between  the  interest  and 
inflation  rates.  In  general,  all  three  series  are 
driven  by  the  innovation  tens,  but  the  lag  structure 
is  complex  because  of  the  multiplicative  seasonal 
polynomial.  The  innovation  lag  structure  is 
simplified  by  eliminating  those  parameters  that  are 
not  significantly  different  from  zero,  but  still  the 
structure  is  complex. 

This  analysis  indicates  that  the  information  set  A 
also  responds  to  money  supply  movestenls.  The 
correlations  among  the  innovations  of  the  different 
series  suggest  that  unsystematic  information  X|  it 
simultaneously  utilised  in  the  bond,  money,  and 
commodity  Markets;  and  this  information  may  be 
relevant  in  the  determination  of  short-run  decisions 
in  all  three  Markets.  Money  supply  announcements  seem 
to  be  a  major  factor  in  determining  the  short-run 
variations  of  nominal  interest  rates. 

Conclusions 

An  alternative  approach  based  on  the  previous  work 
of  Debreu  and  Hirshleifer  is  proposed  for  the  analysis 
of  nominal  interest  rates.  The  essential  point  of  the 
analysis  is  the  importance  of  new  signals  as  a  basis 
for  decision  making  in  an  uncertain  world.  The 
analysis  suggests  that  the  Fisher  hypothesis  alone  can 
be  a^  too  restrictive  approach  in  explaining  the 
behavior  ojf  nominal  interest  rates  over  time.  The 
unsystematic  information  that  changes  the  subjective 
probabilities  assigned  to  the  possible -future  states 
of  the  world  can  be  at  least  as  important  as  the 
information  in  the  vector  of  past  inflation  rates, 
piis  suggests  the  possibility  of  an  unstable  real 
interest  rate,  even  in  the  short-run.  The  test  of 
this  proposition  versus  the  Fisher  hypothesis, 
however,  poses  difficult  problems.  Part  of  these 
problems  can  be  overcome  by  using  recently  developed 
sultivariate  stochastic  analysis  techniques.  The 
application^  of  these  techniques  to  a  monthly  data  set 
gave  empirical  results  supporting  this  hypothesis. 
The  results  cannon  be  interpreted  as  conclusive  given 
the  subjective  criteria  involved  in  the  testing 
procedure,  but  they  are  strongly  suggestive  of  a 
mechanism  more  complex  than  the  one  implied  by  the 
Fisher  hypothesis.  This  follows  from  the  importance 
of  information  other  than  the  actual  inflation  rate  in 
the  determination  of  nominal  interest  rates  in  a  world 
of  uncertainty. 


Estimation  of  the  Extended  Multivariate  Model 

Series  1:  Actual  Inflation  Bate 
Series  2:  nominal  Interest  Bates 
Series  3:  Real  Money  Supply  (Hl/P) 

Method  of  Estimation:  Conditional  Likelihood 
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RELATIVE  FCaECASTIVC-  FEXFORNANCE  Of  A*  ECOSOKETEIC  NOSEC 
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Dept.  of  EctMilci.IlUnt  Vul.CiUrt.Tirker 


Abstract:  The  m!>  foces  of  the  »t«»<Iy  1*  to  model  end  forecast 
four  Australia*  quarterly  macroeconomic  scries*  Monetary  Aggregate 
.Ml  .  Real  Gross  Domestic  Frodect  (CDF)  .  Inflation  rate  and  *0-day 
tank  Accepted  IIU  Ratos  (RAR).  Forecasts  of  these  variables  are 
feaerated  by  a  ~  staple  nacroeconcmetrlc  model,  univariate  tiaa 
series  model  .  multivariate  VAX  sod  RYAX  models.  To  c oaf-are  like 
with  like  all  these  variables  are  expressed  Id  legs  except  Ike  BAR 
rates.  The  data  eied  to  estimate  these  models  are  quarterly 
observations  fro*  1R70II  to  itlllf.  lke^Mt  of  saaple  forecasts 
are  reaerated  for  the  period  lltll  »  IM11II.  The  forecasting 
performance  Is  judged  by  the  Root  K»»«  Sheared  Percentare  Error 
(RK3FX). 

In  specifying  the  aacroaodel  we  have  Introduced  Rational 
Expectations  Hypothesis  and  aa  Error  Correction  technique  to  take 
Into  account  of  the  le*E  rue  econoatc  theory  in  the  dynamic 
context.  Our  slaple  aodel  of  the  Australian  econoay  consists  of 

*•»  •  i  •  i  ’t-(  •  -A-.  *  -.'-'•".•I  * '» 

Tt  *1  *  Vt-I  -  Vt-4  A  *  V**f*  ~  ■"  1-4*  *  'll 

’  <« 

*,*,.«  *  V"--’1,-!  •  *4, 

where  a  growth  rata  of  soney  supply  ;  y^  a  real  COp  ; 

{»t  •  inflation  rate  J  R^  •  JO-day  EAR  rates  . 

The  shove  specification  has  been  retained  after  »  rood  deal  of 
experiments  with  different  specifications.  Now.  In  nodslllnr  tbo 
aultlvarlate  tlae  series  fraaewarka  w«  use  tha  Bans  four  variables 
expressed  In  ltft  except  t.  In  this  context  »•  will  consider  three 
different  VAR  models  using  I  (1)  variables  In  lor  levels  5  (11) 
lor  levels  plus  trend  components^  and  (111)  dlf ference-logged 
variables. The  four  VAR(p)nodela  which  hsva  bet* found  to  be 

adequateare  VARl(J)  .  VAR1(I)  .  VAAJ(J)  and  VAR2<4).  The  first 
two  refers  to  Ul  levels  with  trend  while  the  last  two  refers  to 
difference  loft.  Overall  ,  we  choose  VARl(l)  ss  the  best  adequate 
•odel  for  forecastle!  based  on  AlC  >  $C  and  *0  criteria.  The 
results  are  also  consistent  with  Sles  LR  test. 

RVAR  models  have  been  chosen  based  on  dlffernt  coeblnatloes  of 
prior  paraaetern  sad  adequacy  test  statistics  o(  Box*Fief<e  and 
LJvng.Rox.  Finally  .  the  RoxOenklna  approach  la  used  to  find  a 
parslnontous  ARINA  nodal  for  each  Of  the  lias  aeries.  This  could 
bo  used  as  a  benchaarh  for  comparison  with  Other  models ■  Tha 
aodels  selected  ss  most  sppropriata  sro  ARINA  (1.1.1)  .  (1.1.0)  . 
(1,1.1)  and  (2,1.1)  for  the  series  OOF  ,  Ml  ,  CPX  ,  and  RIB 
respectively  . 

Let  Us  now  briefly  compare  the  forecasting  performance.  The  out  of 
saaple  forecasts  Of  tha  macro  model  and  VARI  seem  to  predict  the 
future  path  of  actual  real  CDF  quits  wall  «  the  ARINA  model  end  to 
n  lesser  extent  RVAR  underpredict  the  sctusl  values  as  the  out  of 
saaple  forecast  Borises  increases  .  Tor  HI  .  tha  out  of  saaple 
forecast  of  tha  macro  modal  Is  not  slgniflcaatly  different  from 
the  actual  values  .  RVAR  stays  quite  Close  to  tho  actual  except 
for  the  Itliu  •  mmi  period  .  Aa  the  out  of  saaple  forecast 
horlsen  increases  .  Ya|1  ovsrpredlcts  tha  actual  whereas  ARINA 
Underprcdlcta  HI  .  For  CFI  the  forecasting  perforaence  In  quite 
Close  In  the  cases  of  VAX  ,  RVAR  and  nacro  nodal  .  ARINA 
overpredicta  the  actual  for  CFX,  In  tha  case  of  RAR  .  nacro  model 
does  the  lest  .  The  ARINA  ,  VaR  and  RVAR  models  miss  most  of  tha 
turning  points  by  a  significant  aaount  for  RAR. 

Tha  macro  modal  produces  better  forecast  than  the  ARINA  «  VAR  and 
RVAR  for  RJx  .  1001  and  itx  of  tha  cases  respectively.  Tha  RVAR 
does  batter  thsn  unrestricted  VaR  for  lit  of  the  esses,  then  v* 
compare  the  univariate  with  the  aoltlvarlete  time  series  models  , 
both  VAR  ana  RVAR  pet  f  0  re  better  than  tha  univariate  ARINA  for  (It 
and  190t  of  tha  cases  respectively  , 
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SFtCTJii  hums  c?  tmisi  notmu  puck 


iidrsej  Sctolcvshi 

iilint  VtittrtHj,  him,  Ttritf 
lulnj  of  £eeifliie*,'fr*bev,  ftlui 


Tiriitl  Iholeule  Price  Ida  cevert  1422  conodities  it  I3(  itei 
troipt.  Tie  vlsle  nrlet  hu  lets  divided  lito  23  sitseetors. 
Priie  luiitrj  State  IntUste  of  •  Statistics  ’  polishes  loithlp 
vtlies  of  vtolesale  price  iadez  (or  -  eich  sibsectc;  vitb  hue 
nlu  1})  mified  to  tVe  mrift  level  of  the  jcar  fill.  Data 
Ulei  for  march  premie!  it  tbit  piper  covert  tie  period  froi 
lump  1)22  tbroiph  iifut  1)1)  vhith  {ivet  23  tiie  series,  etch 
oie  ccuiititj  of  164  cbsemtiois.  bholetile  prices  rote  dirii( 
mlpted  period  (for  half  of  sibseotors  tote  tbu  30  tilts)  aid 
ill  Uit  secies  thovtd  stroif  eipcmtul  tread.  This  tread 
coipomt  its  lett  filtered  bp  tihiif  first  differeice  of  lo(  of 
obttrviKoit  (resiltiii  series  it  deleted  bp  (x^), 
(i>l,2,. ..,23;  t>), 133)).  friurp  hypothesis  of  research 
stated  tbit  ill  tiie  series  (sibstetersl  ire  (eieritei  bp  tbe 
sue  stochastic  process,  li  order  to  verifp  this  stiteiest 
spectre*  Its  bees  estiuted  for  tub  tiie  series.  Firm  viidov 
vis  tied  vitb  cit'Off  poiit  u2(.  tin  distaste  utrix  0  betveea 
23  tiie  series  bis  beet  cilcilited.  tiituce  mute  vis  defiled 


•J,  ■  J-  l  [u»  IlJlW.l/fJlto/.ll]1 

k«o 


If  speetul  estmtet  f>)  aid  f,(u)  were  cilcilited  froi  series 
*  J  * 

{derated  bp  tbe  sue  prceest  tbu  3. ,  follovs  chi-siured 
diitribitioi  vitb  (ill)  defrect  of  (reedoi  (fithtii,  livlit  (2J). 
JliUtet  utrix  Dbu  beet  truiforied  Itto  biurp  utrix  S  it 
vbieb  b^»J  vie*  i|j  vis  (tuter  tbu  corrtspoidiif  critical 
vilie  tales  fret  chi-stureJ  distribitiei  for  sifiificuce  level 
0.03.  Tbei  tbe  set  of  23  stbsectort  bis  ben  sintered  bp  tens 
of  vector  clUUttiti  ittbod  (Cboiitoubi,  Sclolcvshi  (!}).  Ibis 
utbed  creites  iroips  vitb  eleieits  (objects)  vbieb  ire  aituUl 
m*diffenai.  la  other  voids  tbe  stillest  tuber  of  ilWero 
ubiitriees  ire  tilei  cat  of  utrix  8  vitb  til  uit  dit|oul 


eleieits  to  be  selected,  be  but  fessd  tilt  Tirlisb  viol* tile 
urbet  coild  be  divided  uto  Uree  segmts,  so  there  ire  Uree 
differeit  stochastic  processes  vbieb  (estate  the  behuiotr  of 
vboleule  price  iadex. 


There  ire  oily,  fishery,  predicts  fa  tbit  sefiut.  Primly 
tru'sforied-  dita  cm  be  described  by  first-order 

utorcfresiioa  process  vitb  idditioul  periodic  coipomt  of  3 


Tits  iejitit  ccuists  of  tvo  sibseetors.  textile  ud  util 
uebiierp  I  vehicles.  Tbe  spectra  cf.ieu  ulits  (froi  tvo  tiie 
lerles)  shoved  10  seuouhtp  bit  strosf  tread  bu  beei  remit!* 
first  different!  of  (xj^  ire  veil  fitted  bp  first' order 
toviij  tverije  process.  Tiriuee  of  (x|l^  is  stitiourp. 

taiLIIi 

Ibis  is  tbe  uit  body  of  Tirhish  vboleule  urbet,  vitb  23 
sibsefieats  iuide,  so  tiers  ire  23  tiie  series  u  tie 
ret!  (ration  of  the  sue  stocbutic  process,  litocoviriuce 
utrix  bu  beu  estiuted.  Hunts  it  nil  ditfoul  potsned 
sifiificut  positive  treid,  vbit  irut  tbit  virlute  vitbii  tbit 
seftnt  vu  lot  statiourp.  Tbe  aulptis  of  sibdufoult  (ip  to 
tbe  lif  2<)  prooved  tbit  there  vis  10  tread  la  couriiuet.  Tbtt 
utocoviriuee  fnetiea  bu  beet  estiuted  u  tbe  uerife  i;d 
tode  froi  the  respected  ubdiifouls  of  utocmriuce  utrix. 
loth  estiutors  ltd  to  the  sue  couluioit.  Tbe  estiuted 
spectna  shoved  iiportut  seitoul  variation.  Seciise  for  eicb 
tiie  poiit  vt  bid  23  obit  ration,  tbti  ve  veto  ible  to  stidy  tbe 
tin  series  of  tuple  characteristics  of  urfiul  distribitiou 
«(  uilpttd  stocbutic  process.  Tbe  best  todtl  for  itu  ulie  vu 
first  order  utorefrenioi  vitb  stuoul  coipomt.  The  bebuioit 
oi  tbe  tode  vu  described  bp  jut  stuoul  utorejrniici 
$lt(32i«  The  uipittrp  mure  shoved  slijbtlp  dtcliiiif  treid. 


ft)  CboutoviU  Sololovtli  1.,  Iilmoaii  stnltcr,  Pmfltf 
Stitpstpuip,  1)11,  It  2 

(2)  Fnhtn  6.3. ,  liviit  M.»  Tbe  lulptis  of 
SuaUtioi-Cmnted  Tiie  Series,  rmftmt  Scieace,  HU, 
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AhSTHACTS-curlty  prices  cannot  be  colntegrated  it-  they "are  XrMiiM 
»5.n  effUUnt  -ariel.  The  nfff-1  •"  *rror  correct  ion 
^presentation  of  security  pric-s  would  UP  Jr  the  l.prove.e»t  of  the 
return  forecasts  vhleh  Is  against  the  weak  for»  efficiency.  In  this 
Study  .  ve  lnve.llg.te  the  dyna.lc  b.h.riouf  of  prices  of  three 
industry  based  portfolios  of  coeson  stocks  tr.ded  In  the  Istanbul 
Stock  Exchange  <ISE>  for  the  1988-89  period.  Although  no  pairwise 
cointexr.t ion  is  detected,  we  reject  the  nonalationarlty  of  •  linear 
eopbtftstio*  of  the  three  prices.  In  addition  .  the  estls.tion  of  the 
erro*  correction  representsllon  of  these  prices  Is  carried  out.  A 
rinding  rule  be*ed  on  this  representetion  gave  profits  which  are 

increases  our  doubt*  .boUt  the  weak ‘for*  efficiency  of  the  ISE. 
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TABLE  II  Resell*  of  three  variable  cointrgrsting  regression 


Dep.Vars 


Cl 


HI 


F8 


Constant 


•0  346  ( *4. IS) 
0  822  (10  32) 
3.S3Z  <22  33) 
0i80 
0  46 
-3.63 
>4.19 
0.12 


-0.437 '<-4  19)  0  36  <10  32) 
-  0  63  (21.38) 

I  294  <21  37) 

1.0X3  <2  439)  -1.54  (-3  95) 
0  96  0  98 

0.35  0.48 

•3.5  -4.01 


I.  INTRODUCTION 


On#  of  the.faaeu*  and  esplrloallr  tegular  Instance*  of  randos 
walk  occur*  In  stock  price  aerie*  In  the  light  of  the  recent 
d« vs lepeenta  in  the  theory  of  oelntegration  end  error  correction 
aechanlan*  led  by  Granger  and  V*lss(1963)  Ingle  en<3  Granger  (1987) 
Johan *»n( 1966 ) .  we  know  thet  certain  linear  combination*  of  aore  than 
one  nonstationary  tise  aerie*,  such  as  randon  walks.  say  be 

stationary  Such  nonatatlonary  series  are  said  to  be  colntegrated 
Granger  and  Ingle  (1187)  suggest  estimating  the  cointegrating  vector 
by  OLE  regressions.  The  properties  of  the  estisates  «*  «!}»* 

cointegrating  regression  heve  been  investigated  by  . 

Philips! 1987)  and  *  nusber  of  other  authors.  If  ther#  are 

colntegrated  stock  pricas.  there  would  be  an,  error  correction 
representation  of  at  least  «"•  of  *h««  *  whleh  «ould  b#  «*••!  for 
forecasting  purpose*.  In  an  afficlent  sarkat  •  a*  1#  wall  known,  such 
forecasts  are  of  n#  value.  '  ,  ...  ...  , 

In  this  study  ,  th*  sain  ala  I*  *h  eaprlcal  investigation  of 

cointegral  ion .and  error  c#rrection<EC)  In  the  context  of  stock  price* 
rather  than  driving  econoelo  condition*  under  vhleh  stock  pricas  ar# 
co integrated . 

T^  B.ti:  Th*  data  used  in  our  Invaatlgation  la  the  weekly  price*  of 
three  industry  baaed  coaaon  stock  portfolios  traded  in  Istanbul 
Stack  Exchange  far  th.  period  lliMI  <104  observations),  Th* 

particular  stock*  are  chosen  auch,  that  the  data  Is  contlnuoualy 
•va|] able  for  the  period  under  consideration.  Th#  Industrie*  *r* 
Che. leal  Industry  (CD  which  eonalata  of  five  stocks- 9*gfss. 

f gegubre.  lav.  Cubra  Fabrikalari.  lotus*  Tati*.  Financ#  and  Sinking 
Industry  <f*>  which  coaprls#  four  atook*.  Eciaolbasl,  Roc  Yatirl*.  Roc 
Holding .  T  l*b*«k**l(b>;  »*t*l  Industry  (HI)  vhleh  canalats  of  six 
stocks  Cellkhalat,  Irdaalr.  Fa***.  Rabak,  Sarkuysan  and  Oaalrdokua. 

ll.COIHTt«ATIOM  IX  STOCK  PM  CCS 


ERROR  CORRECTION  REPRESENTATION  OF  STOCK  PRICES 

Although  th#  teat  results  chow  that  th*  Boat  suitable 
cointegration  relation  occur*  when  FB  is  th*  dependent  variable  wa 
fora  an  ,  error  correction  repreaentation  for  ,all  th*  three 
cointegrating  vectors  above  so  thst  w#  sc#  clearly  the  relation 
between  an  error  correotien  and  eointegration  nechanisa  The  results 
are  presented  in  Table  Ill  V# ■  proceed  by  applying  a  general  to 
apeclfio  approach  and  reporting  only  th#  significant  lags  and 
variables 


TABLE. Ill  Results  of  orror  correction  astination 
Dap'.  Vara.  oCI  tit  ah  I 


tCI(-l) 

EF8(-I> 

EHK-I) 

F(-I) 

F(-3) 

H(»3> 

M{-4) 

Constant 

C* 


•0.19  (2.92) 


0.39  (3.86) 
-0.30  <-l,9> 
-0.32  (-2.1) 
0.01  (1. 37) 
0  22 
1,64 
0.06 


-0.02  <-0.43> 
0?S3  (5.32) 


0.27 

1.93 

0.08 


•0.28  (-2.46) 
0.02  (2.63) 
0,23 

1.44 

0.06 


voto  lv«m  w.  >«  twwxMai.  all  w*r««Maa  are  vn 

let*.  t*«  ver.eMec  era.  CHI  W  xci  »•*»•••** 
Ue  imAtU  *•*»»•#  f  *•—  |h* 

*4  **<  «  W  Ml  rere,ie*  en  TeM*. II.  v»  HK..I  here  ay 
»*  eaee.f.e  eiinuh 


In  accordance  vlth  our  expeCtattans  .  the  hypothec!*  #f  rtndoa 
walk  la  not  rejected  for  any  ©?  th*  price  aeries  The  test  statistics 
w*  used  are  Olekey  FuilarlDF)  and  Augaentad  0lck#y-rotl#r<A0r>  for¬ 
th#  ADr  last,  lag  length*  are  taken  up  to  four  and  •  general  to 

specific  approach  I*  followed.  .......  .  , 

For  tha  auper-cenai alency  result  of  St*ek<l1S4)  to  apply.  no 
subset  of  the  explenatory  variable*  akowld  be  colntegrated  Sa.  va 
have  first  investigated  vh.ther  there  art  any  Mtuaily  coinlegrated 
stork  price*  or  not  Reaulta  *r*  prasentad  Jin  Table  I  Regretting 
each  of  th*  pricas  on  aaother  gave  Cointeg rating  Regreation 
Durbin  Watson  (CRDWI,  DF.  AOF  atatiatlc*  which  veil  below  their 

critical  values  aven  at  a  101  significance  Uvel  Theae  lead  u*  to 

conclude  that  ther*  exlata  »o  paitvii*  eointegration  between  theae 
three  portfolio  price*  ......  ..  .. 

To  detect  any  cotntagraling  vector  which  bind*  tha  three  price* 
we  have  run  three  calntegratmg  regression  every  Use  saklng  one  of 
tk*  price*  th*  dependent  variable  The  result*  are  reported  in 
Table. ii  looking  at  the  raault*.  w*  see  that  in  all  cssea  ve  reject 
th-  null  of  *on*col8t*gi*tlon 


TABLE  1  Results  of  palrvia*  eointegration  regretiont 


Dep  Yars 

Cl 

r» 

Hi 

. 

1.38(27.26) 

1.06(46.36) 

-0. J8(-3.22) 
0.93 

Constant 

4.33  (22,24) 
0.7* 

0.88(3.60) 

0.66 

0.20 

Df 

•1.9»g 

•  2.32 

•  2.76 

AOf 

•2.33 

•1.99 

0,11 

0.14 

MOm  Iumm.  h.  in  )ww*H..i,  ail  .wwU.t  ere 


V#  use  th*  critical  values  or  AOr  and  CROW  tact*  for  th*  three 
variable  caaa  provided  by  Hal 1(1988),  and  of  DF  taal  for  th*  two 
variable  css*  provided  by  Xagl*  and  Granger  (1967)  Cosparad  to  th* 
oriticai  valvar  of  tha*#  taats  ■  th#  taat  statistic*  at#  hlgly 
significant  Tor  axaapl*  .  the  ragresaion  *f  18  on  Cl  8  HI  gav*  CROW, 
OF,  AOF  statlatics  which  *r*  slgairicaat  even  at  II  Ural  of 
significance,  Tortharsora,  all  th#  regreation  parssatars  ar*  higly 
slgnif icaot  supporting  th*  ratult  of  cointagratian  Tha  colntagration 
relatieo  *****  to  b*  th*  highest  when  th#  dependant  variable  la  FR 
sine*  R  »  CROW,  OF  are  highest  la  all  cases  coaparad  to  that  of  th* 
Other  two  $t*ck(  1984)  astahliahas  that  th*  aatisata*  of  th* 
cointagrating  regrtssloa  are  consistent  and  subject  to  a  satller  bias 
than  th*  CIS  regression  #r  stationary  aerial,  if  th*  series  ar# 
colntegrated.  rurtkeraare,  thl*  bit*  sees*  to  b#  related  to  th# 
overall  good#***  *f  fit  *r  th*  ragraaaloo.  and  so  th*  eointagrsting 
rtgtaasion  with  th*  highest  R  is  sipsctad  to  b*  subject  to  th* 
asallaat  bias  V*  nolle#  that  in  Table  It  th*  regression  of  FB  on  Cl 
8  HI  gav*  OF  statistic*  which  is  the  highest  coaparad  to  thst  of  th* 
Other  two 


|  ti»  nuu«i  »«W.«  •«  *  IM  leva*  *1  Wt  <**»•  lit 
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Th#  results  in  th*  above  table  clearly  suggest  that  th*  error 
correction  representation  is  supported  by  the  sari**  Tha  error 
correction  teraa  ar#  significant  in  two  of  th*  regressions,  naetly 
All  and  AC!  Th*  inaignif loanee  of  th*  error  tars  in  tha  AH! 

regression  taplias  the  w«ak  exogeneity  of  the  natal  industry  portfolio 
prices  In  the  *****  of  Engl*  and  Granger  (1987) 

To  be  abl#  to  avast  ion  weak  fora  pricing  efficiency  hove>ar 
the**  R-*ov*red  figure*  are  net  sufficient  Th#  value  of  a  forecast 
la  s  function  of  how  it  is  used,  as  wail  a*  how  precis#  it  is  So 
on*  ha*  t*  explain  th*  daoiaion  rule,  in  our  cat*  a  trading  rule 
which  eaploy*  the  forecasts  generated  by  the  error  correction 

representation  given  above  w*  us*  here  the  following  siapl# 
stretegy  In  case  the  expeoted  change  in  the  atook  a  pric#  is 

positive,  hold  100X  Of  your  wealth  in  that  stock,  otherwise  hold  100X 
sonar  which  Is  aaauaed  to  have  rer#  return  Transaction  costs  *r# 
•  sauced  to  be  negligible  This  tredlng  rule  it  emulated  on  our  data 
(ar  th*  *»•«  period  Ve  alaulete  the  wealths  of  three  different 
investor*  who  start  with  sll-fSiir  wealth  inverted  in  one  unit  of  one 

of  the  sector  portfolio*.  Each  #f  thea  is  aaauaed  to  follow  the  five 

trading  rul*  bated  on  the  error  correction  representation  of  th# 
sector  in  nutation  The  results  are  aunnariled  in  Table  IV  Alt  of 
our  three  investors  have  'beaten  the  aarket  in  th#  tana#  that  their 
strategy  ha*  provided  higher  ratura*  than  th*  buy  and  hold  strategy 
for  th*  two  years  Th#  axcaar  ratorn  is  th#  graataat  <31  2D  for  th# 
chealstry  industry.  Tha  natal  industry  which  va*  found  to  be  weakly 
axogsntous.  gav*  th*  lowest  excess  return  <5.3X>, 


Tabla  IV  Results  of  tzadlag  rule  simulations 


Starting  Wealth 

Chealstry  8.389 

Ending  war 
Buy  and  Hold 
9,421 

>lth 

Trade 

9,733 

Excess  Return 
(X) 

31,2 

Hetal 

T.672 

9.642 

9.693 

3.3 

f.i  Sacking 

9.170 

9.922 

10.037 

16.0 

X*tO  AU  v**t|he  *»•  vn  iwtwM 


That*  finding*  indicate  the  possibility  of  oointegrated  conson 
stock  pricas  Th*  rejection  of  the  weak  fora  pricing  efficiency 
however  doe*  act  readily  follow  Our  evidence  froa  tha  trading  rule* 
however  increase*  our  doubt*  on  tha  pricing  efficiency  of  the  Istanbul 
Stack  Exchange. 
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Dynamic  Oligopolies  as  _Neural  Networks 
by 

Ferenc  Szidarovszky 

Afraiasi 

Multiproduct  oligopoly  models  are  investigated 
under  the  additional  assumptions  that  if  the 
difference  between  the  present  and  profit, 
maximizing  outputs  is  too  small,  then  no  output 
exchange  is  performed,  and  if  the  same  difference  is 
too  large,  then  only  a  portion  of  the  desired  output 
change  is  performed  by  the  firms.  These  binary 
variables  transform  the  classical  oligopoly  models 
into  special  neural  networks.  The  stability  of  these 
dynamic  systems  is  analyzed  and  a  least  squares 
learning  procedure  is  outlined. 


t.  introduction 

The  theory  of  oligopoly  has  been  investigated 
very  intensively  by  researchers  in  mathematical 
economy.  The  existence  and  uniqueness  of  the 
equilibrium  in  oligopoly  markets  were  discussed,  for 
example,  by  Szidarovszky  and  Yakowitz  (1977,  1982). 
A  survey  on  the  different  variants  of  oligopoly 
models  and  their  properties  is  given  in  Okuguchi 
and  Szidarovszky  (1990).  with  a  comprehensive 
literature  summary. 

In  this  paper  a  new  version  of  single-product 
oligopolies  is  introduced,  where  the  mathematical 
model  can  be  interpret'd  as  a  special  neural 
network.  A  solution  m.ihodology  1$  suggested  to 
discuss  the  stability  of  the  resulting  dynamic 
system.  We  note  that  a  special  case  of  this  model  was 
investigated  earlier  by  Szidarovszky  and  Okuguchi 
(1988). 

2.  The  Mathematical  Model 

Let  n  denote  the  number  of  firms  without 
product  differentiation  in  an  oligopoly.  Let 
p  *  a  •  bQ  (  a>0,  b>0)  be  the  market  demand  function, 
where  Q  ■  2?-txl  is  the  sum  of  all  firm’s  output,  x  i 
is  the  Mh  firm’s  output  and  p  is  the  common  market 
price  of  the  goods  produced  by  the  firms.  We  assume 
that  the  market  demand  function  is  completely 
known  to  all  firms,  but  that  they  do  not  have 
complete  information  on  the  other  firms*  marginal 

cost  functions.  They  arc  therefore  obliged  to 

perceive  other  firms*  marginal  cost  functions 

which  are  consistent  with  the  past  information  on 
the  firm’s  outputs.  Let  the  /-th  firm's  marginal  cost 
perceived  by  the  Mh  firm  be  linear  and  given  by 

MCj  «  +  cj  +n^xj,  i  *  j,  j  *  1,2 . n,  where  cj  and  mj 

are  both  nonnegative  constants.  Assuming  that  the 
Mh  firm  thinks  that  at  any  period  the  /«lh  firm 
selects  its  profit  maximizing  output,  which  is 
assumed  to  be  interior,  the  marginal  revenue  and 

marginal  cost  are  equal: 

a  •  b  (xj  +  Q,)  -  bxj  =*  cj  +  mjx/i.j),  (*> 

where  Qj  ■  £fc*ixk  is  the  output  of  the  rest  of  the 


industry.  Let  yjbethe  Mb  firm's  expectation .  of  the 
/•  th  firm's  output.  At  time-’ a  when  the  Mb  firm 
forms  expectations  about  the  other'  firm’s  -  outputs  on 
the  basis-  of  its  perceived'  marginal  cost  functions, 
the  following  similar  equation  has  to 'hold: 

a  -  b  (x,(t)  +  2  yiW)  -  byj(t)  =  cj  +  njyj'i),  (i*j)  (2) 


On  the  other  hand,  at  lime  r,  outputs  of  all  firms  at 
the  previous  time  x.  v  1  are  known,  and  they  must  be 
consistent  with  (2).  Hence 

a-bMi-D+X  xt{t.l))-bx,(i-l)  »  tj  +  mjx/t-l)  Ci*j).  (3) 

M 

By  solving  these  equations  for  yj(t)  simple 
calculation  shows  that  the  expected  profit 

maximizing  output  x,(t)  of  firm  i  can  be 
expressed  as 

x,(l)  =  g,(x,(t  - 1),  Q,(t  - 1)),,  i  -  1,2,.  ...n.  (4) 

This  is  the  reaction  function  of  the  Mh  firm  derived 
on  the  basis  of  its  perceived  marginal  cost  functions 
of  all  other  firms  which  are  consistent  with  the 
information  on  all  firms-  outputs  from  the 
preceding  time  period. 


3.  The  Modified  Model 

Assume  next  that  each  firm  specifies  a 
positive  number  &i  »  which  has  the  following 
meaning.  If  the  profit  maximizing  output  (4)  does 
not  differ  from  the  output  *i(t  ■  1)  of  the  preceding 
time  period  more  than  8i»  then  firm  i  docs  not 
change  its  output.  It  is  also  assumed,  that  each  firm 

specifics  another  positive  constant  As  >  6j.  which  is 
the  maximal  allowed  change  in  its  output  during  a 
single  time  period.  The  first  assumption  can  be 
imetpreted  as  very  small  changes  arc  not  made, 
since  the  costs  of  changing  the  output  at  all  are 
higher  than  the  loss  by  selecting  suboplimal 
outputs.  The  second  assumption  means  that  the 
changes  in  the  output  during  a  single  time  period 
are  bounded.  Based  on  these  additional  assumptions 
the  following  dynamic  process  is  obtained: 

Let  Xi(0),X2(0) . xa(0)  denote  the  initial 

outputs  of  the  firms.  For  t£l.  define 
Xj  (t)  **  g»(xj(l  •  l)».Qi(t  ■  1))-  Then  the  output  of  the  /- 
th  firm  at  time  period  x  is  given  as 

X.W  i  x,0-l)-5lSx,’(l)Si1(t-l)iSi 

\‘(0  itaiw  j,fll)-4,Sj,’©<x1(HK 

j,®  ,  i  «  x>Q-l)*8i<il,0Si1(M)*il 

if  *,'(t)Xi(t-l)*4 

(5) 
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The  global  asymptotic  stability  of  this 
dynamic  system  can  be  discussed  in  a-  similar 
manner  as  it  is  documented  in  Liu  and  Szidarovszky 
(1991),  and  by  using  the  -  theory  of-  perturbed 
contractions  (Szidarovszky.  and  Yakowitz,  1978). 

This  model  can  be  easily  accompanied  with  a 
special  learning  process  on  the  unknown 

parameters  ejand  mj  Assume  first  that  at  each  time 
period  t,  each  firm  measures  the  outputs  of  all  other 
firms  at  all  preceding  time  periods.  Then  from 
equation  (3)  each  firm  has  the  time  series 

Vj(t-r)  =  c5  +  m)  x;  (t  -r)  for  r  =»  ,.frora  which  the 

values  of  ejand  m*  can  be  obtained,  for  example,  by 
using  a  least  squares  method.  Since  at  each  hew 

time  period  a  new  value  Mft)a<5+mWrt  is 
computed  from  equation  (3),  the  parameter  values 
can  be  updated,  and  these  updated  values  can  be  used 
in  forming  the  new  expectations.  If  the  individual 
outputs  can  not  be  measured,  only*  the 

Qi(t-r)  values,  r»  1,2,.. „t,  then  the  following 

procedure  can  be  proposed.  By  using  equation  (3) 
and  the  definition  of  Q(t)  an  ovcrdctcrmincd 
nonlinear  equation  system  is  obtained: 

. >-  (6) 

The  solution  of  this  overdetermined  system 
can  be  obtained  by,  for  example,  a  nonlinear  least 
squares  method. 

3. _ Conclusions 

The  above  dynamic  model  is  a  typical  example 
for  a  neural  netwoik.  The  two  major  features  of  this 
model  are  the  adaptive  learning  procedure  and  the 
presence  of  the  binary  variables  determining 
which  case  is  to  be  used  in  obtaining  the  new  values 
of  x>(t)  in  relation  (5). 

The  dynamic  system  discussed  in  this  paper  is 
a  generalized  version  of  earlier  works  in  oligopoly 
theory. 
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THE  POSSIBILITY  OF  OUTPUT  FLUCTUATIONS  IN  DUOPOLY  MODELS 


PROFESSOR  CARL  CMARELLA 
SCHOOL  OF  FINANCE  AND  ECONOMICS 
UNIVERSITY  OF  TECHNOLOGY,  SYDNEY 


1.  Introduction 

The  literature  oh  the  stability  of  the  .Cournot, model,  in  both 
discrete  and  continuous  time,  is  one  of 'the  richest  in' economic 
dynamics.  See,  eg.  Okuguchi  and  Szidarovsky  (1990).  In  recent 
years  the  advances  in  the qualitative  ,  theory  of  differential 
equations  and  the  theory  of  nonlinear  dynamical  systems  (see  eg. 
Arnold  (1978)  and  Guckenheimer  and  Holmes  (1983))  have  led  to  a 
consideration  of  the  dynamic  behaviour  of  the  Cournot  model 
when  the  equilibrium  point  is  locally  unstable.  The  key  studies  in 
this  vein  are  Seade  (I960),  Al-Nowaihl  and  Levine  (1985)  and  Furth 
(1986).  In  the  died  works,  quite  general  Cournot  oligopoly  models 
were  considered  and  some  quite  general  qualitative  results  about 
the  dynamic  behaviour  were  obtained. 

Our  aim  in  this  paper  is  more  modest.  Wc  consider  a  duopoly 
model  in  its  region  of  local  instability  and  explore  the  possibility 
that  output  tends  to  limit  cycle  motion.  Our  discussion  here  is 
necessarily  brief,  however,  a  more  detailed  examination  of'  limit 
cycle  motion  in  duopoly  models  is  contained  In  Chiarella  (1991)-** 

2.  Some  Possible  Nonlinear  Mechanisms 

To  obtain  sustained  output  fluctuations  in  duopoly  models  wc  will 
need  to  introduce  time  lags  and  nonlinearities  into  tiie  standard 
formula.  Time  lagt  can  be  introduced  as  In  Chanter  7  of  Okuguchi , 
(1976)  where  firms  adjust  their  output  to  desired  output  with  a  lag 
and  use  an  adaptive  expectations  scheme  to  form  their  expectation 
of  the  other  firms  output.  A  range  of  nonh/uaritiet  Is  possible 
and  include>  (a)  Introducing  cubic  and  higher  order  terms  into 
the  cost  function,  and/or,  introducing  Quadratic  and  higher  order 
forms  into  the,  demand  .function;  (w  introducing  constraints  on 
output  adjustment  Suppose  for  example  that  x,  is  the  desired 
output  of  firm  i  and  that  the  firm  adjusts  to  the  desired  level  with 
a  lag  according  to:  x,  ■  k,  (x,'  -  x,), ,  k,>0.  However,  it  may  be 
costly  for  the  firm  to  make  large  output  changes.  Indeed  output 
changes  above  or  below  certain  levels  may  be  impossible.  This 
effect  could  be  captured  by  assuming  a  relationship  between  x,  and 
x,  as  shown  In  Figure  1, 


Figure  2 

(c)  An  alternative  way  to  capture  the  effect  In  (b)  would  be  to 
iiteiudc  an  x*  term  in  the  cost  function,  eg. 
x//2  +  e,  x,VS,  (Q>AtiO). 

3,  Simple  lag  Structure 

In  order  to  perform  an  initial  analysis,  let's  assume  a  nonlinearity 
of  the  type  (a)  abow.  Suppose  demand  is  given  by 

p  «  a  •  b  (xj  +  xj),  (b>0)  (1) 

and  the  cost  function  for  firm  i  is  given  by 

QOO  ■  c,  x<  ♦  d,  x<72  ♦  ^  x»/3,  (ctO/ipOt#))  (2) 

The  profit  function  for  firm  1  is: 

n(x„  x2«)  -  la-br  x,  +  x«)]  x,  *  C,  (xt)  (3) 

where  x2w  is  firm  l's  expectation  of  firm  2's  output. 

The  profit  maximizing  condition  9FL  -  0  implies: 

(2b  *  d,)  x,  ♦  ti  x,1 » (a- c,) - b x2w. 


from  which 

Xi-SiW*!  -07 

where  g,  is  a  downward  sloping  ..reaction' function,  which  is 
displayed  in  Figure  2.  Similarly  from  the  profit  maximising 
condition  of  firm  2,' we  find  that: 


(2b  +  d,)  x,  ♦  ^  » I*  “ 
ie.  Xj.gjIxjWj 


(6) 

(7) 


This  reaction  function  is  also  illustrated  in  Figure  2.  We  assume 
that  each  firm  adjusts  to  the  desired  level  of  output  with  a  lag,  te. 

(8a). 

*r  ■  ^2  Igi  (xiW) "  *»!•  (8b) 

where  the  k,  (>0)  (i  ■  1,2'  are  speed  of  adjustment  parameters. 

Wc  assume  that  each  firm  forms  its  expectation  of  the  other  firm's 
output  adaptively.  Thus,  for  firm  1  we  would  vmte: 


x2w  (t)  -  JL  V 
T2  >o 


x/s)  ds. 


(9) 


where  T2  >  0  may  be  interpreted  as  a  time  lag  in  the  formation  of 
expectations.  Equation  (9)  may  be  expressed  in  differential  form: 


(10) 


The  dynamics  of  the  economy  arc  then  governed  by  the  four* 
dimensional  differential  system: 


(11a) 

(lib) 


*1  •  Yt  ta  •  Hi).  *1  •  li  Igj  (*i">  -  Hi). 

i,"-i  i», •*.“). 

*1  7, 

4,  Analvsi*  of  the  Dynamics 

The  equilibrium  point  of  the  differential  system  (11)  is  given  by: 

T|  ■  gj  (X2),  Yj  ■  gj  (Xj),  (12a) 

x,w  -  X„  Xjw  •  Xj.  (12b) 

The  local  stability  properties  of  the  model  arc  determined  from  the 
Jacobian  of  the  differential  system  (2),  which  is  given  by: 


0  kiS/ 
fcjgi'  0 

d  o 


(13) 


To  get  a  feeling  for  how  the  four  dimensional  system  is  behaving, 
consider  the  special  case  in  which  both  firms  are  perfectly  identical 
so  that. 


x»«  x2«x,g,-g2«g. 

Then  we  need  only  consider  the  two  dimensional  system. 

x  ■  kfg(x,)-x)  and  x*-  1  (x-x*), 

T 

which  has  the  simpler  Jacobian  matrix 

h  - 


'■  i  i 


We  see  that 


(4) 


T  r 


(14) 


(15) 


(16) 


(17) 
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(28) 


So  the  equilibrium  b  stable- for  all  Tr»  0.  Hence,  no  output 
fluctuations  will  occur  in  the  model  wnen  the  two  firms  become 
identical  In  this  preliminary  study,  we  do  not  consider  the 
stability  properties  when  the  two  firms  are  allowed  to  differ. 

5.  Higher  Order  Lag  Structure 

Now  let's  introduce  the  higher  order  lag  structure: 

x«„-  (m  {•**  Ms>-x«<s)]ds,  (18) 

Jo  T 

The  expectations  mechanism  in  (18)  (with  m  >  0)  takes  past 
differences  between  *•<»). and  x(sK(sSt)  and  weights  these  with 
exponentially  declining  weights  from  t  to  0.  This  scheme  is  a  way 
of  capturing  approximately  in  continuous  time,,  the  -  lagged 
expectations  scheme: 

x(t>.mfx  (t-T)  -  xfe>  (t*T)]. 

It  can  be  shown  that  in  the  limit  T  0,  equation  (18)  yields  the 
standard  adaptive  expectations  scheme; 

x^O-mMO-x^t)]. 

In  (19)  set 


y<t)-f  £♦*:  fx(*>-x«<i)J<b,  09) 

ro  T 

which  satisfies 

Ty(t)  ■  lx(t)  •  x  w(t))  -  y(t).  (20) 

Wc  then  replace  (19)  with  (21)  and 

x«(t)  -  my(0.  (21) 

If  wc  consider  the  case  of  identical  firms,  wc  have  the  three 
dimensional  system: 

x.kIg(x*)-xL  (22a) 

x*  *  my,  (22b) 

y  ■  i  MO-x'dMyd).  (22c) 

T  T 

The  Jacobian  of  this  differential  system  Is: 


*k  kg'  0  ' 

J,  ■  0  0  m 

-Lid. 

T  T  T 

The  characteristic  equation  is: 

X*+(k  +  JJX*  +  (m  ■»  k)  X  •  m  k  (g*  - 1) »  Ol 
T  T  T 

Note  that  the  product  of  the  roots  is; 

X,  Xj  X,  ■  ink  (g'.-  1)  <  0. 

T 


(23) 

(24) 

(25) 


In  order  to  apply  Hopf  bifurcation  theory  we  need  to  establish  the 
existence  of  a  pair  ot  pure  imaginary  roots,  X  ■  iw.  In  such  a 
case,  w  would  nave  to  satisfy: 

•iw*  •  (k  ♦  Jjj  w1  ♦  i  (m»k)w  •  mk  (g*  - 1)  *  0  (26) 

T  T  T 


ie,  wc  would  require: 

w*  ■  m»k  and  w*  »  »nk  (1-g* )  (27) 

T  kT+i 


Equating  the  two  expressions  for  w2,  we  find  the  value  of  T  at 
which  pure  imaginary  roots  can  occur,  viz. 


T  -  -<m  +  K) 

+  k). 

To  have  X  >  0  we  would  require, 

k  <  -  mg'.  (29) 

The  conditions  (29)  poses  an  u piper  limit  on  the  speed  adjustment 
of  output  to  desired  output  Differentiating  (25)  with  respect  to  T 
and  setting  T  *  TT  reveals  that 

r  -  y  >  o  CO) 

IStJm.  2T?  [w*  +  <k  ♦1/D‘l 

Thus  by  the  Hopf  bifurcation  theorem  (see  eg.  Guckenhetmer  and 
Holmes  (1983))  we  can  assent  the  existence  of  a  limit  cyde  for  T  in 
the  neighbourhood  of  T  ■  T\  'Since  the  R(X)  increases  as  T 
increases  through  T*  the  model  displays  the  dynamic  behaviour 
illustrated  in  Figure  3. 

T<r  T>r 


6.  Conclusion 

We  have  seen  that  by  introducing  nonlincaritics  Into  firms'  cost 
functions  and  allowing  for  a  sufficiently  high  order  lag  structure  in 
the  formation  of  expectations,  it  is  possible  for  the  duopoly  model 
to  exhibit  fluctuating  output  in  the  form  of  a  limit  cycle. 

Further  research  could  consider  the  situation  when  the  two  firms 
arc  not  identical  and  extend  the  analysis  to  oligopoly  models. 
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I.  INTRODUCTION 

Until  quite  recently,  it  Has  been  assumed  in  most 
works  on  the  global  stability  of  the  Cournot 
oligopoly  equilibrium  that  the  rates  of  change  of 
actual  outputs  are  sign-preserving  functions  of  the 
divergences  between  the  expected  profit-maximizing 
and  actual  outputs.  An  alternative -output  adjust¬ 
ment  system  uay  be  formulated,  where  the  firms  are 
assuaed  to  -increase,  decrease  or  do  not  change  their 
actual  outputs  depending  on  the  signs  of  the  margin¬ 
al  profits  with  respect  to  their  own  outputs. 

In  this  paper  we  will  show  the  stability-wise 
equivalence  of  the  two  dynamic  output  adjustment 
systems.  We  also  introduce  an  alternative  adjust¬ 
ment  system  where  outputs  will  remain  nonnegative, 
and  prove  the  global  stability '  under  strict  monoto¬ 
nicity  of  the  marginal  revenue  functions. 

II.  COURNOT  DUOPOLY 

In  this  section  we  consider  Cournot  duopoly  with 
product  differentiation,  tot  Pj"f  and  C^(Xj) 

be  the  inverse  demand  and  cost  functions  for  the  l-th 
firm,  Pj  and  Xj  be  the  price  and  output  of  the  i-th 
firm,  where  i-i,2.  The  l-th  firm’s  profit  is 
given  by 

(1)  l  x1fi(x1,Xj)-Cl(xi),  ifj, 

Assumption  1.  8f VdXj  I  <  0»  f  a  ,C  for  (x^,x2) 
«  JOpXjMO.Xj),  where  f^Xj.Xj)  -  0  for  *  Xj  and 
for  any  x^,  i?J,  i»J»l,2. 

Assumption  2.  The  second  order  condition  is  satis¬ 
fied  for  the  two  firms,  that  is, 

(2)  aV/9*j  «  <  0.  1-1,2. 

The  cross  partial  derivative,  i|j,  llM.  however, 
may  take  any  sign.  If  two  goods  are  substitutes, 
fj<0  but  the  sign  of  f Is  indeterminate. 

The  firms'  actual  outputs  are  assumed  to  be  adjusted 
over  time  according  to  (3a)  or  (3b): 

(3a)  dXj/dt  -  gjtS.j/fcCj),  8l(0).0.  gj>0,  1-1,2, 

(3b)  dXj/dt  -  hjtyj-x,),  hjW-O,  hj>0,  1-1,2, 
where  y^  defined  by  (5)  satisfies 
(A)  fHyj+x^+yjf^yj+x^-C^yj)  -  0,  liM,  1-1.2. 
(5)  y,  i  »S(Xj),  i#j,  1 ,3-1.2, 

First,  we  derive  for  the  adjustment  process  (3a): 
(6*1)  dx2/dxj  -  |/‘b12  for  dxi/dt*°» 

(6*2)  dx2/dXj  -  -s2j/o22  for  dx2/dt*0. 


In  view. of  (2),  (6*1). and  (6-2),  the  following  five 
cases  'are  possible. 

Case_l.  vj2<0,  «2l<0,  ,,ll’22>*12,,21' 

Case_2.  «}2<0.  *21<0*  *11*22^12*21' 

Case_3.  «}2>0.  ’21>0*  *ll*22*,12*2r 

C«se_A.  *}2>0,  ’21>0,  *ll*22<’l2*21‘ 


We  can  easily  observe  on  the  basis  of  the  phase 
diagrams  that  the  Cournot  equilibrium  is  globally 
stable  in  Cases  1,  3  and  5,  but  unstable  (saddle 
point)  in  Cases  2  and  4,  and  that  no  cycle  arises. 
That  there  arises  no  cycle  has  been  observed  earlier 
by  Furth  (1986)  for  Cournot  oligopoly  without 
product  differentiation. 

We  now  consider  the  alternative  adjustment  process 
(3b).  Taking  into  account  hj,>0  and  «]^<0,  WQ  8et 
sgn  dhj  ■  sgn  i"l»2« 

This  shows  the  stability-wise  equivalence  of  the  two 
adjustment  systems  (3a)  and  (3b). 

III.  COURNOT > OLIGOPOLY  WITH  BOUNDARY  CONDITION 
Let  there  be  n  firms  producing  differentiated 
goods,  and  let  p^  1  f*(Xj,**  •  ,xn)  and  Cj(x^)  be  the 
i-th  firm’s  price  and  cost  functions,  where  x^  is 
the  i-th  firm's  output.  A  finite  positive  number  Xj 
has  a  similar  property  as  in  the  preceding  duopoly. 
Lot  Assumptions  1-3  be  valid  also  for  oligopoly, 
where  Is  now  defined  as 

(7)  ,j  -  XjdUj.x.jJ-C^x,),  1-1,2, ■■■,!>, 

where  x.j  -  <*1.,">x1.i»x1+1.'">xn)T- 

Assumption  A.  There  exists  a  unique  Interior  Cournot 

equilibrium. 

The  boundary  condition  is  stated  as 
Assumption  5.  3^/ax,  >0,  for  x,-0  and  any  x.j,  and 

3it/3Xj  <  0,  for  Xj-Xj  and  any  x.j,  1-1,2, •••,n. 

As  in  Section  2,  ve  consider  two  alternative  output 
adjustment  systems  for  i*»l»2,***,n 

(8a)  dx,/dt  -  SjO.j/SXj),  8[(0)-0.  gj>0, 

(8b)  dXj/dt  -  h^yj-x,),  b((0)-0,  hj>0, 

where  yjl^x.j)  Is  the  solution  of 

(?)  f1<yl.*_1Hy1fJ(y1-*.i>-cl<yi)  * 0> 

Assumption  6. 

(10)  .{^1x1,1  <0.  M-1'-- 

Under  this  assumption  and  gj(0)>0,  It  is  known  that 

Jg,  the  Jacobian  matrix  of  the  adjustment  system 
(8a),  becomes  a  matrix  with  a  negative  dominant 
diagonal.  Hence  tho  global  stability  of  the  Cournot 
oligopoly  equilibrium. 

Consider  next  (8b).  The  Jacobian  matrix  of  (8b) 
has  a  negative  dominant  diagonal  If  and  only  if 


i 


1  >  £  |*‘|  -  £  —  ,■>, 

J  jti  J 

Hence. the  Cournot  oligopoly  equilibrium  is  globally^ 
stable  under  Assumption  6  for  (8b),  This  estab¬ 
lishes  the  stability-wise  equivalence  of  the  two 
adjustment  systems  (8a)  and' (8b).  Note  -  that  because 
of  the  boundary  condition,  outputs  remain  nonnega¬ 
tive  throughout  adjustment  periods. 

Rewrite  (10)  for  i"l,2,"'*»n  as 

<11. 1)  1  +  E  >  0. 

It  is  clear  that  (11,2)  is  satisfied  if 

(11.2)  jfj+x1fJ1|/(2fJ+xif{1-C'l‘)  >  -i/(n-l). 

Let  MR*  S  3(Xjf*)/3Xj  be  the  marginal  revenue  (MR)  of 
the  i-th  firm  with  respect  to  change  in  Its  output, 
hj  *  3MR*/dXj.  Rewrite  (U.l)  as 

(11.3)  MR?  +  E  |MR)|  <  C',',  1*1*2, ,n. 

1  3H  J  1 

If  the  demand  functions  are  all  linear,  and  if,  all 
goods  are  substitutes  and  Cj»0,  (U.l)  reads 

(11.4)  f}  <  E  (j,  1-1,2, •••,«. 

IV.  ALTERNATIVE  ADJUSTMENT  SVSTEH 

Let  all  assumptions  in  Section  3  are  valid  other 
than  Assumption  5  and  6.  Outputs  might  become 
negative  during  adjustment  periods  in  the  absence  of 
Assumption  5  if  they  change  according  to  (8a)  or 
(8b).  The  following  adjustment  system  Is  introduced 
to  avoid  this  possibility. 

(12)  dXj/dt  ■  max(Xj+sk^g*(x),0)-Xj,  M,2,***,n, 
where  s  and  kj's  are  positive  constants  and  g*(x)  * 
SiVdXj  for  all  l. 

Assumption  (6)  is  now  replaced  with 
Assumption  7.  g(x)  •  (g*(x), 4 4 4 ,gn(x))T,  X  «  K  * 
a(0,x. )  is  strictly  monotone. 

I 

This  assumption  is  equivalent  to 

(13)  (x'-x2)T<8<xl>-8 <x2))  <  0.  xW.x'.x2**. 

The  proof  of  the  stability  consists  of  two  steps. 
Step  Is  Let  x*  bo  the  unique  interior  equilibrium, 
and  let  a  Lyapunov  function  be  given  by 

(U)  V(x)  I  (x-x*)TK(x-x*>,  K-dl»80=I!."-.k*1). 

Let  U«K  bo  an  open  neighborhood  of  x*  with  suffi¬ 
ciently  small  raolus.  From  x*  *  sK~*g(x*)  >  0  and 
continuity  of  g, 

x  +  sK~*g(x)  >  0,  x«U, 
dx/dt  *  sK’*g(x),  xoU. 
dV/dt  «  s(x-x*)T(g(x)-g(x*))  <  0,  x^x*,  x«U. 
Step_2;  There  exists  a  positive  number  L  such  that 

S  k,e‘(x)2  S  M.  lot 
1  T 

0  I  sup  (x-x*)‘(g(x)-g(x*))  <  0. 
x«K\U 

From  (12), 


Xj  *  dx^/dt  >  xi+skigi(x)-xi  «  skig(x), 
x*TK k  >  sx*Tg(x). 

Xj+Xj  »  max(xj+skig*(x),0):i  0, 
x^+2xixi  <  x^+s2k^g*(x)2+2sk^g*(x)xi, 

xTKi  S  s2  E  k.g*(x)2/2  +  sxTg(x). 

I  1 

Hence  taking  into  account  g(x*)»0,  we  evaluate  dV/dt 
for  x«K\U  as  follows: 

dV/dt  *  xTKx  -  x*Tkx 

Ss2M/2  +  sB 

-  (H+2(J/s)s2/2. 

Since  0<O,  we  conclude  from  this  that  dV/dt<0  if  s 
is  sufficiently  small.  Steps  1  and  2  together  prove 
that  dV/dt<0  for  x*j*x«K.  Hence  the  Cournot  equilib¬ 
rium  is  globally  stable  for  the  adjustment  system 
(12)  if  s<-23/M, 
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ABSTRACT  Ibis  paper  investigates  tdretlier  and  to 
what  extent  the  presence  of  risk  aversion  affects 
the  welfare  implications  of  information 
transmission  in  one  of  tl>c  tost  fundamental 

oligopoly  ixxiels  - -  a  Cournot  duopoly  model 

with  cannon  demand  uncertainty.  It  ains  to  make 
a  bridge  between  the  literature  dealing  with 
Infomation  sharing  in  oligopoly  and  (!«  coo 
discussing  the  firm  under  uncertainty.  Ve  can  show 
that  the  average  output  level  of  each  firm  is  quite 
sensitive  to  the  type  and  amount  of  information, 
and  that  the  presence  of  risk  aversion  has  on 
effect  of  decreasing  the  welfare  of  firms,  wlrcnce 
infomation  transmission  may  sometimes  be 
harmful  rather  than  beneficial  to  risk  averse 
flrnr.  These  results  have  sane  policy  Implications. 

I.  A  Ikwpoly  Model  with  Risk  Aversion 

We  ossrme  tlwrt  the  market  demand  functions 
of  outputs  are  given  by 

Pi  =  rr-/?(xi+djT2), 

P2=rt*-/J(t2+0jri). 

xj  and  X2  ore  substi tules,  independent,  or 
complements  according  os  0  Is  positive,  zero,  or 
negative.  «  is  a  random  variable  whore 
distribution  is  normal  with  mean  /<  ami  variate  »r* 

Letting  III  denote  tlic  profit  of  film  «•» 

obtain 

Ui~  {a-CC~  Xi~  Oxj)xt  (hja  1,2;  i*  j). 

Each  firm  is  assumed  to  maximize  tire  ex|>cctcd 
utility  of  its  profits.  ^  suppose  Ural  fh„  f 
the  following  von  Neumann  >Mor  gens  tern  nil  lily  ; 


///(///)  =  a- b  exfA-  RtUt)  <b>0,  Rt>0;  1=  1.2). 

Vo  foens  on  tlie  following  three  types  of 
information  slinetuix-r  (i)  no  information,  written 
n«  »J°,  vtiere  no-firm  Ims  information  ulxxil  the  (Icoaid 
immmelcr  «:  (ii)  nonsyiineli  ic  information,  denoted 
by  >/v.  wiion*  Him -I  K  informal  of  rt  but  firm  Z  is 
not  so  informed;  and  (3)  sliared  information,  shewn 
by  >?*.  wtiere  l>olh  films  con  know  «. 

We  <ny  tlwt  a  pair  <rw,,  rwj>)  of  output  strategics 
Is  an  «M|iiS ) ihriiai  iviir  under  'P  If  for  each 

r°ts  Aiq  War  U,j=  l,Zi  i*j). 

x  i 

an  output  (>air  (x^i <<?).x*2>  is  said  an  equilibrium 
pair  under  >?v  If  for  each  <r, 

rvi(a)=  Aio  Hat  J/il/M<7,x*.-rv2)3 

xt 

x*?.a  A>o  >/<«■  (u).^)). 

x* 

Similarly  an  output  i»alr  (A'5! (al.-r^ia))  is  called 
an  eunlllbrlrw  pair  «n<tat  »/*  if  for  each  cf. 

xs t (u) «*  /h o  Mtw  (a, x t. Xs j (a) )  J 

x‘  <i.^=i.si  i*j>- 

?•  ('rxipn  risen  of  Expected  Oil  puts 
IIIIXXMJl  1 

(1)  If  <»«.  tlc*u  F.\*i> R.\S\>J°\  and  fjJ,2>xv2>.t°2: 
(il)  If  0-  0,  lluni  f.rr|5f.rvi>A<>i  and  Fxsz>  xN z*x?zl 
(5)  If  <K0.  linen  £ry»>£tJV»>A°t  and  F.xsz> x°z> 

In  rig' lie  l  i»oinls  (P ,  (p,  and  (p  represent 
Cournot -Rush  cqtillihf  h«r  tiixler  >P.  ip,  and  ip, 
ivspcc  lively. 

lo  s«*i  up.  tlio  production  activities  a  of  risk 
averse  films  mo  ^I’iaKly  affected  by  the  type 
and  nmrxuit  of  infoimntiou  acquisition  and  by  the 
rlegi*'**  «f  substitutability  lie  tween  goods.  Tills  is 
in  eruked  coni  past  with  tin'  risk  neutral  cose  In 
which  (Ik*  avetnge  outputs  remain  rmscatlicd  by 
imy  clwuige  of  Information  legaixllcss  of 

tlio  iJegice  of  technical  substitution. 
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(ii)  wfiere  I  Ik*  equality  liolds  iff  0=0. 


? 

* 


Ex  2 


3.  Ccei|*orisou  of  Hxj>cclcd  Uti lilies* 

TllEWilU  z  OP  versus  ip) 

EtPi*.EU°t  <=*  //O?.<XZ,0){<*7(I+  2/>/<2+0)2'|  (A-c)2, 

//(fctf«.0)a  [zp*  (2+0)2jfp+2+4?JV(0^>>z. 

As  is  seen  in  Figure  2,  IIkj  value  of  Information 
is  negative  when  (p-c)  is  sufficiently  large 
and/or  when  goods  arc  cither  foirly  strong 
substitutes  or  fairly  strotig  «*«>lewnts. 


M-C 


Figure  2.  Coeval risoti  of  rxjx'Ctod  tlti lilies: 

Mo  Infoimatinu  vs.  Huircd  Inhumation 

TIIEOflOi  3  Op  versus  ip) 

0)  <=*  7(R. a2. 0)1 09O +/*/?) |  </<-c)?. 

VdMJl-O  J(Ri0^t  0) 3 


A<  wjis  nolod  I  lie  iiucctmIous  si  illation  under 

which  H/v i  is  less  Hun  ttP i  occurs  if  (p-c)  is 
snff ieiciillv  laige  nnd  tin*  absolute  value  of'0 
is  fuiiiv  large. 


f 

I 


IJiljMim  4  Op  vorsuc  »/v) 


0(4  +  0)/>  |> 

<2+0>*<P+?)R 


(/<-  c)2. 


uIk'jx*  A(M,o?,  0) 

=  (-i)<ptd)l(pi<»2t?<’JI  (z-i»(>*2(?40)]2/ 

(<«»>  2l  (pi  0)  (p-n)pm(l  id)  J. 

(ii)  Hf5?|///Vp 

wtioix*  /.(M,u?,0)3  i ?/*♦  (?♦<)) ?J |  (?-0>0*2(2*0)l2/  (02Ru)2. 


Here  again.  Uw  values  of  (/<*o)  and  0  play  a  vital 
role  in  deciding  the  welfare  inpact. 


4.  Concluding  Remarks 

We  helieve  that  the  welfare  results  obtained  so 
far  have  sane  policy  inpl ientions  regarding  the 
effectiveness  and  liaiilntions  of  Industrial 
policies  r«4>loye<l  by  tl»e  govcnuient  wlicn  firms  in  on 
iiKtustiv  display  strong  risk  avoision. 

Hieix?  ixiuiin  sonic  other  din'd  ions  in  which  our 
study  may  Ik?  furtlier  pnisued. 
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Competitive  markets  are  characterized  by  a 
large  number  of  economic  agents,  at  both 
(supply  and  demand)  sides  of  the  market  and 
such  that  no  agent  has  enough  pover  to 
manipulate  prices-  That  is  all  agents  are 
price  takers. 

In  the  segual  we  will  define  an  economy 
through  a  set  of  agents.  Each  agent  has  a 
type,  characterized  by  a  vector  of  initial 
endowments  (from  a  fixed,  finite  dimensional 
commodity  space)  and  a  continuous,  monotone 
and  concave  utility  function. 

Aunann  (1966)  considered  the  case  where  the 
set  of  agents  was  a  continuum  and  proved  that 
for  this  case  the  set  of  Walras  allocations 
and  the  Core  coincide.  This  is  the  "Core- 
equivalence"  theorem. 

Nowadays  this  theorem  is  often  proved  by  a 
proposition  by  Anderson  (1978). 

Consider  a  finite  economy  (when  we  speak  of  a 
finite,  countable  or  uncountable  economy 
those  adjectives  refer  to  the  set  of  agents). 
We  know  that  the  Walras  allocations  are  in 
the  Core.  For  each  core  allocation  we  can 
determine  the  distance  from  the  nearest 
Walras  allocation.  The  supreme  of  those 
distances,  taken  over  all  core  allocations, 
is  the  distance  between  the  Core  and  the 
Walras  equilibria.  Among  others,  Anderson 
shows,  that  this  distance  goes  to  zero  when 
the  number  of  agents  goes  to  infinity.  This 
does  not  imply  that  in  the  limit  the  core  and 
the  Walras  equilibria  coincide,  because  both 
sets  could  be  empty. 

The  existence  of  Walras  allocations  is 
easilly  proved  in  the  case  of  replica 
economies  8  la  Debreu  and  Scarf  (1963) .  This 
is  due  to  the  "equal  treatment"  property  in 
the  Core.  All  replica  of  a  finite  economy 
with  a  finite  number  of  types  share  the 
"same"  Walras  equilibria.  So  also  the  limit 
economy  has  the  same  Walras  allocations,  that 
in  the  limit  coincide  with  the  Core. 

For  economies  that  *,re  not  replica  of  an 
economy  with  a  finite  number  of  types,  the 
"equal  treatment"  property  does  not  hold  and 
above  result  is  more  difficult  to  prove.  Tho 
way  to  prove  the  Core  equivalence  theorem  in 
this  case  is  with  competitive  sequences  of 
economies  (see  Hildenbrand  (1982)).  One  of 
the  conditions  is  that  the  distribution  (a 
counting  measure)  of  types  of  agents  in  the 
finite  economies  converges  to  a  distribution 


(a  measure)  of  agents  la  the  limit  eccaacry, 
that  has  a  ccatimsa  of  agents.  Jiost  proves 
of  the  core  equivalence  theorem  for  large 
economies  use  measure  theory  ca  a  continuum 
of  agents.  As  this  world  is  finite,  in  my 
view,  this  is,  with  the  continuum  hypothesis, 
one  step  too  many. 

Approximations  of  large  economies  should  be 
dealing  almost  with  countable  infinite  agents 
sets. 

Therefore  Z  consider  directed  systems  of 
(finite)  economies.  That  is  there  is  a 
directed  index  set,  that  is  a  set  with  a 
partial  ordening  (reflexive  and  transitive) 
such  that  for  all  two  indices  from  that  set 
there  is  a  third  larger  than  both  of  them. 
When  of  two  indices,  one  is  larger  than  the 
other,  we  have  an  embedding  of  the  economy 
with  the  smaller  index  into  the  one  with  the 
larger. 

The  inductive  limit  is  defined  in  the  usual 
way.  The  limit  economy  nay  have  a  countable 
infinite  agents  set.  The  following  Theorem 
can  be  proved: 

Theorem:  The  Core  of  the  inductive  limit  of 
a  directed  system  of  finite 
economies  is  non-empty. 

This  generalizes  the  Debreu-Scarf  result 
above,  as  replica  economies  form  a  directed 
system  of  economies,  but  also  the  case  of 
(non-replica)  economies  with  a  finite  number 
of  types  can  be  treated  this  way.  The 
resakable  fact  is  that  in  the  proof  of  above 
Theorem  nu  use  is  cade  of  measure  theory. 

As  each  economy  with  a  countable  infinite  set 
of  agents  is  the  inductive  limit  of  a 
directed  system  of  finite  economies,  also  the 
following  Corrolary  has  been  proved:- 

Corrolarv?  The  Core  of  a  countable 

infinite  economy  is  non-empty. 

Amsterdam,  februari  1991 
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Abstract  la  tii*  payer.  »e  propose  a 
cexpatiag  systes  thick  recogaises  the 
structural  characteristics  of  a  *i»es  large- 
scale  dyaitic  xodcl  axd  solves  the  aodel 
through  a  prograsse  astozatlcal ly  generated  os 
the  basis  of  recogaised  strsctsral 
characteristics.  Throwing  light  oa  the 
eassal  relatiosship  thich  is  iaaaseat  is  the  \ 
aodel.  this  kind  of  systea  is  also  ssefsl  for 
revislaz  the  aodel  so  as  to  ixprove  its 
dyaaaic  behaviosr  particclarly  vhea  the  aodel 
is  sot  oaly  of  large-scale  bat  of  high 
coapleiity. 


1  .  I KTBODDCT IOW 


la  spite  of  the  receet  rapid  progress  la 
coxpstiag  ability,  there  reaain  varioos 
difficulties  ia  solving  a  large-scale  dyaaaic 
aodel.  For  iastaace.  a  aecbaaical 
application  of  standard  approxiaat ioa-aetbod 
is  apt  to  reader  the  iterative  process  endless 
e?ea  if  the  process  is  to  be  convergent  or  to 
resslt  ia  inaccsrate  and  callable  numerical 
solutions.  Bovever.  as  vas  asserted  la  a 
series  of  vorks  [1]  through  [4].  the  above 
difficulties  are  kaova  to  be  overcoae.  at 
least  partly,  by  BtiHs.Bg  the  qualitative 
iaforaatloa  of  the  aodel  soch  as  the 
structural  knovledge  on  the  aodel  and/or  the 
causal  relationship  of  variables  contained  in 
the  nodel.  Fortberaore.  to  oar  knovledge. 
the  vorks  cited  (for  exaaple  CAOSOB)  confine 
ibeaseJves  to  the  analysis  of  interdependent 
causal  slractores.  Therefore  it  nay  be  of 
soae  valce  to  develop  a  coipstlng  systea  vblch 
naaerically  solves  the  given  dynaaic  aodel 
vitfa  the  aid  of  the  qoalitative  knovledge 
perceived  by  the  systea  itself. 


1)  This  vork  is  supported  by  a  grant  of  Japan 
Ministry  of  Education  (grant  Xo.  01300002). 


1.  Svabol 


Ve  list  the  syabol  to  be  esed. 
y  r  As  nil  vector  of  eadogvaocs  variables, 

z  :  Aa  azl  vector  of  predeteraiaed 

variables.  If  accessary,  ve  specify 
the  first  8j  eleaeats  of  z  as  the 


lagged  eadog eaoes  variables. 
f|(y. z)  ;  The  i-tb  equation  of  the  systea 
to  be  considered 

A* ( a j  j )  :  The  Jacob<aa  aatriz  of  the. aodel 


*PQ 


»r 


sader  coasiderat ioa.  More 
specifically. 

lj  *  i  • 

A  ssbaatriz  of  A  consisting  of  aj. 

such  that  i <P  and  jcQ.  there  P  (Q)  is 
any  subset  of  all  rov-indiees  (coloan- 
indices) 

A  ssbvector  obtained  froa  y  by 


extracting  yj  sich  that  !c? 


:  The  set  of  all  Indices  of 
eqoatlons.  Hovever.  M  is 
replaced  by  T  if  ve  vish  to 
eapbaslse  that  It  is  the  set  of 
all  endogenons  variables. 


Throoghoat  the  paper,  ve  are  concerned  vltb 
the  follovlng  simultaneous  systen  of  dynaaic 
eqaat ions; 

f (r.  x)  •  O  (1) 

.  thich  ve  henceforth  call  the  canonical  fora. 


B,  Tervinology 

Definition  I.  Systea  (1)  Is  said  to  be 
decoaposable  If  there  exists  a  noneapty  proper 
subset  S  of  M  soch  that  corresponding  to  S 
there  can  be  found  a  proper  subset  ?s  of  V 
vltb  the  sobseqoent  properties: 

(O  IS  ■  1 7 
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(i)  associated  with  aay  j  of  there  exists 
aa  i  of  S  such  that  a  j  ■  *  0.  and 
(I)  ajj  *  0  for  aay  I  of  S  aad  any  j  of  H-Tj. 
the  coapleaeatary  set  of  T$  with  respect  to 
M.  there  JS  (5T)  decodes  the  aaaber  of 
decent*  contained  in  the  set  S  (T). 

In  words.  Definition  1  scans  that  -S 
equations  whose  indices  are  in  the  aot  S 
contain  jest  the  sane  nenber  of  endogenous 

variables.  Bence,  if  proves  to  be 

STS 

noasiayelar.  the  nenerical  values  of  these 
endogenous  variables  are  deternined  as  the 
functions  of  predeterained  variables.  This, 
in  turn,  enables  ns  to  solve  the  renaininy 
endoyeaous  variables  fron  the  renaininy 
equations.  Concerniny  Definition  1,  it 
should  be  noted  that  AMT  is  a  reducible  nairix 
if  and  only  if  systen  (1)  is  deconposable  with 


Definiiios  2.  ¥e  call  systen  (I)  partially 
linear  if  there  exist  noneapty  'subsets  T  of  M 
and  fT  of  T  such  that  Is  a  constant 


aatrlx. 

The  concept  of  partial  linearity  is  also 
useful  for  our  purpose,  since  once  the  partial 
linearity  Is  found,  we  can  ellnlnate 
endoyenous  variables  whose  nunber  equals  the 
rank  of  A^y  .  Seedless  to  say.  It  does  not 


barn  yenerallty  to  £$$une  that  ST  does  not 
exceed  SVj.  for  otherwise  the  set  T  can  be 


contracted  until  ST  i  Vj. 


n .  THE  COMPOTIWC  SYSTEM 


The  proposed  conputlny  systen  is  sumarlsed 
as  follows; 

K,  Bear ranyeaent  of  the  equations  read  Into 
the  canonical  for*.  Needless  to  say. 
expressions  which  do  no  aeet  the  FOSTBAN77 
yraaaar.  If  any,  are  pointed  out  before 
proofing  further, 

2^  Checklny  the  deeoaposabi  I 1 ty  and/or  the 
partial  linearity 

2, l.  In  the  case  of  deeoaposabi 1  i ly.  It  Is 
further  checked  whether  or  not  the  *uDsystea 
correspond  law  Is  the  set  $  can  be  rewritten  as 

yT  *  y(yT  . *>  (2) 

Ts  *s 


.  to  which  the  socalled  Gacss-Seidel  aethod  is 
directly  applicable.  If  the  Gauss-Seidel 
nethod  is  unappl icable.  we  resort  to  other 
nethods.  such  as  Xewton  nethod  and.  if 
necessary,  the  Jacobian  natrlx  is  yenerated 
tbrouyb  nonnunerica!  differentiation.  The 
generation  of  the  Jacobian  natrlx  is  executed 
by  a  proyranne  written  in  PEOLOG.  ** 

2. 2.  If  the  partial  linearity  is  found,  sose 
endoyenous  variables  are  plielnated  to  obtain 
the  reduced  (or  contracted)  systen. 

3.  The  yeeeratiou  of' the  conputiny  proyranne 
in  accordance  with  the  coeputation  foraula 
specified  in  staye  2  above.  Briefly  stating, 
the  proyranne  for  numerical  solutions  is 
yenerated  by  replacing  the  nane  of  each 
variable  with  the  corresponding  location 
(array  nane)  in  the  nenory.  and  by  utilising 
the  stored  proyrannlny  knotledye  for  carry. ny 
out  the  specified  eonputat ional  procedure. 
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2)  Te  have  defined  the  deeoaposabi 11 ty  and 
the  partial  linearity  In  terns  of  the  Jacobian 
natrix  slaply  for  the  descriptive  clarity 
Evidently,  the  deconposabi 1 i ty.  as  well  as  the 
partial  linearity,  can  be  checked  without 
having  the  knowledge  or.  the  Jacobian  natrtx 
Therefore.  It  is  at  this  stage  that  the 
Jacobian  natrix  is  constructed  through 
nonnuaerical  differentiation. 
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ATtSTRAOT 

Central  to  mart  policy  optimuation  considerations  i*  *  - 
deterministic  optimisation  algorithm.  We  discus*  three  each  alfonthm* 
for  nonlinear  model*.  The  first  if  a  Gau**-Newton  alforithm,  ordinarily 
for  unconstrained  problem*.  It  require*  ample  derivative  information,  it 
b  slow  but  quite  effective  for  rough  _  computation*?  The  second  i*  a 
Goldftein-Lerit^Polyak  type  alforithm.  It  ea*3y  admit*-  linear 
constraint*  *uch  a*  bound*  on  the  control*.' The  derivative  information, 
for  this  algorithm  can  be  efficiently  obtained  urinf  a  backward  evaluation 
approach.  The  algorithm  i*  fart  and  involve*  a  quadratic  »ubproblem  with 
inequality  bound*.  The  third  i*  a  sequential  quadratic  programminf 
algorithm  that  admits  general  equality  and  inequality  constraint*.  The 
derivative  calculation*  are  expensive  but  the  algoritm  is  fast  and  solves 
the  uuort  feneral  formulation. 

L  INTRODUCTION 
Consider  the  policy  optimisation  problem 

”>fa{j(Y.O)|l'(Y.U)=<>}  (1) 

where  Y  an^  U  are  respectively  the  endofenous,  or  output,  variable*  and 
policy  instrument*,  or  control*,  of  the  *y*tem.  J  i*  the  policy  objective 
function  and  P  b  the  model  of  the  tconomy  which  I*  affected  by  the 
random  duturbanoi^ector  «.  In  feneral,  P  1*  nonlinear  witS  respect  to  Y, 
U,  Problem  (l)  i*  e**entlally  a  static  transcription  of  a  dynamic 
optiml ratlin  problem  in  diorete  time  where 


, 

U1 

7l 

Us 

;  Y  k 

7t 

"T 

Jt 

with  Ug  G  R*  and  yt  €  R"*  denoting  the  control  and  endofenous 
variable 

vectors  at  time  period  t.  The  optimlratlon  covers  the  periods  t  =  1, T. 
Thu»,  Y€R“xt,U€R“xi,  FFCR"'-RTX“«id  J:J  CD'1- 
Rl  (if=Tx(m{s)).  The  vector  valued  function  P  Is  essentially  an 
econometric  model  which  I*  a  system  of  nonlinear  difference  equation* 
represented  in  static  form  for  time  period*  t  =1, .. ,  T,  The  rise  of  the 
system  is  feserally  in  the  order  of  hundreds  of  equations  per  time  period. 
Problem  (1)  is  unconstrained  We  shall  discus*  a  simple  Gauss-Newton 
(G-N)  type  alforithm  for  its  solution. 

The  second  policy  optimisation  formulation  we  shall  consider 
admits  linear  inequality  constraints  on  the  control  variables 

mln{j(V,U)|p(Y,«)  =  l);K'rU  <  b}.  (3) 

We  shall  consider  a  Golditt  in-Levi  Un-Polyak  (GLP)  alforithm  for  giving 
this  problem. 

To  introduce  the  third  problem,  we  shall  use  the  vector  x  to 

denote 


IL  &N  ALGORITHM  FOR  UNCONglRAINBD  PROBLEMS 
Associated  with  the  model  P  (Y,  U)  m  0,  there  5*,a  model 
solution  that  computet  the  value  of  Y,  for  a  ■  fivers  value  of  U.  The 
computational  mapping  is  denoted  by 

"  Y=V(U).  (5) 

This  mappinf  can  be  used  to. eliminate  the  output  variables  from  the 
problem.  Hence,  we  can  express  (!)  a* 

mib{l(U)}  («,.) 

where 

j(u)  =  j(t(U),u).  (e,b) 

For  this  particularly  ample  alforithm,'  we  also  sasume  that  the  objective 
funcUaa  is  a'quadratie  and  is  given  by 

j<Y-Yd,'QJ(Y-Yd)>  +J  <U.Ud,Qu(U.trJ)> 

where  Qy,  Qu  are  symmetric  matrices;  Qy  is  posiUve  « mi-definite  and 
Qu  is  positive  definite,  Y*\  U^  are  pven  desired  value*.  Whris  the  model 
is  ured  to  eliminate  Y  from  this  function,  we  have 

0(0)  =  J  <  t<U)  -  ri.  Qy(,(U)  -  ri)  >  +  }  <  U  -  tf*.  QU(U  -  tf1)  > 

Startinf  from  an  initial  point  U0,  the  algorithm  fenerates  the  sequence 
{Uj.},  k=0, 1, ...  where 


uk+i  =  uk  +  ,k,V 
The  direction  dj,  is  defined  by 

Hj  dk  =  -  V  Q(Ut)  «tb  I)J  =  ilj  Q,  Nk  +  Qu 


The  Jacobian  Nj.  U  initially  approximated  by  the  dynamic  multipliers  of 
the  model  and  subsequently  updated  ueing  Broyden’s  rack-one  formula 

..  rI  .  M®k+1>  -  «W  ”  Tk  Nk  dk)  ak 
bk+,  a  Nk  + - '-~<-dkT3^ - - 


in  order  to  preserve  the  block  lower  tnanfular  structure  of  the  Jacobian, 
reflecting  the  causahty-tune  structure  of  the  model,'  Schubert’s  (1970) 
modification  to  the  above  formula  can  be  used.  The  itepsite  Tj,  q  (0, 1)  is 
determined  using  an  Armijo  stratify  to  satisfy 


Q(UkH)  -  Q(uk>  <  io’S  <VQ(yk),dk> 


Thi*  alf/ithra  i*  a  fenerah*ation  of  Holbrook’s  approach  of  linearising 
Ike  model  and  solvinf  the  quadratic  optimisation  subject  to  the  linearised 
model  (Holbrook,  1974;  Chow,  1975).  Further  deauib  and  properties  of 
the  alforithm  are  discussed  in  Rustem  (1981).  Although  the  alfonthm  is 
relatively  slow,  it  is  easy  to  implement  on  macro -models  and  requires 
little  coxupuutiond  effort  and-ba#  been  #ucc«-»fu!y  used  m  policy 
optimisation  exercises. 


*  =  [  u  }  (3) 

Considtr  the  feneral  nonlinear  equality  and  inequality  constrained 
problem 

rain  j  J  (x)  |f  (x)  -  0  .G(x)  £  0  J.  (<) 

We  shall  consider  a  sequential  quadratic  programmapj  ($QP)  alforithm 
for  this  problem. 


TTT  fii,T>  AT/tQRITlIM  FOR  UNBAR  INEQUALITY  CONSTRAINTS 

The  GLP  alforithm  is  based  on  ths  projection  of  the 
unconstrained  descent  direction  onto  the  convex  feasible  set  of  linear 
inequality  constraints.  Usinf  tht  mappinf  (5),  nroblero  (2)  can  be  written 

rob  {  J  (U)  I  NT  U  £  b}.  (7) 

The  algorithm  titrates  a  stqutnce  of  point#  {Uj.)  such  that 

vk+i  =  uk  +  'k(°  -  uk) 
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where  U  is  the  projection  of  the  unconstrained  quasi-Newton  step  on  the 
convex  constraint  set,  The  unconstrained  step  is  given  by 

“u  =  uk  -  «k #;*  vj (ut):  ,ke p, *), *  e[i,2) 

where,  6w  is  * ‘  positive  definite  qiU«-Newton  *pproxinution;.to  the 
Hessian  m  *nd  the  unconstrained  steptise  is  any  sequence 

with'  the  above  restrictions  and  that  {*k)-*l  (eg  ^  r  1,  V  k).  The 
projection  of  Tl  oh  the  constraint  #et  is  pven  by 

0  =  «tmioflU  -  “UJ1  |  NTU<  b  ) 

1  Hk  ' 

tad  the  stepsue  rk  €  {0, 1}  if  determined  turn*  an  Armijo  i tritely  rich 
that  the  following  meat  function  is  satisfied 

Wk+l)  -  lCUk)  <  rkf  <V)(Uk),U  -  Uk>,  ,,  €  (o,l  -  |) 

The  advantages  °f  this  algorithm  are  that  Vi  can  be  efficiently 
-  computed  urine  *  back  substitution  »chem«.  In  Addition,  the  ftepfite  ry, 
converte#  to  unity  end  the  algorithm  convene#  to  the  solution  at  « 
superiinear  rate  (Rustem,  1984). 

IV.  SQP  ALGORITHM  FOR  NONLINEAR  CONSTRAINTS 
Consider  definition  (3)  tad  the  Augmented  Lagrangian  of  (4): 

L(x,  X,  p,  c,  n)=  l  (x,  c,  o) + F(x)xA  +  G(x)x/i  (8) 

where  0  i  c  £  R+,  0  <C  a  €  And  A,  p  tit  the  Kuhn-Tuckw 
multiphers.  We  define  (  . )+  such  that  its  j  th  element,  (.)*+  a  m«x((.)*l 
0).  The  penalty  function  associated  with  (3)  is  defined  by 

e(x,c,,)=j(x)+§  n  pm  iij+^ii  (<Q(x)+<ti;.  <«) 

Neither  the  augmented  LagranjiAn  (5)  nor  the  penalty  function  (3) 
posse*)  continuous  HesriAn*.  In  order  to  overcome  the  difficulties 
Ariiinc  from  this,  we  shall,  At  AppropriAte  junctures,  consider  the  effect  of 
replacing  the  Hessian  of  (6)  by  thAt  of 

Jt(x,  A,  p,  c,  *)  c  £(x,  c,  o)  +  F(x)x  A  +  G(x)x  p  (7) 

where 

*(x,c,a)  *W(x)  +  §  ||  F(x)  flj  +  £  ||  (cG(x)  +  o)  |£  (8) 
i  s  And  t  differ  only  in  their  lA*t  term*  corresponding  to  the  Inequality 
Constraints,  In  contrAit  to  (3),  the  HessiAn  of  i  exists,  provided  J,  F 
*»d'Q  '#«  suitably  differentiable. 

The  successive  quadratic  programming  (SQP)  quasi-Newton 
Algorithm  considered  In  this  p*pt r,  Involve#  the  bask  IterAtive  procedure 


Ana 

J-  c k+1  ^k+j)  “  *  0 

4+l<?k>  ■*  <3!(1!k+l>>  0 

G.*tn  x,j,  £  R+,  l  e  (o,  <xi),  F,  €  (o,  l  -  jL),  «,  6  (},  ll  7  €  (o,  l), 

Sfcpfi:  Set  k=0,  e^  =  mex { -Cq <j)(xq), 3 ), j  =  1, .. ,  1. 

Step  1:  •  Solve  (10)  to  obtAin  dj.  And  p±+ j« 

‘  Step  2:  If  optimAlity  is  Achieved,  stop.  Else  go  to  step  3, 

sm*  i  if 

vjJt)k  +  tldJVk-(Gk)K(*k)-ck!ll  h  II,  +  IK0k)+lljl  S  * 


“  'k+l  =  ck'  B'?'"1 
j7^dt-f.,^Hkdk-(Gt)Iot(Xkl 


W  (  II  ?k  Hj+ll^kM,  *  J 

CAlculste  °k+i(x^)  using  (12, b). 

Step  ii  Determine  fj,  with  =  xk  +  rk  **k  ,4*kfying  the 
Inequihty 

e(xk+l’ck+l’<>k+j(,1k+lW  ~  etxk'ck+l'°k+l(1'k)>  S 

Vtv{<*k>el+l>“k+l(,k»T,Jk  (*•) 

<*k+l(xk+l) 11  ««“pwted  usin{  (12,c). 

Step  fi:  UpdAte  ,  to  compute 
Step  Setksik  +  lAndfoto  Step  1. 

The  Above  Algorithm  Allow#  the  most  fener*!  formulAtlon.  The 
stepsite  ry  converje*  to  unity,  the  pentity  petAmeter  e^  does  not  prow 
indefinitely  *nd  (xk)  converses  to  the  solution  At  a  two-»tep  Q- 
superlineAr  r*te.  However,  its  computAtion*]  demAnd*  in  termi  of 
derivAtive  evAluAtioiu  at#  conridetAble  for  lATfe  sc  Ale  problems. 

ACKNOWT.EDGRMKNTr; 

The  finAodAl  support  of  ESRC  U  pAtefully  Acknowled|ed. 


xk+i  =xj.^rkak  W 

where  dk  solves  the  Qvtdrttk  Progrtmmmf  Svbprobkm  (QP$) 

ntin  {  q (d, ck,  ^(xj.))  |  VFk  d  +  Fk  c 0,  V Gj d  +Gk.  ^  0  ^0) 

the  multiplier#  of  this  QPS,  comspondins  to  dk,  at#  denoted  by  Ak .  j, 
A,+i.  It  l*  Assumed  th*t  the  feAiible  set  of  (10)  !#  nonempty  And  thAt 
the  QPS  doe#  hive  a  solution,  The  objective  function  U  pven  by 

q  (d,  c,  o(xk))  s  dxV  £  (xk,  c,  o(xk))  +  J  dTftkd  (11) 

with  V£  a  Vx£.  fik  if  a  quAfi-Newtoh  ipproximtiou  to  the  UetslAn  of 
4  ,  with  respect  to  x  ,  #t  xk,  fa,  ck,  ,  Followinj  the  orifinAl 
sussection  by  Wilson  (1933),  feverw  vinAnt#  of  the  SQP  Alsorithm  Lay* 
been  proposed.  These  hAve  proved  to  be  effective  in  solvins  problem  (3) 
(*ee,  e^.  Fad,  S  Ark  at  And  LAsdon,  19SS;  And  the  references  in  Rustem, 
1990), 

Let  the  J  th  element  of  the  vector#  G  and  a  be  re#pectlvely 
denoted  by  (p  And  v.  Wc  ire  nuialy  concerned  with  two  specific  values 
of  a  where  Qq  it  #et  In  Step  0  with  <0  €  (0,  oo)  fjvea  A#  Input.  These 
Axe 

r<k+lGl(xk)  K  Gl(xk)<  0 

4+t<xkl  =  ,  ,  (UA>) 

4(xk)  II  d(xk)  >  0 
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Abstract, 

In  this  paper  we  present  a  computable  general 
equilibrium  model  with  production.  At  any  time,  the 
household  sector  comprises  two  overlapping .  generations 
of  adults.  We  obtain  the  following  results  : 

,«1)  In  the  case  of  no  taxes  and  no  government 
i expenditures,  the  model  has  two  steady  states  associated 
"respectively  with  a  strictly  positive  and  a  null 
Interest  rate. 

II)  the  Introduction  of  taxes  yields  two  steady 
states  with  a  strictly  positive  and  a  strictly  negative 
Interest  rate. 

HI)  if  the  government  expenditures  are  large,  then 
one  can  have  two  steady  states  with  strictly  positive 
interest  rates  or  no  steady  state, 

v)  the  steady  state  with  lowest  Interest  rate  Is 
locally  stable  while  the  one  with  highest  Interest  rate 
Is  unstable. 

INTRODUCTION 

The  use  of  Computable  General  Equilibrium  (CGE) 
models  is  now  current  practice.  The  early  1980’s  saw  a 
•flurry"  of  these  models  dealing  with  three  types  of 
problems  : 

-  Sectorial  problems  (energetic,  agricultural) 

-  International  trade 

-  Fiscal  Policy 

Isee  e.g  Kemal  Dervls  et  al.  (1982).  Shoven  and  Whalley 

(1984),  Fullerton.  Henderson  and  Shoven  (1983)).  Auerbach 
and  Kotlikoff  (1987)  constructed  a  large-scale  dynamic 
fiscal  CGE  model  where  consumers  Itave  a  life-cycle 
behaviour  and  enterprises  maximize  their  Intertemporal 
profit.  These  agents  are  assumed  to  have  perfect 
foresights  on  prices  and  wages.  Their  model  Includes  also 
taxes  and  public  expenditures.  It  is  used  to  study 
dynamic  fiscal  policy.  Mathematically  two  kinds  of 
problems  ,  rise  from  this  model  ; 

i)  the  existence  and  the  uniqueness  of  the  steady 
states  ; 

11)  the  stability  of  these  steady  states. 


It  Is  well-known  that,  In  a  two  periods  overlapping 
generations  model  without  production,  taxes  and  public 
expenditures,  there  exist  two  steady  states  ;  with  one  of 
them  Is  associated  a  positive  Interest  rate,  while  with 
the  other  one  Is  associated  a  zero  Interest  rate  (see, 
e.g  Benassy  and  Blad  (1990)), 


The  problem  of  the- existence  and  the  uniqueness  of 
the  steady  states  has  not  been  studied  by  Auerbach  and 
Kotlikoff.  In  order  to  study  the  stability  of  the'  steady 
states,  Laitrier  (1990)-  ‘points  out  that  the  model  of 
Auerbach  and  Kotlikoff  •  Is  -similar  to  rational  expectation 
(RE)  models.  Therefore,  if  around  a  steady  state  the 
number  of  stable  eigenvalues  Is  equal  to  the  number  of 
initial  conditions,  there  exists,  under  some  regularity 
conditions,  an  unique  transition  path  which  asymtotlcally 
converges  to  this  steady  state.  But,  Implicitly,  on  this 
path,  the  -  consumers  born  before  the  date  of  shocks 
(policy  changes)  are  constrained. 

The  main  purpose  of  this  paper  Is,  using  a  very 
simple  model  with  overlapping  generations,  production, 
taxes  and  public  expenditures,  on  the  one  hand  to  study 
the  existence'  and  the  uniqueness  of  the'  steady  states 
and,  on  the  other  hand,  to  introduce  a  definition  of 
transition  paths  where  the  consumer  born  before  the  date 
of  shocks  could  be  satisfied.  The  RE  transition  path  will 
be  a  particular  case  of  these  paths. 

Concerning  the  steady  states  we  have  the  following 
result:  If  the  government  expenditures  are  less  than  some 
amount  of  taxes  then  there  exist  always  two  steady  states 
associated  respectively  with  a  positive  and  a  negative 
Interest  rate. '  If  these  expenditures  are  more  than  this 
amount,  first,  one  has  two  steady  states  with  positive 
Interest  rates  and  beyond  some  value  of  the  expenditures, 
one  has  ua  steady  state. 

Concerning  the  transition  path,  we  assume  that  the 
consumer  born  before  the  date  of  the  policy  changes  has  a 
desired  consumption  for  this  date.  The  equilibrium 
transition  path  will  be  defined  as  the  one  where  every 
agent  is  satisfied.  In  particular,  the  consumer  born 
before  the  date  of  shocks.  The  RE  path  can  be  viewed  as  a 
particular  path  associated  with  some  desired  consumption 
of  this  consumer.  Then  we  have  the  following  result:  for 
most  of  the  desired  consumption  functions,  the  steady 
state  with  negative  interest  rate  (or  the  smallest 
positive  Interest  rate)  is  locally  stable  while  the  one 
with  positive  Interest  rate  (or  greatest  positive 
Interest  rate)  -the  steady  state  of  the  RE  path-  Is 
locally  unstable, 

I.  Hu  model 

In  this  model  there  are  two  goods,  labor  and  a 
physical  good  which  can  be  consumed  or  used  as  capital 
good.  The  consumer  lives  two  periods.  In  every  period  a 
new  trader  Is  born  :  therefore  at  each  date  there  are 
exactly  two  consumers,  a  young  and  an  old  one.  The 
economy  begins  at  date  1.  There  is  an  entreprise  which 
maximises  Its  Intertemporal  profit  from  date  1  to 
infinity.  We  assume  that  labour  is  exogenously  fixed.  The 
agents  are  assumed  to  have  perfect  foresight. 

l.l  -  The_consumer 
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Let  Cj  ^  ,  c^t  denote  the  consumptions  at  periods  t  Mj»  3q  and  his  (her),  consumption  at  date  0,  Cj  Q.  One  can 

and-t+1  of  the  agent  born  at  date  t;  He  (she)  maximises  write: 

his  (her)  Intertemporal  utility,  u(Cj  ♦  5*  u(c^ 

under  the  constraint  c_  _  -  Me.  ...  ,o..  a..  8  J 


Therefore,  our  economy  has  initial  data  (K^,  Cj  Q, 


Where  pt>  pt+J  and  W^,  are  prices  and  nominal  0 

_  1.5  -  Steady  state 

wages  at  date  t  and  t*l,  are  tax  rates  at  the  - 4— 

same  dates  ;  l  is  exogenous  leisure. 

1.2  -  Production  Assume  now 


There  is  one  producer  who  maximises  his  (her) 
intertemporal  profit 


max  £  <pt  yt  -  p,  .!,  -  W,  L,) 


G  -  g„t.  »  r. 


A  steady_  state  associated  with  there  sequences  Is  a 
triple  (K.C..E)  such  that  the  equilibrium  from  it 


under  the  constraints 


where  ■  w^l •!)  u  -  Cj  t 


Where  y^,  K^.  denote  production,  capital 

stock,  investment  and  labor, 

1.3  -  Markets_clearlng 

Let  <0^)  denote  the  sequence  of  public  expenditures. 
Then  one  has,  for  t  *  1  ! 


(S>  cl.t*c2,M*K».l-<1"5)Kt,C<-r(W 


(6)  L,  -  2(1-11 


1,4  -  Equi Hbrium 

At  date  2,  relation  (5)  Is  fulfilled  if  one  knows 
how  to  <  determine  q.  the  consumption  at  that  date  of 
the  agent  born  at  date  0. 

First,  this  agent  has  to  face  his  (her)  budget 
constraint 


c2,o  ■  •VW»  H  *  T0 


Secondly  one  can  assume  ,that  this  agent  has  a 
desired  consumption  at  date  1.  c2  Q,  which  depends  on  «j. 


It  can  be  easily  checked  that  the  associated 
function  *  has  the  following  form  : 

,|C1.0-  "r  V "  ci,o  *  ( -fj ) 

1.6  -  Transltorjr^gath 

Given  a  steady  state  (KJt  Cj  £).  a  transitory  path 
from  this  steady  state  and  associated  with  sequence  (G^> 
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Abstract  We  discuss  some  the  algorithmic  approaches  to  the 
solution  of  econometric  models  which  contain  both  forward 
looking  expectations  and  market  clearing  equilibrium 
conditions.  We  find  in  favour  of  first  order  iteration  methods 
as  being  the  most  robust  method  of  model  solution,  and  report 
some  simulations  on  the  LBS  model. 

IfiUttiuStiQfi 

The  application  of  numerical  methods  and  the  use  of  optimal 
control  techniques  in  economics  has  been  t.ansformed  over  the 
last  decade  by  the  impact  of  rational  expectations  (see  Holly 
and  Hughes  Hallett,  1989).  There  is  now  a  variety  of  methods 
which  can  be  used  to  solve  econometric  models  in  which 
expectations  are  forward-looking  or  rational.  There  is  the 
multiple  shooting  technique  of  Lipton  et  al  (1982),  the 
extended  path  method  of  Anderson  (1979),  ana  Fair  and 
Taylor  (1983),  and  refinements  of  this  approach  introduced  by 
Hall  (1985)  and  Fisher  et  al  (1986).  Finally  there  is  the  penalty 
function  method  of  Holly  and  Zarrop  (1983). 


Recently  there  has  also  been  a  increase  in  the  use  of 
equilibrium,  market  clearing  frameworks,  especially  for 
modelling  financial  markets.  In  this  paper  we  consider  how 
methods  for  solving  nonlinear  rational  expectations  models  can 
be  extended  to  allow  asset  prices  to  clear  spot  markets  -  so 
demand  is  equal  to  supply  -  as  well  as  allowing  expectations  to 
be  forward  looking, 


StiutioR.Mrthttlg 

A  deterministic  nonlinear  model  with  expectations  of  next 
periods’s  endogenous  variables  can  be  written. 

,xt  J  =  ®  (0 

Where  yt  is  a  m-vector  of  endogenous  variables  and  x{  is  a 

n-vector  of  exogenous  vanables  or  policy  instruments,  s  and  r 
denote  maximum  lags.  For  compactness  we  can  rewnte  (1)  as. 

F(Y,Y«,X)  =  0  (2) 

Where  Y  =  Y5  =  (y^^,..yji1...y^1). 

and  X  =  (x{,x£I..,xj,.,x£).  In  order  for  the  problem  to  be  well 
defined  we  need  a  terminal  condition  for  wnich  ye  shall 

discuss  later.  A  penalty  function  method  simply  requires  the 
minimisation  of  the  quadratic  loss  function. 

J(Y,Y*  ,X)  =  l/2[(Z>QZ)+(Ye,NVe')l  (3) 

where  Z  =  ),  and  a,  =  y®  -  y,+1,  for  t=l,T-l, 
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and-Zj  =  y,p-  <pt  where  <p  is  the  terminal  condition  Q  is  a 

semi-definite  positive  weighting  matrix,  and  N  is  a  positive 
definite  weighting  matrix.1  Clearly  the  unconstrained  minimum 
of  (3)  implies  that  the  solution  lies  on  the  saddlepath  and  that 
zt=0,  for  t=l,T. 

When  the  mode!  also  contains  equilibrium  conditions  that  m 
asset  markets- supplies  are  always  equal  to  demand  and  the 
asset  price  ensures  that  tins  holds,  then  it  1$  possible  to  extend 
the  penalty  function  method  by  defining  qt  =  dt-5t,  where  d  is 
demand  and  s  supply.  Then  the  addition  of  the  quadratic  terms 
Q’KQ  and  P’RP  will  enable  the  set  of  prices,  P  = 
(p|»p2»*  >p;."Px)  *°  ensure  Q=0,  where  K  is  a  semi-definite 
positive  weighting  matrix  and  R  is  a  positive  definite  weighting 
matrix. 

In  practice  as  with  most  applications  of  penalty  function 
methods  the  dimension  of  the  problem  can  cause  difficulties.  If 
a  large  number  of  expectational  variables  and  asset  prices 
appear  in  the  model  then  the  size  of  the  problem  can  become 
unmanageable.  This  suggests  that  alternative  methods  which 
do  not  require  the  direct  minimisation  of  (3)  and  the 
calculation  of  the  first  and  second  derivatives  may  be 
preferable. 

One  such  method  is  suggested  by  the  used  of  generalised  first 
order- iteration  methods  With  these  methods  the  process  of 
solution  involves  an  inner  and  an  outer  loop.  Also  within  the 
inner  loop  there  is  a  further  loop  of  the  form: 

p*+1  =  ?*(<),/$/’  (5) 

where  k  is  the  iteration  counter,  and  pt  is  a  vector  of  asset 
prices  which  clear  the  market  for  financial  assets.  The  use  of  a 
square  root  proved  to  speed  up  convergence.  Within  the  inner 
loop  there  is  an  extrapolation  parameter  (A^)  and  also  an 

extrapolation  parameter  for  the  outer  loop  (A^).  In  the 

Fair-Taylor  method  the  inner  loop  iterates  to  convergence 
before  using  the  outer  loop.  In  the  method  of  IIall(l985)  only 
one  iteration  is  used  in  the  inner  loop  pass.  However,  we  have 
found  that  confining  the  inner  loop  to  a  single  iteration  drove 
the  model  outside  the  feasible  solution  region,  so  the  number  of 
inner  loop  iterations  was  restricted  to  4,  This  also  had  the 
effect  of  restricting  the  iteration  count  of  (5), 

Some  Empirical  Illustrations. 

We  used  the  LBS  econometric  model  (see  Dinenis  et  a!  (1989) 
for  a  description)  in  order  to  carry  out  some  comparisons  of 
different  methods  of  solving  models  with  equilibrium 
conditions  and  forward  looking  expectations.  In  the  LBS  model 
there  are  three  market  clearing  prices  for  the  equity  market, 
the  gilts  market  and  the  foreign  exchange  market.  Lying 
behind  each  of  these  markets  is  a  dis-aggregated  set  of  asset 
demand  and  supply  relationships  for  different  sectors  of  the 
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domestic  economy  as  well. as  overseas.  Corresponding  to  each 
of  these  prices  is  also  an  expected  price.  Strictly  speaking  for 
the  foreign  exchange  market  it  is  the  exchange  rate  which  is 
used.  Also  the  degree  of  substituteabihty  between  domestic 
and  foreign  assets  governs  the  extent  to  which  an  uncovered 
interest  parity  condition  of  the  form; 

et=et,t+i'+rt_rswt  <6) 

actually  holds,  where  e  is  the  exchange  rate,  r  the,  domestic, 
and  rsw  the  foreign  interest  rate,  and  e^  ^  j  the  expectation  of 
the  exchange  next  period  conditioned  on  information  at  time  t. 
We  found  that  the  degree  of  substitution  between  domestic  and 
foreign  assets  was  implausibly  low,, and  we  therefore  imposed 
the  UIP  condition  given  by  (6).  Thus  the  number  of  market 
clearing  prices  was  reduced  to  two. 

In  Chart  1  we  show  the  effects  on  all  three'  asset  prices  of  a 
temporary  cut  in  short  term  interest  rates  of  1  percentage 
point  for  eight  quarters.  For  the  simulation  the  inner  loop 
extrapolation  parameter  was  set-to  0.7,  and  the  outer  loop 
extrapolation  parameter  was  0.8  for  the  exchange  rate,  0.8  for 
the  price  of  gilts  and  0.7  for  the  price  of  equity.  The  particular 
form  of  (6)  means  that  a  temporary  fall  in  interest  rates 
generates  an  immediate  downward  jump  in  the  exchange  rate 
in  order  for  the  exchange  rate  to  then  rise  just  enough  to 
produce  a  capital  gain  equivalent  to  the  fall  in  interest  rates. 

However,  because  an  explicit  arbitrage  relationship  does  not 
determine  the  behaviour  of  the  equity  and  gilts  prices,  we  rely 
on  the  structural  asset  supply  and  demand  equations  to 
provide  the  degree  of  substituteability.  Interestingly,  both 
equities  and  guts  jump  immediately  in  response  to  the 
unanticipated  fall  in  interest  rates  by  about  2%.  However, 
thereafter  they  diverge.  The  equity  pnee  continues  to  rise 
peaking  in  the  eigth  quarter,  and  then  declining  back  to  zero  as 
the  interest  rate  rises  back  to  its  original  level.  However,  the 
gilts  price  comes  back  only  slowly,  implying  considerable 
imperfect  substitutability. 


CHART  1 

lnt*r*st  Rat*  Fol  ( 1%,  8  p*rted«) 
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ABTRACT:  This  contribution  aims  at  presenting  some  methods  for 
solving  nonlinear  rational  expectations  models.  These  methods  are 
applied  on  a  model  of  the  French  economy/  built  up  by  Laffarguc, 
Malgrange  and  Pujol  (1990).  At  fust,  a  NEW  METHOD,  introduced 
by  Laffargue(1990),  is  used  for  deterministic  simulation  of  the  whole 
model.  This  method  is  a  relaxation  one  based  on  the  Newton- 
Raphson  algorithm,  contrary  to  the  usual  frameworks.  Then,  another- 
method  is  conducted,  consisting  of  a  backward  mapping  stochastic 
simulation,  using  a  normality  assumption.  Finally,  it  is  shown  how 
the  previous  assumption  can  be  relaxed:  especially,  the  method  of 
approximation  introduced  by  Tauchen(1986)  is  discussed. 

I,  THE  MODEL: 

The  model  considered,  called  PLM,  is  due  to  Laffargue,  Malgrange 
and  Pujol  (1990).  Wage's  nominal  rigidity  and  monopolistic 
competition  in  the  good’s  market  (based  on  an  arbitrage  between  the 
aggregated  domestic  good  and  the  imported  one)  are  associated  to 
generate  Keynesian  multipliers;  the  expectations  of  the  agents  arc 
assumed  to  be  rational,  The  production  block  uses  a  C.  E.  S 
production  function  and  includes  an  adjustment  cost  for  the  capital.  A 
constrained  intertemporal  optimization  of  the  aggregated  domestic 
firm,  achieved  by  the  Bellman  method,  provides  two  Euler  equations. 
The  consumpuon  block  is  solved  by  the  same  method  and  provides 
one  Euler  equation.  The  model  is  "closed"  by  introducing  the  national 
government’s  expenditures  and  taxs,  and  by  specifying  some  simple 
rules  of  its  foreign  economic  policy. 

The  model  is  considered  in  a  reduced,  calibrated  form;  it  can  be 
decomposed  in  two  inequal  parts;  an  interdependent  block  of  fourteen 
equations  including  the  three  Euler  equations  described  above,  and  an 
epilogue  of  five  equations.  The  first  block  can  be  seen  as  a  dynamic 
system  determining  at  each  date  t,  the  values  of  fourteen  endogenous 
variables,  using  the  lagged  values  of  five  predetermined  ones  and  the 
one-period  ahead  values  of  five  anticipated  ones,  thus  four 
endogenous  variables  arc  static.  Furthermore,  this  block  contains  six 
exogenous  variables  and  is  completed  by  giving  initial  values  for  the 
predetermined  variables  and  terminal  values  foe  the  anticipated  ones. 

n.  LAFFARGUE’S  METHOD: 

Consider  a  model  with  a  general  form; 

f( yi(t  l).  yO).  y2 (I+I).  I* )  *ut ,  for  i»l  to  T,  where  yi(t) 
is  the  (n*xl)  vector  of  the  model's  predetermined  variables,  y2  (t)  is 
the  (n2xl)  of  the  anticipated  ones,  y(t)  is  obtained  by  stacking  y2(t), 
y3(t)  and  yj(t)  in  this  order  with  (t)  the  (n$xi)  vector  of  the  static 
variables,  ut  and  L*  are  respectively  a  white  noise  and  the  exogenous 
variables  vectors.  Furthermore,  the  vector  of  the  initial  conditions, 
yi(0),  is  given  and  we  have  a  terminal  condition  of  the  form 

C(  y3(T),yiCO,  >2(T+l))*c,  c  is  a  constant  vector..  Our  purpose  is 
to  solve  the  constrained  system  described  above.  Given  that  f  and  C 
are  respectively  (nxl)  and  foxl)  vectors  of  functions,  with  n*ni+n2 
+n3t  the  system  to  solve  has  nT+nj+n2  equations  and  the  same 
number  of  unknown  variables.  Put  the  following  variable 
transformation.  xi(t)»  y i(t),  X2(0«y2(*H).  X3(t)*y3(0-  Denote  by 
x(t)  the  vector  obtained  by  stacking  X3(t),  xi(t)  and  X2(t)  in  this  order, 
and  by  x'(t)  the  vector  obtained  by  stacking  xj(t)  and  xi(i)  m  this 
order.  Thus  the  system  can  be  written: 

f(x,(M),x(i),LtM),  t-ItoT,  S0 x'(0)*xi(0) .  C(x(T))«c. 

Sots  a  well  known  matnx.  The  principle  of  Laffargue's  method  is  the 
following,  beginning  with  an  initial  solution  path,  the  previous 


system  is  successively  linearized  using  the.Newton-Raphson 
algorithm.  Denote  by  xH(t),  1*0  to  T,  the  solution  path  obtained  at 
the  step  (j-1);  the  solution  path  at  the  step  0)  is  determined  by: 
xJ(t) ■  xH(0+Axi(t)  , fort «* 0 to  T, 

with  AxJ«  (Axi(0) . Axi(t) )'  solution  of  the  linear  system: 

Si*l  Axis  $M;  where  SJ*1  is  the  Jacobian  matrix  evaluated  at  the  step . 
(j-1)  and  si*1  is  a  vector  of  constants.  Let  K»nT+ni+n2  For  each  j,Si 
is  a  square  (KxK)  matrix ;  Axi  and  si  arc  (Kxl)  vectors. 

To  solve  the  linear  system,  the  matrix  Si*1  is  seen  as  a  concatenation 
of  ( T+2 )  blocks;  Gaussian  elimination  is  conducted  inside  each 
block  and  when  associated  with  other  simple  matricial  operations,  the 
whole  matrix  is  transformed  into  a  triangular  form  with  each  diagonal 
term  equal  to  l.Thus,  Axi  can  be  computed  and  a  new  solution  path 
is  calculated.  The  algorithm  is  like  this  reproduced  until"  no 
improvement  by  linearization  is  possible. 

One  can  sec  that  the  main  problem  is  the  Invembiluy  of  the  matrices 
Si;  equivalently,  the  problem  is  to  guarantee  the  unicity  of  the 
solution  path:to  ensure  that,  it  is  possible  to  transform  the  model,a$ 
it’s  explained  in  Laffargue  (1990).  In  fact,  it  is  more  tractable  to 
manipulate  the  model  to  ensure  that  the  anticipated  variables,  in  their  - 
advanced  form,  only  appear  in  the  same  linear  combinations,  which 
is  a  sufficient  condition  for  the  unicity  of  the  solution  path.  The 
interdependent  block  of  PLM  is  solved  using  the  previous  result;  the 
Blanchard-Kahn  property  is  then  checked.Howcver,  the 
configuration  of  the  calculated  eigenvalues  impose  to  take  high  values 
for  T  -400. 


ni.  Backward- mapping  methods: 

The  backward-mapping  approach  was  suggested  by  Sims  (1985). 
While  the  standard  engineering  method  takes  arbitrary  stochastic 
processes  for  the  exogenous  variables  to  compute  the  corresponding 
endogenous  processes, this  approach  suggests  just  the  contrary.  A 
meamngfull  application  of  this  method  was  done  by  Lee(1989),on  a 
small  standard  stochastic  equilibrium.  Our  vicwdiere.is  to  see  some 
numerical  consequences  of  applying  this  method  to  more  complicated 
and  biggcrmodels.  At  first, the  model  is  simulated,  keeping  the 
normality  assumption  used  by  Lee, and  the  processes  of  the  available 
solutions  are  analyzed. 

A.-The  method.  The  number  of  the  endogenous*  variables' 
processes  should  lie  equal  to  the  number  of  the  exogenous  variables 
of  the  considered  block.thus.we  can  take  six  "entries".  The 
processes  of  the  available  solutions  arc, then,  explicitely  computed 
without  any  approximation,  given  that  only  the  endogenous 
variables  are  appearing  m  an  anticipated  form  m  the  Euler  equations 
of  the  PLM  model.  Precisely, we  choose  the  "entries"  so  as  the  Euler 
equations  can  be  easily  solved.  Let  (Xj^.with  i*l,2,  ...6  and  t  the 
time  index.be  the  chosen  forms  of  "entries".  Denote  by  x  the 
logarithm  of  X.  Assume,now,that  the  distribution  of  the  stochastic 

vector  xt»(xi,i . x$  t)'  is  an  autoregression  of  a  given  order 

pjiatural  integer,  so  as  the  following  formula  holds. 

xpCW+CO)  Xt.i+C(2)  xt.2+.  ..+C(p)xt  p+ut  (1) 

with.C(0)  a  (6*1)  vector  of  constants  and  C(k>  (k=l,2,...p)  (6*6) 
matrices  of  constants.  Following  Lee.we  assume  that  the  noises 
(ut)t  are  not  serially  correlated  and  that  ui  is  normally  distributed 
N(m,o),  for  each  t,  with  m*(nn,m2,.  ,m$)'  a  (6*1)  vector  and 
<j*(o,j)  the  (6*6)  vanance-covariance  matrix. 
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In  order  to  generate  dynamics  as  realistic  as  possible, the  coefficients 
C(k),k=l  to  p.m  and  c  are  estimated  on  a  real  data.  Given  these 
estimated  coefficients,we  can.at  the  beginning.compute'  the  Values 
of  the  processes  (XijOu.as  follows: 

p  6  - 

log(X;,)=C,(0>f  Z  £  Cjj(s)  log(X|1t.J)+ur.t 

S»lj»I 

from  (l)/or  each  i.t. 

Thus,the  values  of  the  processes  (Xj>t)i,i  can  be  calculated: 

P  6 

xi.l«exp(c,(0)) n  n<MV»«pw 

$■1  j*i  • 

Given  the  p  initial  values  (X|,X2, . ,xp)  and  generated  random 

values  from  the  law  of  u{,the  paths  of  each  vanable  are  easily 
computed.  Moreover, the  normality  assumption,  associated  with  a 
good  choice  of  the  auxiliary  variables  X,>t,  simplifies  greatly  the 
computations  involved  by  the  Euler  equations;  in  effect,  taking  two 
reals  a,b,  we  have  the  following; 

Ei(Xi,,+i)a(Xk,ul>b-«P(a)n<xj.i+l-s>^<s'i)'e’tPW 

sj 

with  for  $»1  top  and  ja!  to  6: 

a*a  Q(0)+b  Ck(0),  M$4)«a  C,j(s)+b  C^s) 

and  $«a  mi+b  mk+a^  o  fi  /2+b^  Ckk  /2+a  b  Cik  > 

Ei(.  )  is  the  expectation  operator  conditionally  to  the  information 
set  available  at  the  date  t.  These  formulas  allow  us  to  conduct  the 
main  computations  involved  by  Euler  equations. 


B.  Markov  process  approximation.  If  the  exogenous 
variables  appear  in  an  anticipated  form  under  the  operator  Et(.)  or  if 
we  want  to  relax  the  normality  assumption,  we  have, then,  to 
develop  an  other  framework.  That  is  Tauchcn's  methodology  which 
we  have  introduced  in  the  context  of  this  study.  Unfortunately,  this 
.method,  used  as  a  backward-mapping  one,  is  too  numerically 
expensive  if  conducted  on  models  including  more  than  three  or  four 
exogenous  variables, as  it  can  be  seen  further.  For  a  purpose  of 
simplification,  let 's  consider  a  model  with  n  exogenous  variables 
(no);  (Xj.t),.t  arc  the  auxiliary  endogenous  variables  chosen  for 
backward-mapping  (then,i*l  to  n).  With  the  same  notations  as  in 
CM, assume  that  the  vector  Xt  satisfies  the  property  (1)  with  p*I 
but  without  the  normality  of  ut.  We  have,  now,to  approximate  the 
AR(1)  process  by  a  Markov  one  with  a  finite  number  of  states.  Four 
steps  must  be  achieved: 

a)  We  transform  the  relation  (1)  to  obtain  an  AR(l)  process  without 
intercept  and  with  a  diagonal  vanance-covanance  matnx  for  the 
noises, which  we  note  (X\0  arc  the  transformed  "entries". 

b)  The  discrete-vaiucd  Markov  process.denoted  by  (X^a), 
approximating  the  AR(1)  process,  is  then  computed.  Assume  that 
each  variable  X'j.u  has  ki  va!ue$,given  by: 

(k)«Di  (k-(kj+l)/2>  ,i«l  to  n,  k«l  to  k| 
where:  Dj*2  r  (Oj  /  (k,-l);  r-3  was  found  to  work  well. 
Consequently,  the  vector  x\  has  N  states,  N*»ki  k2...  kn. 

c)  The  transition  matrix  Fl»(jt($, $')),$  and  s*-l  to  N,  is  then 
calculated,  using  the  estimated  coefficients  of  the  transformed  AR(1) 
process. 

d)  Finally, v/e  reverse  the  transformation  and  return  to  the  initial 
relation  (1):  the  values  of  the  Markov  process  are  modified,  but  not 
the  matrix  H 


By  the  Skorohod  construction,  the  discrete-valued  Markov  process 
converges  almost*  surely  to  the  AR(1)  considered.  Suppose, 
now, that- we  have  an  Euler  equation  of  the  general  form:  Z\= 
Et(g(X{+i  a+i)),  where  z^isa  process  to  be  computed  and  g  a  given 
function  .  Put  the  previous  relation  in  an  integral  foim: 

z(x)»J  g(x",z"(x))  dF(x"/x)  where  F( . )  is  the  conditional 
cumulative  probability  'distribution  function  of  the  process  xt. 
Usc.now,  the '  approximation’’  X(,<j  of  xa  the  integral  can  be 
discretised  as  follows:’ 

N 

Ms)a  X  g(xt.d(s*),2i/j(s') )  for  $=1  to  N 
s^l 

The  previous  system  has  N  equations  and  N  unknown  variables 
Zt/risM3*!  to  N;  it's  generally  easily  solved.  The  main  difficulty  of, 
this  method  is  that  N  is  likely  to  be  large  .provoking  high 
computational  costs,  that  is  why  we  have  suggested  that  n  must  be 
small.  To  give  ah  example  of  such  a  framework, we  have  considered 
the  production  block  of  PLM  and  we  have  transformed  it  so  as  it 
remains  three  exogenous  variables  (among  five  initially).  Thus,  n««3 
and  choosing  ki«k2*»k3«8,  we  have  N*512.The  block  solved 
includes  two  Euler  equations,  one  in  a  quadratic  form  and  the  other 
is  linear. 

IV.  CONCLUSION: 

Regarding  to  the  macroeconomic  models  curvature,  Laffargue’s 
method  is  likely  to  be  numerically  efficient:  that’s  because  we  expect 
a  small  number  of  iterations  of  Newton-Raphson  type.  Using  a 
formal  derivator,  the  method  is  very  tractable.  In  the  other  hand,  if 
the  model  in  its  sniul  form  does  not  ensure  the  unicity  solution 
condition,  it  is  generally  possible  to  transform  it,  by  simple 
operations,  to  get  the  required  form, 

Backward-mapping  methods  arc,  also,  interesting  for  a  purpose  of 
stochastic  simulation  or  estimation.  Tauchcn’s  method  is  found  to 
be  feasible  only  if  the  model  to  solve  includes  a  small  number  of 
exogenous  variables  but  even  in  this  case,  its  computational  cost  is 
important.  By  the  use  of  the  normality  assumption,  the  framework 
is  numerically  simplified;  however,  when  simplifying  the 
computations  by  a  convenient  choice  of  auxiliary  variables,  wc  may 
not  obtain  certain  variables  in  their  structural  form. 
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Abstract 


7 

t»S 


(1) 


In  this  paper  we  describe  the  algorithm  OPTCON  which,  has  been 
developed  for  the  optimal  control  of  nonlinear  stochastic  models.  It 
can  be  applied  to  obtain  approximate  numerical  solutions  of  control 
problems  where  the  objective  function  is  quadratic  and  the  dynamic 
system  is  nonlinear.  In  addition  to  the  usual  additive  uncertainty,  some 
or  all  of  the  parameters  of  the  model  may  be  stochastic  variables.  The 
optimal  values  of  the  control  variables  aire  computed  in  an  iterative 
fashion.  First,  the  time-invariant  nonlinear  system  is  linearized  around 
a  reference  path  and  approximated  by  a  time-varying  linear  system. 
Second,  this  new  problem  is  solved  by  applying  Bellman's  principle 
of  optimality.  The  resulting  feedback  equations  are  used  to  project 
expected  optimal  state  and  control  variables.  'These  projections  then 
serve  as  a  new  reference  path,  and  the  two, steps  are  repeated  until 
convergence  is  reached.  The  algorithm  has  been  implemented  in  the 
statistical  programming  system  GAUSS. 


with 


£«{x«»ui)  = 


(  1 

Vui-uJ 

’  Vu«“u«/ 

(2) 


X{  denotes  an  n-dimensional  vector  of  state  variables,  ut  denotes  an 
m-dimcnsional  vector  of  control  variables.  The  n-dimensionai  vector 
Xj  and  the  m  dimensional  vector  u,  denote  the  given  “ideal”  levels  of 
the  state  and  control  variables,  respectively.  5  denotes  the  initial  and 
T  the  terminal  period  of  the  finite  planning  horizon. 

The  matrix  W,  is  defined  as 


W|  a 


fWf* 

VW“ 


W?“\ 

Wf*/  • 


1  eS,...,3\ 


(3) 


where  W”,  Wf",  W“*,  and  W,“u  are  (n  x  n),  (n  X  m),  (m  x  n),  and 
(m  X  m)  matrices,  respectively.  Furthermore,  we  require  W,  to  be 
constant  apart  from  a  constant  discount  rate  a: 


1  Introduction 


W.aa'-'W,  tn$,...,T,  (4) 


Many  economic  problems  can  be  viewed  as  involving  the  optimization 
of  an  intertemporal  objective  function  by  a  decision-maker  who  is  con¬ 
strained  by  a  dynamic  system  subject  to  various  kinds  of  uncertainties. 
Such  problems  arise  both  on  the  level  of  the  individual  firm  as  well  as 
on  the  level  of  policy-making  for  a  national  economy.  Stochastic  op¬ 
timum  control  theory  has  proved  to  provide  a  powerful  methodology 
to  deal  with  such  problems.  Stochastic  optimum  control  problems  are 
usually  rather  complex,  thus  for  most  models  only  numerical  solutions 
can  be  obtained  for  particular  values  of  the  parameters.  Even  then, 
in  most  case#  only  approximations  to  the  true  optimum  solution  can 
be  found  at  present.  Thus,  there  is  a  need  for  further  algorithmic 
developments.  In  the  present  paper  we  report  on  a  new  algorithm 
for  the  optimum  control  of  nonlinear  dynamic  models  that  allows  for 
additive  uncertainty  as  well  as  for  the  presence  of  a  stochastic  pa¬ 
rameter  vector  in  the  system  equations,  to  be  called  OPTCON.  In  its 
present  version,  OPTCON  is  limited  by  two  simplifications  which  pre¬ 
vent  the  solutions  ol  tamed  to  be  truly  optimal.  Fust,  computation*  of 
approximately  optimal  policies  are  obtained  by  applying  repeated  lin¬ 
earizations  to  the  given  nonlinear  economic  model.  Second,  we  exclude 
any  learning  about  the  system  parameters.  A  GAUSS  implementation 
of  the  algorithm  exists  which  can  be  obtained  at  request  from  the 
author  mentioned  first.  More  details  on  the  algorithm  and  on  some 
applications  are  given  in  [6J. 

2  The  Optimum  Control  Problem 

OPTCON  can  deliver  approximate  solutions  to  stochastic  optimum 
control  problems  with  a  quadratic  objective  function  and  a  nonlinear 
multivariable  dynamic  model  in  discrete  time  under  additive  and  pa¬ 
rameter  uncertainties.  Thus,  we  consider  an  intertemporal  objective 
function  which  is  additive  m  time  and  can  be  written  as 


where  W  is  a  constant  matrix.  Without  loss  of  generality  it  is  assumed 
that  W  is  symmetric,  which  entails  that 

W«s[WH)'.  (5) 

Defining 

(:;)  -  -•(*)■  »> 

•*  -  {(»■-(»• 

(2)  can  equivalently  be  written  as 

= f (»;)*w'(*) + (*)'  (w-) + <  m 

The  dynamic  system  is  assumed  tu  be  given  by  the  »y»tem  ui  noniineai 
difference  equations 

+  t  =  S,  ,T,  (9) 

where  0  denotes  a  jv dimensional  vector  of  unknown  parameters,  zt 
denotes  an  /-dimensional  vector  of  non  controlled  exogenous  variables, 
and  <i  is  an  n-dimensional  vector  of  additive  disturbances,  0  and 
<i,  ( t=  5,...  ,T,  are  assumed  to  be  independent  random  vectors  with 
known  expectations  {0  for  0,  0„  for  c,,  f  =  and  covariance 

matrices  (ifi®  for  0,  See  for  et>  t  =  T).  f  is  a  vector- valued 

function. 


*  Fiauo*l  support  from  ike  'Fond*  vu  Fordtrvoj  d«  *a$s«»sck*fthctta 
ForsckiDs”  (K»aich  project  no.  J?  554$)  is  srnUfuHy  acknowledged 
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|  3  Overview  of  OETCON 


j  J  Input  of  the  Algorithm 
1 5  system  function 

j  j  initial  values  of  state  variables  £5-1  »  xJ.j 

|  \  tentative  path  of  control  variables 
|  path  of  non-controlled  exogenous  variables  (zt)JMs 
|  expected  values  of  system  parameters  0 

|  covariance  matrix  of  system  parameters 

j  covariance  matrix  of  system  noise  S£C 

j  weighting  matrices  of  objective  function  W'*, 

j  discount  rate  of  objective  function  a 

“ideal”  path  for  state  variables  (tyTmS 

“ideal”  path  for  control  variables  (^t)JaS 

Output  of  the  Algorithm 

expected  optimal  path  of  state  variables  (x? )Jm$ 

expected  optimal  path  of  control  variables  (u *t)JaS 
expected  optimal  welfare  loss 


Description  of  the  Algorithm 

1,  Compute  a  tentative  state  path:  Use  the  Gauss-Sridel  algorithm, 

the  tentative  policy  path  and  the  system  equation 

to  calculate  the  tentative  state  path  (xj )Jm$  according  to 

x«a  +  (10) 

2,  Nonlinearity  loop:  Repeat  the  steps  (a)  to  (e)  until  convergence  is 
reached  (i  c,,  until  the  optimal  control  and  state  variables  calculated 
do  not  change  more  than  a  prespecified  small  number  from  one  it* 
eration  to  the  next)  or  the  number  of  iterations  is  larger  than  a 
prespecified  number-  This  procedure  extends  the  one  developed  in 
(>!• 

(a)  Initialization  for  backward  recursion. 


Hr*l  «  0„**.  (11) 

h’r*.  =  0.,  (12) 

>‘Ui  =  0,  (13) 

1  =  0,  (14) 

AJ+,  =  0.  (15) 


(b)  Backward  recursion: 

}  Repeat  the  follow ing ste^s  j .  to xx, for  t  =  T, 

1.  Compute  the  expected  values  of  the  parameters  of  the  ha* 
\  earized  system  equation 

Xj  «  A|X|.t  +  C|  +  1  =  (161 

with 


A, 

=  (i„-Fx,r'FXl.„ 

(ID 

j  D, 

=  (i.-F„)-*r.,. 

(18) 

c< 

=  X,  -  AtX(.(  -  BtU(, 

(19) 

;  (, 

=  (i.-Fx.rv, 

(20) 

j  & 

=  «*s- 1  ({,.<,)  = 

] 

-  (I„-Fx,)-'£«  [(!„  -  F*,)-*]' 

.  (21) 

where  I*  denotes  the  ^identity  matrix;  Fx,.^*,,  FUo  and 
Fp  are  matrices  of  partial  derivatives  of  f  with  respect  to 
x*_j,  xt,  ut,  and  0,  respectively.  In  (17)-(21),  all  derivatives 
are  evaluated  at  the  reference  values  Xj.j,  x(,  ut>  0,  2,,  and 
2*  =  0n< 


u.The  matrices  A{,Bt  and  the  vector  < ,  disfunctions  of  the 
random  parameter  vector  0  and  are,  therefore,  random  them¬ 
selves.  Both  matrices  can  be  written  as  collections  of  their 
column  vectors; 

A«=(at,i  ...  a,.n),  t=S>..fT,  (22) 

B,  =  (bM  ...  b^),  (23) 

Each  of  these  column  vectors  as  well  as  ct  are  functions  of  0. 
These  functions  are  approximated  by  linear  functions. 


=  D*"0 

t  a  1,... 

t  ~  5,..v 

,n 

,T, 

(24) 

bij 

=  T>hlJ0 

i  =  1,-.* 
-t  =  s,.,. 

,m 

,r, 

(25) 

c< 

-  DC|0 

(26) 

where  D*M,  Db,J,  and  Dc‘  denote  matrices  of  partial  deriva¬ 
tives  of  the  elements  of  (22),  (23),  and  c<,  respectively,  with 
respect  to  0. 

We  reshape  these  matrices  as 

DAl  B  |vtc((D*'.*)') . v«((D- -)')),  (27) 

Db'  a  vec((Db">)')J,(28) 

df'  s  (vcc((Dc')')j.  (29) 

iii.  Compute  the  derivatives  of  the  parameters  of  the  linearized 
system  with  respect  to  0: 

Da'  =  ((I,  -  F,,)-1  ®Ip)  [r^oA.  +  F^  o]  ,(30) 
=  (<I.-FX, )-'«!,)  +  FUj  0] ,  (31) 

d"  =  vec  [((!„-  F,,)->Fe)']- 

-  Da'x,.,  -  l)B'ii„  (32) 

where  FXj-J  q,  TX(q,  and  FUf  q  denote  matrices  of  second- 
order  partial  derivatives  of  f,  introduced  in  a  way  analogous 
to  (3),  and  all  derivatives  are  evaluated  at  the  same  reference 
valutas  above.  _ _ _  .  ^ 

iv.  Compute  the  influence  of  the  stochastic  parameters;  Coin* 
pute  all  the  matrices  YtAK\  YtBK\  rBKBy  vAK\  vfK*% 
and  t£*c,  the  cells  of  which  are  defined  by 

(rA^),.j.=(t(K,D*'Jrw(D*'<)'] )l\\Z'X  <33> 

=  tr  [KiD*'3 E88  (dB‘ *),|  j'"’’"  (34) 

|F,flK8).J=tr  [k^^o  (d*-)']  • = };;;;;^(35) 

(«,'lK')i=tt  [k,Dc'.£W'(D*«)']  i  =  1 . «,  (36) 

|l.,BK')i=lt[K1Dc'2;e(,(Db''‘yj  i=  1 . m,  (37) 

|ti{,r')=tr[K,Dc,.S<W<Dc,y].  (38) 

v.  Convert  the  objective  function  from  “quadratic-tracking”  to 


“general  quadratic”  format; 

W"  =  o'-’W", 

(39) 

\V,“  =  a'-'W“, 

(49) 

wy*  =  o'-'w-, 

(41) 

w*  S  -W”x,  -  W*“ut, 

(42) 

w“  =  -W^xj  -  Wj*wU{, 

(43) 

»;  =  ixJWf'it,  +  uJW“x,  +  ^uiw^ui.  (44) 
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vi.  The  key  idea  of  our  algorithm  OPTCON  js  to  use  Bellman’s 
principle  of  optimality: 

=  miaE,.,  (i,(x,,u,)4-j;ti(x,)),  (45) 

where  denotes  the  loss  which  is  expected  at  the  end 

of  period  t  - 1  for  the  remaining  periods  t , . . .  ,T  if  t  he  op  timal 
policy  is  implemented  during  these  periods.  Ej.j  (.)  denotes 
conditional  expectation;  Xjt.j,  k  =  and  ii*.*,  k  = 

5  -f  arc  known  at  the  time  when  we  have  to  decide 
about  U{.  It  can  be  shown  that  can  be  expressed  as 

a_  quadratic,  function  of  Xj.it 

JftXl.  l)  =  |xI.iH,x,.i  +  x;.,hf  +  Af  +  hi  +  Af  (46) 

for  all  periods  1  a  5,...,T+  1,  where  II,,  hf, Af,  hf,  and  Af 
are  defined  below.  Here  we  introduce  the  following  simplify¬ 
ing  assumptions: 

A.  Each  occurence  of  Ej-i  (.)  is  substituted  by  E$„i  (.),  and 
each  occurence  of  coV|«j(..)  is  substituted  by  covs.i(«,J 
forallt  =s5+l»...»T+l«  Thus,  we  rule  out  any  learning 
about  the  parameters  of  the  model, 

B  Although  At,  Bj  and  c{  axe,  m  general,  nonlinear  func¬ 
tions  of  0,  we  will  compute  their  expected  values  by  eval¬ 
uating  the  equations  (17),  (18),  and  (19)  at  the  reference 
values  Xj-i,  Xi-  u i,  (0),  z<,  and  ct  =*  0„,  which  were 

true  only  in  case  of  linear  functions. 

vu.  Compute  the  parameters  of  the  function  of  expected  accumu¬ 


lated  loss: 

Kt  =  Wf*  +  Hf+i,  (47) 

fcf  a  w*  +  hfn,  (48) 

A"  =  r/KJ  + a;K,A,,  (49) 

a;-  a  {A?)',  (50) 

A"  =«  r“+B;KlA,4tt"A„  (51) 

Af  -  ri8KS  +  B;K,B1  +  2BiW1'“  +  WJ“>  (52) 

Af  a  v,',k'  +  a;K,c,  +  A(kf,  (53) 

Af  a  vf^'+BlK.c  +  Bikf  +  Wf'c+w?,  (54) 

a;  a  itr(K;£«j+AJtl,  (55) 

Af  a  !«<*•  +  »?«.  (55) 

Af  =  jufKiCi  +  cfkf +  »f +Af„.  (57) 

viil  Compute  the  parameters  of  the  policy  feedback  rule- 

G,  a  -  (Af  )"'  Af ,  (58) 

*>  =  -  (Af  )-'Af.  (59) 

ix.  Compute  the  parameters  of  the  function  of  minimal  expected 
accumulated  loss 

h,  =  A«-Ar(*r)".ir>  («°) 

h(  =  Af-Ar^jT'A?.  (61) 

5!  =  *(-f(Af)‘(Af)-‘Af,  (62) 

a;  =  a;,  (63) 

Af  =  Af.  (64) 

(c)  Forward  projection; 

Repeat  the  fiJovung  steps  i.  and  iu  for  1  =  5 . X, 

i  Compute  the  expected  optimal  policy; 

UJ  =t  GjX?_|  +  g*.  ($5) 


n.  Compute  the  expected  optimal  state.  Use  the  Gauss-Seidel 
algorithm  to  compute  xf*  such  that 

x;  =  f  (x;_,,x;,u;,o,z,) .  (66) 

(d)  Set  the  nevf  tentative  paths  for  the  next  -teratum: 

(*t'Ls  =  W)?is,  (67) 

(“<£«  =  ("f)L.  (58) 

(e)  Compute  the  expected  welfare  Toss, 

Js  “  4-  4-  hes~x  +  4-  (69) 


In  order  to  illustrate  the  feasibility  of  the  algorithm  OPTCON  and 
its  GAUSS  implementation,  we  have  applied  it  to  a  macroeconomic 
policy  problem,  the  controls  are  fiscal  and  monetary  policy  variables, 
the  states  are  macroeconomic  target  variables,  and  the  objective  func¬ 
tion  expresses  a  hypothetical  policy-maker’s  preferences.  We  have  used 
two  small  econometric  models  of  the  Austrian  economy  with  eight  (six) 
state  variables,  three  control  variables  and  three  (two)  non-controlled 
exogeneous  variables.  Different  control  experiments  were  performed, 
where  the  parameters  of  the  model  regarded  first  as  known  with 
certainty,  and  then  some  stochastic  parameters  were  tentatively  in¬ 
troduced.  Finally,  the  stochastic  nature  of  all  the  parameters  of  the 
model  (the  coefficients  and  the  constants)  was  taken  into  account  by 
assuming  the  estimated  values  of  the  parameters  to  be  their  expected 
values  and  the  covariance  matrix  of  the  coefficients  of  the  model,  which 
is  delivered  as  an  output  of  simultaneous  estimations,  to  be  the  covari¬ 
ance  matrix  of  the  parameters.  These  optimum  control  experiments 
were  carried  out  on  an  IBM  compatible  12  MHz  PC-AT  with  an  80287 
mathematical  cv,  toccssor.  The  running  time  of  the  GAUSS  program 
of  OPTCON  ranged  from  5  nun.  5  sec.  to  24  min.  37  sec.  The  results 
of  the  experiments  show  the  influence  of  parameter  uncertainty  on  the 
results  of  optimal  policies  fox  the  two  models  considered. 

Several  directions  of  further  research  may  be  suggested.  More  opti¬ 
mization  experiments  axe  required  in  order  to  study  the  results  under 
a  greater  variety  of  stochastic  parameter  patterns  and  different  eco¬ 
nomic  models.  For  the  algorithm  itself,  there  exist  several  possible 
extensions.  By  adding  updating  equations  for  the  stochastic  parame¬ 
ters  such  as  the  ones  used  in  (4),  it  could  be  expanded  into  a  passive- 
learning  algorithm  in  the  sense  of  (2|.  Another  interesting  extension 
to  be  considered  in  the  future  will  be  the  examination  of  the  effects  of 
decentralized  policy-making,  here  results  of  decentralized  control  the¬ 
ory  (dynamic  team  theory)  and  dynamic  game  theory  will  have  to  be 
incorporated. 


(l!  Gregory  C,  Chow,  Econometric  Analysis  by  Control  Methods  (Wi¬ 
ley,  New  York  19SI). 
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tion  to  an  adaptive  linear  decision  problem,  Annals  of  Statistics, 
2(1974)  337, 

(4)  Elisabeth  Chase  MacRae,  An  adaptive  learning  rule  for  multi¬ 
period  decision  problems,  Economotrica  43(1975)  $93. 

(5)  Josef  Matulka  and  Reinhard  Neck,  OPTCON.  An  Algorithm  for 
the  optimal  control  of  nonlinear  stochastic  models,  Research  Re¬ 
port  9007,  Ludwig  Boltzmann  Institut  fur  okonomische  Analysen 
wirtschaftspohtischer  Aktivitaten,  Vienna  1990. 
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Abstract  -  The  purpose  of  this  paper  is 
to  analyze  the  optimal  coordination  aneng 
interdependent  econoaies  with'  an  indepen¬ 
dently  acting  coordinator,  using  aspects 
of  hierarchical  control  theory  in  a 
linear  (i zed) -quadratic  framework. 

I-INTRODUCTIOM 

Once  the  1992  unification  of  Europe  is 
achieved,  the  Single  Market  will  reduce 
the  roon  for  independent  policies  as  it 
night  enlarge  the  existing  icbalances 
anong  sectors,  regions  or,  also,  coun¬ 
tries.  To  prevent  this  situation,  the 
European  Commission,  or  the  Board  of 
Price  Ministers,  will,  have  to  act  as  a 
coordinator  of  nore  balanced  econoaic 
policies. 

II-  THEORETICAL  FRAMEWORK 

In  order  to  analyze  this  relationship 
under  the  assumption  that  Hierarchical 
control  Theory  applies  in  this  macro¬ 
economic  framework,  we  have  built  an 
annual  interconnected  econometric  codel, 
based  on  the  COMET-V  model  (1988)  for  the 
Coaaon  Market.  The  countries  under  study 
are  the  EC  countries  less  Luxemburg,  and 
an  aggregation  of  these  countries  (EC) . 

Trying  to  capture  the  interrelations 
that  arc  taking  place  whithin  this 
system,  we  can  write  its  reduced-form  as: 

Y,(t)  -  A,  Yj  (t-1)  +  A,0  Y0(t-1)  + 

+  B|  Uj  ( t)  +  D,  d,(t)  +  nt(t)  (1) 

where: 

Y,(t):  an  n,-dimcnsional  vector 

containing  the  endogenous  (target) 
variables  of  the  i-th  country, 
representing  the  objectives  for  the  H 
subsystems  (1-1,2,.., H)  and  the 
coordinator  (i-0);  A,  (i-0,1, . . .  ,M)  is  a 
matrix  of  the  system  parameters  belonging 
to  the  lagged  endogenous  variables,  and 
Ajq  is  the  matrix  of  parameters  belonging 
to  the  coordinator's  target  vector, 
denoting  the  effects  of  tho  coordinator's 
lagged  endogenous  variables  on  the  i-th 
subsystem. 

u,(t):  a  p,-dimensional  vector  of  the 
exogenous  (control)  variables  which  are 
subjected  to  control  by  tho  i-th  policy 
maker  and  B,  is  the  matrix  of  coeffi¬ 
cients  of  these  control  variables. 
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d.(t) :  a  1, -dimensional  vector  (called 
data -vector)  including  the  current  and 
lagged  non-control lable  exogenous  varia¬ 
bles  of  the  i-th  subsystem,  and  D,  is  the 
matrix  of  coefficients  of  these  pure 
exogenous  data'. 

jji  (t) :  an  n, -dimensional  residual  vector, 
assuming  that 

E(*2,(t))-0  and  E(rjf(t)tj 
where  6fJ  and  Sta  are  Kronecker  deltas. 

The  Hierarchical  Control  Problem 
consists  of  the  minimization  of  each 
subsystem's  and  coordinator's  objective 
function,  subjected  to  an  econometric 
model.  These  objective  functions  are 
expressed  by  similar  quadratic  functions, 
which  are  the  expected  weighted  sums  of 
the  squared  deviations  between  actual 
targets  and  the /desired  targets,  and  the 
actual  controls  and  the  desired  control 
variables.  We  assume  that  the  subsystems 
would  adopt  a  myopic  behaviour,  i.e., 
they- would  decide  their  policy  period  by 
period  since  they  don't  know  the  target 
variables  of  the  coordinator  and  the 
other  subsystems  in  advanced.  The  same 
holds  for  the  coordinator. 

Jf(t)  -  Min  E  (I  Y,  (t)  -  Y*|(t)  |*  0  (t)  + 

o.(i)  ' 

lu,(t)  -  U|(t)l*Ri(t)  )  (2) 

where  Y#,  and  u",,  i-0,1,..., N,  are  the 
desired  target  and  desired  control  varia¬ 
bles,  respectively.  The  weighting 
matrices  Q.(t)  (i-0,1, . . . ,H)  a: e  assumed 
to  be  positive  seni-do*inite  and  R,(t) 
(i-0,1, are  assumed  to  bo  positive 
definite. 

According  to  tho  Certainty  Equivalence 
Principle,  the  optimal  solutions  will  be 
the  same  i:»  a  stochastic  optimal  control 
problem  with  additive  error  tern  as  in  a 
deterministic  optimal  control  problem. 
Taking  the  first  derivatives  of  these 
objective  functions  with  respect  to  the 
control  variables,  and  equating  these  to 
zero,  we  obtain  the  control  solutions. 

III-APPLICATION  ON  A  EUROPEAN  MODEL 

The  codel  used  is  a  linear(-ized) 
econometric  model,  with  about  8  behaviou¬ 
ral  equations  per  country.  These 
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creations  refer  to  private  expenditures 
(consumption  and  investment) ,  foreign 
trade  (total  exports  and  imports),  price 
indices  (gross  domestic  product  deflator 
and  prices  of  total  exports) , 
unemployment  rate  and  money  stock.  A 
definitional  equation  for  trade  balance 
vas  also  introduced  in  the  model.  The 
interdependency  among  countries  is 
censured  by  means  of  the  quantity  and 
price  transmission  in  the  international 
trade  sector  and  by  considering, 
explicitly,  tte  lagged  coordinator’s 
values  for  unemployment  rate,  trade 
balance,  gross  domestic  product  and  its 
deflator,  in  the  countries’  equations. 
The  sain  control  variables  used  in  our 
model  are  the  effective  direct  tax  rates 
on  household  and  company  income,  the 
indirect  tax  rate,  ^  and  the  rate  of  social 
security  contribution  _of  households, 
other  important  control  'variables  are 
nonetary  policy  variables  as  the  long  or 
short  tern  interest  rate  and  the  exchange 
rate  with  respect  to  the  dollar  (annual 
average) ,  and  the  governmental  subsidies, 
wages  and  employment.  The  model  was 
estimated  on  a  yearly  basis  for  the 
sample  period  1960-1986,  and  was  applied 
to  analyze  the  optimal  coordination  for 
the  period  1987-1592. 

In  our  experiment,  the  wheighting  matri¬ 
ces  were  considered  constant  and 
diagonal-  The  Q,-values  concerning  the 
CDP-deflator,  unemployment  rate,  money 
stock  and  the  deflator  of  total  exports, 
were  set  equal  to  2  and  the  other  Q, 
values  were  considered  equal  to  1.  The  R,- 
values  were  set  as  an  identity  matrix. 
The  optimal  control  -policies  can  be 
divided  into  different  categories, 
depending  on  the  direction  of  the 
information  trans-nission.  We  concentrate 
our  analysis  on  four  type  of 
informational  exchange  being  defined  as: 

-  pure  Top-Down  Policy:  the  desired 
targets  and  desired  controls  determined 
by  the  coordinator,  are  transmitted  into 
the  subsystems’  optimal  policies. 

-  Pure  Bottoa-Up  Policy:  each  subsystem 
determines  its  control  policy,  and 
transmits  its  information  of  targets  to 
the  coordinator,  who  determines  his 
optimal  control  policy  given  the 
subsystems'  targets. 

-  Mixed  Top-Down  Policy:  the  coordinator 
determines  the  overall  target  variables, 
but  imposes  no  restrictions  on  the  values 
of  the  subsystems'  control  variables. 

-  Mixed  Bottom-Dp  Policy:  the 

coordinator  receives  the  information  of 
targets  from  the  subsystems,  and 
transmits  his  desired  control  variables 
to  the  subsystems. 

We  compared  the  optimal  targets  and  the 
optimal  control  variables  with  the 
observed  values,  by  means  of  the  Theil's 
inequality  coefficient.  This  coefficient 
will  measure  the  distance  between  the 
values  obtained  by  the  application  of  the 
hierarchical  control  solution  and  the 
observations,  for  which  the  OECD  data 
(and  its  predictions  until  1992)  were 
taken. 


The  sain  conclusion  is  that  there,  is  not 
3  significant  difference  neither  between 
the  Top-Down  and  the  Mixed  Top-Down  poli¬ 
cies,  nor  between  the  Bottom-Up  and  the 
Mixed  Bottom-Up  policies,  so  we  say  say 
that  it  is  very,  isportant  who  determines 
the  initial  targets.  Comparing  among 
policies,  and  considering  all  the 
equations  together,  the  Top-Down  policy 
seems  to  be  the  sore  accurate  for 
Germany,  France,  The  Netherlands, 
Belgium,  United  Kingdom,'  Ireland,  Denmark 
and  Portugal.  For  the  rest  of  the 
countries  the  Theil's  values  were 
similar. 

For  EC,  Germany,  France,  The  Netherlands 
and  Belgium,  the  Theil's  coefficients  for 
the  target  variables  were  rather  small, 
which  can  imply  that  the  observed- policy 
vas  close  to  the  optimal «  The  rest  of  the 
countries  present  larger  values,  which 
implies  that  the  observed  policies  are 
not  considered  to  be  the  optimal. 
Analyzing  the  Theil's  coefficient  for 
the  control  variables,  the  larger  values 
were  observed  for  the  interest  rate  which 
may  imply  that  a  strong  monetary  policy 
is  necessary.  The  government  wages  shows 
large  values  for  Italy,  The  Netherlands, 
Ireland,  Denmark  and  Portugal.  For 
Irland,  Denmark,  Greece  and  Spain  large 
values  were  also  observed  for  the 
subsidies  and  the  people  employed  by  the 
government. 

IV  -  CONCLUSION 

This  paper  deals  with  the  application  of 
hierarchical  control  theory  on  a  simulta¬ 
neous  econometric  model  for  the  European 
Community.  By  the  analysis  of  the  control 
solution  we  nay  conclude  that  for  most  of 
the  countries  a  controlled  policy  seems 
to  be  near  to  the  observed  policy. 
However,  analyzing  equation  by  equation, 
it  is  difficult  to  determine  which 
policy  is  "better"  for  the  system  as  a 
whole.  It  will  be  fruitful  to  improve 
this  policy  approach  by  refining  the 
econometric  model.  It  will  be  also 
important  to  make  some  experiments 
variating  the  weighting  matrices  involved 
in  the  minimization  problem,  or  setting 
different  desired  paths  for  the 
variables. 
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Abstry;*;  -  Repair  cost  limit  caintftflaace  po- 
llcicj-are  characterized  as  follows  s'  khen  a 
system  failure  occurs  the  necessary  repair 
cost  is  estimated-  If  the  estimated  cost  ex¬ 
ceeds  a  given  Unit,  then  the  syst e=  is- re¬ 
placed  by  a  new  one;  otherwise  a  minimal  re¬ 
pair  is  carried  out-  Based  on  an  inhomogene¬ 
ous  Poisson  failure  process-a  general  and 
unifying  approach  to  repair  cost  limit  cai*te- 
nance  policies  is  presented.  Issportant-spe 
clal  cases  are  discussed  in  detail.  Optimal 
repair  cost  limits  are  derived. 

I.  INTRODUCTION 

During  the  past  40  years  many  mathematical 
models  for  proper  scheduling  of  maintenance 
actions  have  been  developed;  for  recent  sur¬ 
veys  see  |4|  and  |9|-  Their  principal  objec¬ 
tive  is  to  find  maintenance  policies  being 
optimal  with  respect  to  given  economic  or  re¬ 
liability  criteria-  Among  these  policies  the 
important  class  of  repair  cost  limit  replace¬ 
ment  policies  is  characterized  as  follows: 
When  a  system  failure  occurs  the  necessary 
repair  cost  is  estimated.  If  the  estimated 
cost  exceeds  a  fixed,  possibly  time-dependent 
limit  -  called  repair  cost  limit  -  then  the 
system  is  not  repaired  but" replaced  by  a  new 
one.  This  basic  model  may  be  combined  with 
age  and  block  replacement  policies  to  obtain 
models  being  rather  useful  for  practical  ap¬ 
plications.  Usually  the  long-run  average 
maintenance  cost  per  unit  time  (maintenance 
cost  rate)  serves  as  an  optimality  criterion. 


|4|.  Thus,  type  1  failure?, ^ay  be  interpreted 
as  slight -cnes,  easily  to  be  removed,  whereas 
type  2  failures -may  be  complete  system  break¬ 
downs.  A  failure  occuring  at  system  age  t  is 
with  probability  p(t)  of  type  2  and  with  pro¬ 
bability-  p(t)  *  1-p(t)  of  type  1.  All  repla¬ 
cement  and  minimal  repair  times  are  assumed 
to  be  negligible. 

This  failure  model  has  been  first  introduced 
by  Bsichelt  Ml  in  1976,  see  |3|  and  |4)  for 
icproYrd  versions.  Later  the  special  case 
p(t/*p  has  been  reconsidered  by  Fontenot  and 
Proschan  |6|.  Here  a  short  survey  on  those 
results  is  given  needed  for  tbs  purpose  of 
this  contribution. 


The  lifetime  X  of  the  system  is  assumed  to 
be  a  random  variable  with  IFR  (increasing 
failure  rate)  distribution  function  F(t), 
survival  function  ?(t)  =  1-F(t),  density 
function  f(t)  *  F’(t),  and  failure  rate 
q(t)  »  f(t)/?(t).  Let  a  cycle  be  the  tine 
between  two  neighbouring  replacements  and 
G(t)  be  the  distribution  function  of  the  ran¬ 
dom  cycle  length  Y,  5(t)  =  1-G(t).  Then 
P(t«Y*  t+  a t|Y»  t)  =  p(t)q(t)At  +  o(at) 


so  that 

G(t+4t)  -  G(t)  . 
- 7G(t)  =  p(t)q(t) 

At  ' 


o(A  t) 
At 


Letting  At  — 0, 

G'(t)/G(t)  =  p(t)q(t). 


Repair  limit  replacement  policies  have  been 
already  proposed  by  Gardent  and  Nonant  In 
1963.  Drinkwater  and  Hastings  (1967)  computed 
optical  repair  cost  limits  by  an  approxima¬ 
tion  procedure  which  successively  improves 
given  repair  cost  limits.  Under  the  same  as¬ 
sumptions  as  stated  by  Drinkwater  and 
Hastings  (piecewise  linear  failure  intensity, 
piecewise  linear  repair  cost  rate,  exponen¬ 
tially  distributed  repair  time)  Beichelt 
(1978)  derived  the  exact  average  cost  per 
unit  time  so  that  optimal  repair  cost  limits 
can  be  computed  by  any  available  optimization 
method.  Later  Park  (1983)  reconsidered  a  spe¬ 
cial  case  of  this  model.  Based  on  a  general 
failure  model,  In  this  contribution  a  unify¬ 
ing  approach  to  repair  cost  limit  models  is 
developed  allowing  arbitrary  repair  cost  di¬ 
stributions. 


Hence  p(t)q(t)  is  the  failure  rate  belonging 
to  G(t)  so  that 
t 

G(t)  *  exp(-  5  p(x)q(x)dx).  (1) 

0 


If  Xfc  denotes  the  time  of  k  th  type  1  failure 
then  it  is  well-known  that  on  condition  Y>Xk 
the  common  probability  density  function  of 
the  random  vector  (X^ ,X2» • • • yX^)  is  |4| 


,*k> 


1^0,  otherwise,  k—2. 


(2) 


Let  Z  be  the  random  number  of  type  1  failures 
within  a  cycle.  Then, 


CO 

P(Za0 )  °  S  p( c)dP(t) ,  and  for  n>1,  U3ing  (2) 
0 


II.  A  GENERAL  FAILURE  MODEL 

At  time  t»0  a  system  begins  to  work.  Two  ty¬ 
pes  of  system  failures  nay  occur: 

Type  1 :  failures  of  this  type  are  removed  by 
minimal  repairs. 

Type  2:  failures  of  this  type  removed  by  re¬ 
placements. 


t  n 

P(Z«n)  *  ^  “T  [5  p(x)q(x)dx]  p(t)dF(t). 


Hence, 

00  f 

E(Z)  -  XnP(Z»n)  =  >  «(t)dG(t)-1 ,  O) 

n»0  0 


By  definition,  a  minimal  repair  does  not  af-  t 

feet  the  actual  failure  rate  of  the  system  where  Q(t)  =  Jq(x)dx.  In  particular,  for 

0 
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p(t)  =  p, 

S(Z>  •  pfq(t)Q(t)e'pQ(t)dt-1 
0 


■  p5x  e”pxdx-1  »  ~  ,  p>0.  (4) 

0  ? 

If  cs  and  c„  denote  the  cost  of  a  minimal  re¬ 
pair  and  a  replacement,  respectively,  then 
the  long-run  average  maintenance,  cost  under 
the  failure  model  described  is 


K-ta 


n  Q(t)dG(t)-i I 

Lo. _ J 


c  +  c 


5  G(t)dt 
0 

In  particular,  for  p(t)  z  p, 

1—p 

K  » 


c  +  c 


5(P(t))pdt 

0 


(5) 


(6) 


III.  MAINTENANCE  POLICY 


Let  C  be  the  random  cost  for  removing  a  sy¬ 
stem  failure.  Then  the  maintenance  policy 
considered  here  is  the  following  one: 

On  failure  the  system  is  replaced  by  an 
equivalent  new  one  if  C  exceeds  a  given  re¬ 
pair  cost  limit  c;  otherwise  a  minimal  repair 
is  carried  out. 

If  cv  denotes  the  constant  replacement  cost 
and  R(x)  *  P(Cslx)  the  distribution  function 
of  C,  then  it  is  obvious  to  assume 

{1,  if  XiC  , 

r  and  0  <c  <c  .  (7) 

0,  if  x  <0,  r 

The  repair  cost  limit  model  can  evidently  be 
interpreted  as  a  special  case  of  the  failure 
model  discussed  in  section  II:  A  type  1  (type 
2)  failure  occurs  if  and  only  if  C<c  (C>c). 
Therefore, 

p  o  R(o),  p  a  R(c).  (8) 

In  what  follows  if  will  be  assumed  that  p 
does  not  depend  on  time,  i.e.  neither  C  nor  c 
depend  on  the  system  age  at  failure.  Hence 
(6)  can  be  applied  for  computing  the  expected 
maintenance  cost  rate  K**K(c).  But  an  impor¬ 
tant  peculiarity  of  the  repair  cost  limit  mo¬ 
del  has  to  be  taken  into  account:  the  mean 
repair  cost  ca  for  removing  a  type  1  failure 
depends  now  on  c  and,  hence,  on  p: 

co  "  [  S  R(x)dx  -  «!<•)]  .  (9) 

where  R(x)  »  1-R(x).  (Note  that  ca  is  the  ma¬ 
thematical  expectation  of  C  on  condition  that 
Ci£c.)  Thus,  (6),  (8),  and  (9)  yield  the  ex¬ 
pected  maintenance  cost  per  unit  time  (main¬ 
tenance  cost  rate) 


K(o) 


JL  C  __ 

- -  (  R(x)dx  +  c  ~  c 

a  R(c)  0  r 

T (F(t)R<o)dt 


(10) 


The  problem  consists  now  In  computing  a. repair 
cost  limit  c«c*  minimizing  (10).  It  will  be 
solved  on  condition  of  Veibull-distrlbuted  li¬ 
fetime:  F(t)  *«  exp(-at®).  In  this  case, 

K(c)  =  a1/b[r(1  +  i)]_1  K(c),  where 

K(c)  -  (RCc))171”1  |  R(x)dx+(c 

Hence  the  problem -of  minimizing  K(c)  is -equi¬ 
valent  to  minimizing  K(c).  From  dK(c)=0  there 
results  a  necessary  condition  for  the  optimal 
c  a  c*: 

.^er  .  (12) 

For  b> 1  there  is  a  unique  solution  c»c*  of 
(12).  The  corresponding  minimum  modified  main¬ 
tenance  cost  rate  is 

K(e»)  =  (er  -  e*)(R(e*))1/b  . 

Ic  -x]  s 

~ — I  ;  0^x^cr,s>0. 

Then  the  modified  maintenance  cost  rate  (11) 
is 


-c)R(c) 


(id 


K(c) 


iFft 


r  vs-s/b 


cr 


]■ 


and  the  optimal  repair  cost  limit  is 

s+i  /  b-r 


c*  “  (1 


b*s  ^cr* 
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ECONOMICALLY- OPTIMAL  PROCESS  CONTROL  USING 
SEQUENTIAL  SAMPLING  PLANS  FOR  VARIABLES 

OLGIERD  HRYNIEWICZ" 
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Abstract  -  Shewhart '  s  x-chart  has.' been  ^widely 
used-  for.  “process  control,  when  ^quality 
character 1st  res  are  norma 1 i y  distributed.  jA 
genera! -reodei  forVoptimal  .design  Jot  control 
procedures  has,  been  *  ’preposed  “in 
Hrynie'wiczC  1990) -In,  the  paper  this  model^.is 
used-  to  design  economically^ optimal  sequential 
procedures. Comparison  of  the  proposed'/  optima  1 
procedures-and  traditional  Shewhart *s  control 
charts",  is  presented. 


1 .  INTRODUCTION 

In  1924  Shewhart  Introduced  a  new  method 
•for  controlling  the  quality  of  a  production 
procesc-the  control  chart.  Methodology 
Introduced  by  him  consists  of  sampling  from-  a 
process  and  evaluating  the  samples  1  reorder  to 
find  a  signal  that  the  considered  production 
process  is  out-of-control .Whenever  this  state 
of  the  process  is  Indicated  .searching'  and 
removing  the  assignable  cause  takes  pl^ce. 
The  problem  of  the  optimal-  design  of  control 
charts. using  economic  considerations -was  first 
solyed  by  Duncan(  1956)  .Since  this  yexiy. 
Important  work  a  great  number  of  papers 
devoted  to  this  problem  have  been  published  - 
especially  to  the  problem  of  the  optimal 
design  of  x-charts.  Original  results  by 
Duncan(1956)  have  been  modified  by  von 
Collani ( 1986)  who  used  as  the  objective 
function  the  average  profiteer  item  produced 
In  the  long  run.  The  results,  of  von 
Col lani ( 1986)  have  been  generalized  in 
Hrynlewlcz( 1990)  for  a  very  wide  class  of 
control  procedures.  In  this  paper  we  apply  the 
results  of  Hrynlewlcz(  1990)  to'  develop 
economical ly  optimal  control  procedures  based 
on  a  sequential  sampling  plan  for  normally 
distributed  variables. 

2.  MATHEMATICAL  MODEL. 

We  assume^  that  the  considered  production 
process  can  be  either  In  an  acceptable  STATE  2 
characterized  by  a  mean  value  of  normally 

distributed  quality  characteristic  ,or  In  an 
unacceptable  STATE  II  characterized  by  a 
shifted  valuo  where  c  Is  a  standard 

deviation  of  the  controlled  characteristic 
The  values  of  //q,  o  and  6  are  assumed  to  bo 

known  numerical ly. We  further  assume  that  the 
duration  of  STATE  I  Is  a  random  variable 
distributed  accordingly  to  a  known  continuous 
distribution  function  F(t).We  assume  that  the 
moment  of  the  transition  from  STATE  I  to.  STATE 
II  Is  not  directly  observable  and  the 
transition  from  STATE  II  to  STATE  I  can  be 
achieved  only  by  a  special  correction  action. 
We  consider  three  types  of  possible  actions] 

a)  monitoring  of  the  process  {sampling). 

b)  Inspection  (searching  for  an  assignable 
cause) i 

c)  renewal. 

By  monitoring  we  understand  a  statistical 
procedure  which  allows  us  to  determine  the 
actual  state  of  the  considered  process  with 


probability  less 'than  one. Thus. there  are  two 
types  of  error  involved: type  I  error  o.  and 
type  II  error  p. 

Ke  consider  the  following  control 
procedure.  After  first  h  produced  Items  we 
sample  consecutive  Items  observing  their 
values  x'j  1-1,2,...  .  The  test  statistics  has 

the  following  form 

-b  +  cn  <  )  (--•  u  )  <  a  +  sn  Cl) 

1-1  '  * 

where  s-6/2.  Wo  continue  sampling  until  the 
two  sided  inequality  (1)  is  fulfilled.  When 
after  n  items  observed  the  sum  In  (1)  is 
greater  than  a+sn  we  -stop  the  process  and 
perform  an  inspection.  When  It  is  smaller  than 
b+sn  we  stop  our  sampling  procedure  and  do  not 
interfere  In  tho  production- process.  The  next 
sampling  begins  after  next  h-n  produced  items. 
The  expected  number  of  monitoring  actions 

while  the  process  operates  in  STATE  I  is  given 
by  (see  e.g.  Duncan (.1 956 ) ) 

& 

EIAjJ.  ^jROh)  «  Elt)/h-0.S  (2) 

!M 

whsre 

R(lh)-l-FUh)  (3) 

and'E^tl  Is  the  expected  duration  of  STATE  I. 
The  expected  number  of  monitoring  actions 

while  the  process  remains  in -STATE  II  is  given 
by 


Now  we  introduce  economic  parameters  in  order 
to  describe  the  economic  consequences  of  the 
actions  to  be  performed.  The  cost  of  a 
sampling  action  Is  described  by  three 
quantities! 

* 

aQ  -  fixed  cost  per  sampling, 

aj  -  unit  cost  of  sampling, 

nj  -  the  expected  number  of  sampled 

elements, where  1-1  in  the  case  that  the 
process  Is  In  STATE  I  during  the  sampling 
action,  and  i-2  In  the  case  that  it  Is  In 
STATE  II. 

Moreover  we  assume  that  the  following  costs 
are  known* 

e  -  average  cost  of  an  erroneous 

Inspection  during  STATE  I. 

r  -  average  coat  of  searching  for  an 
existing  assignable  cause  and  the  following 
renewal , 

Oj  -  average  profit  derived  from  one  unit 
producod  while  the  process  is  operating  in 
STATE  I, 

g2  -  average  profit  derived  from  one  unit 
produced  while  the  process  is  operating  in 
STATE  II. 

For  the  model  described  above  it  has  been 
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shown  In  HrynlewiczC 1388)  that  the  optimal 
design  of  a  control  procedure  for  which  the 
long  term  profit  per  unit  produced  Is  maximal 
can  be  found  by  maximization  of  the  following 
objective  function 


*  *  b-E(ATJa-EtAT]S1-EfATT)S, 

g  (!),r>«e  C - ■  ,  1  -1 - _ri  -  -1  +  g. 


2 

(5) 


(6) 


S2«  (aj  +  ajn2)/e*  (7) 

and 

b-  {(g1-g2)ECrJ-r*J/e*  >0  (8) 

It  has  been, shown  In  Hryniewlczi 1990)  that  the 
value  of  the  optimal  sampling  Interval 

approximately  equals 


.  /  2(<x+sT)  1 

h  -  E(r)  / - - -  (9) 

Y  (2A2-mb+A2(a+S1-S2) 

where  Aj-EtAj) ,  and  A2*E(Ajj1 .  The  remaining 
parameters  of  the  procedure  have  to  be  found 
by  maximization  of  (5).  When  relative  unit 
sampling  costs  at  are  not  too  small 

approximately  optimal  sampling  procedure  can 
be  found  by  minimization  of 
(Hryniewlczi 1990)): 

G*'«  (ZAj-lMotSj)  (10) 

The  approximately  optimal  values  of  parameters 
a  and  b  obtained  from  minimization  of  (10)  may 
visible  differ  from  the  values  obtained  by 
maximization  of  (5).  Especially  the 
approximately  optimal  parameter  a  is  usually 
significantly  greater  than  the  optimal  one. 
However,  the  resulting  profits  per  unit 
produced  differ  in  bothcases  only  slightly. 


3.  PROPERTIES  OF  THE  OPTIMAL  PROCEDURES. 
The  main  problem  which  has  to  be  solved  in  the 
process  of  optimization  is  the  calculation  of 
a,0,nj,  and  n2  for  the  considered  sequential 

sampling  procedure.  Unfortunately,  there  don’t 
exist  efficient  algorithms  for  exact 
calculation  of  these  parameters.  Computer 
experiments  have  revealed  that  well  known 
Wald's  approximations  cannot  be  applied.  In 
our  experiments  we  used  approximations  by 


Tallis  and  Vagolkhar( 1965) . 

a  ■  1-l(-5/2)  (11) 

P  ■  L{5/2)  (12) 

nj-  E(-6/2)  (13) 

n2-  E(5/2)  (14) 

where 

$(-0) t 1-Q(~0)-C(0) )  ,  1C. 

L  D(  »  )(l-o  (*o  )  )-t  (-0)C(0)  ( 15 

C(0)«-  (exp(-2a0)/20)l(0)  (16) 

D  (0)  —  (exp(-2b0)  /  20)  I  (0)  (17) 

1(0)— exp<02/2)  (0+03/6t05/40+..)  (18) 


E(9)«t( l-L(O) la+L(8)b  + 

o.5{,-»(-e))'I(i-t(6)i(^(ei+e4>(e))  t 

0.5{*(-0))'1L(e)!9»(-e)-^(9)l)/0  (19) 

Numerous  experiments  have  revealed  that  the 
objective  function  (£)  Is  "flat"  around  its 
maximal  value. Thus. different  approximations 
lead  to  significantly  different  parameters  of 
control  procedures.  However,  in  all  those 
cases  the  value  of  the  objective  function 
remains  practically  the  same. 

To  compare  the  optimal  sequential  procedure 
with  traditionally  designed  x-chart  consider  a 
production  process  with  the  following 
parameters: 

r  *100. ,e  *50. ,aQ*0. ,gj*l. ,g2*0. l,aj*0.5 

Moreover,  assume  that  the  duration  of  STATE  I 
is  exponentially  distributed'  with  X*0, 00005. 
In  Table  1  we  compare  the  sequential  procedure 
with  x-charts  for  2  different  values  of  shift 
6.  It  is  assumed  that  production  rate  is  such 
that  in  the  case  of  the  x-chart  samples  are 
taken  after  every  500  produced  Items. 

TABLE  i 

Comparison  of  optimal  sequential  procedures 
with  traditional  control  charts 


Parameters 

•6*0.5 

6-2. 

a 

11.07 

4.62 

b  (sen.) 

-0.6933 

-0.3564 

h 

15* 

121 

G 

0,966619 

0.986589 

n 

5 

5 

*  (X) 

3.0 

3.0 

h 

500 

500 

G 

0.585848 

0.977122 

The  results  presented  in  Table  1,  and  similar 
results  from  many  experiments  reveal 
significant  superiority  of  optimally  ^designed 
sequential  control  procedures  over  x-charts. 
The  optimal  sequential  procedures  are 
characterized  by  very  small  probabilities  of 
false  alarms  a  (usually  smaller  than  0.001) 
and  small  average  sample  sizes  in  STATE  I  (for 
moderate  and  high  unit  sampling  costs  usually 
close  to  1.), Their  superiority  is  especially 
significant  for  small  shifts  5. 
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THE  INSPECTION* ALLOCATION  PROBLEM  IN .STATISTICAL  PROCESS*  CONTROL 
"CONSIDERED  FROM  AN  ECONOMIC.  VIEWPOINT 
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Fachhochschule  Miinchen,  Fachbereich  07 
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Abstract:  Controlling  a  production  line  it  is 
important  to  decide  at  which  points  of  the 
line  the  process  is  to  be  inspected,  i.e. 
which  of  the  components  are  monitored  simul¬ 
taneously.  Optimal -control  policies  are- defi¬ 
ned  by  means  of  an, economic  objective  func¬ 
tion  which  can  be  interpreted  as  the  average 
longterm  profit  per  item  produced.  The  method 
of  dynamic  programming  turns  out  to  be  a  use¬ 
ful  tool  for  the  determination  of  such  opti¬ 
mal  control  policies.  A  numerical  example  is 
given  in  the^case  when-useveral  x-charts  are 
used  in  parallel. 


I.  THE  PROCESS  MODEL 

We  consider  a  production  line  consisting  .of  m 
distinct' and  serially  arranged  components  num¬ 
bered  l,...,m,  where  production  begins  with 
component  I  and  ends  at- component  a.  The  num¬ 
ber  of.  items  produced  per  time  unit,  ile.  the 
production  rate  y>0,  is  assumed  to  be  constant 
m  time  and  known.  Furthermore,  it  is  assumed 
that  component  j  (j-1  is  responsible 

for  the  value  of  the  quality  characteristic 
Xj,  and  vc  consider  the  case,  where  all  the 

Xj’s  are  mutually  independent  and  univariate 
normally  distributed. 

The  process  starts  in  the  in-control  State  I, 
the  state  of  satisfactory  production,  where 
the  expectations  of  the  X^'s  are  equal  to 

their  target  values  y^,  known: 

State  I:  Xj~  N(y^  ,0^);  j“1 , . . .  ,m;  >0,  known. 

The  time  t  of  production  in  State  I  is  assu¬ 
med  to  be  exponentially  distributed  with  ex¬ 
pectation  1/x>0,  known.  When  t  has  passed  a 
shock  occurs  and  the  process  enters  exactly 
one  of  m  substate  II(i)  of  the  out-of-control 
State'll;  i-1,...,m: 

State  II(i):  Xj~N(y.,oj);  J-1,. ...m;  j/i; 

Xi~  N(wi±di°i»°j)l  known. 

Letting  Cx  (i»l,...,m)  denote  the  event  of  a 

transition  from  State  1  to  State  II(i),  i.e. 
a  failure  of  component  i,  then 

Q 

q.  -  Prob(C-),  known,  with  l  q.  -  1 
1  1-1  1 
is  assumed  to  be  constant  in  time. 

Once  the  process  produces  in  State  I I(i)  we 
assume  that  it  remains  in  this  substate  until 
a  renewal  actio  is  undertaken  with  respect  to 
component  i,  after  which  the  process  starts 
anew  m  State  I.  The  time  between  two  subse¬ 
quent  renewals  is  called  a  renewal  cycle. 


II.  THE  CONTROL  MODEL 

Before  performing  a  renewal  action  State  II 
has  to  be  detected.  For  this  purpose  we  intro¬ 
duce  a  set  of  control  policies,  given  by 
{(p;h(M1),.,,h(Mr);n(M1),..,n(Mr);c1,..,cm)}. 
Here  p  denotes  any  partition  of  the  production 
line  (1,2,...,m)  into  blocksMj , . . ,Mr  (1<r<m). 
To  give  an  example  let  be  m-6  and  p  equal  to 
12(3451 6.  Then  M2-{3,4,5)  andMj-16}. 

Furthermore, 

h(M.)€lR+is  the  sampling  interval  at  block 
J  Mj  (l<i<r), 

n(M.)cN  is  the  sample. size  used  at  block 

and  J 

ci€Ro  *s  the  contr°l  limit  with  respect  to 

quality  characteristic  Xj  (1<i<m). 

Such  a  control  policy  operates  as  follows: 
At;the  last  component  of  each  block  Mj  ( 1< j<r ) 
every  h(M^)  time  units  of  production  an  inde¬ 
pendent  random  sample  of  size  n(M^)  (n(Mj)con- 

secutively  produced  items)  is  taken  and  ana¬ 
lyzed  with  respect  to  all  the  quality  charac¬ 
teristics  Xj  with  ieMj:  let  xx  be  the  values 

of  the  corresponding  sample  means. 

If  I Xj-Uj{>CjO*//n(Mj)'  then  component  i  — 
being  an  element  of  M*  —  is  inspected. 

If  lxj-»ll<c1ol/VnCM^ ;  holds  for  all  ieM^, 
then  Mj  is  left  alone. 

It  is  assumed  that  an  inspection  of  component 
i  ( l<i<m)  reveals  its  actual  state  of  produc- 
tion  with  probability  one.  If  State  II(i)  is 
recognized  a  renewal  is  undertaken  restoring 
State  I. 

Thus,  a  partition  gives  the  components  of  the 
production  line  which  are  monitored  simultane¬ 
ously  or  —  in  other  words  —  gives  the  points 
where  monitoring  and  inspections  are  allocated. 
Furthermore,  according  to  the  here  considered 
in-control  and  out-of-control  states,  two-sided 
x-charts  are  used  in  parallel  (one  for  each 
quality  characteristic). 


III.  THE  ECONOMIC  MODEL 

To  determine  the  economic  design  of  control 
charts  Duncan  (1956)  introduces  the  average 
longterm  profit  per  time  unit  as  objective 
function.  This  model  is  also  described  by  Mont- 
oaery  (1985).  Uhlmann  (1982)  and  v.Collani 
1981,1989)  use  the  average  longterm  profit  per 
item  produced  which  turns  out  to  be  simpler 
from  a  mathematical  point  of  view  and  seems  to 
be  more  appropriate  from  the  economic  view¬ 
point,  too.  Thus,  the  author  prefers  the  latter 


approach:  The  here  .introduced  objective  func¬ 
tion  can  be  considered  as  a "generalization, of 
v.  Collani's  objective  function.  Let  be  P  the 
total  profit  and  N  the  total  number  of  items 
produced,  where  both  of  the  random  variables 
refer  to  one  renewal  cycle.  Then,  the  objec- 
tivefunction  is  defined  by 
m  ECPlCi3 
«*  ■  <i  • 

To  obtain  an  explicit  expression  of  the  objec¬ 
tive  function  we  introduce  the  following  eco¬ 
nomic  quantities: 

gj>0:  gain  per  item  produced  in  State ‘I: 

gjj(.)<gl:  gain  per  item  produced  in  State  II(i). 

e->0:  cost  of*  an  erroneous  -inspection  of 

1  conponent  i. 

a(M)>0:  costs  of  sailing  and  analyzing  one  item 

with  respect  to  all  quality  characteris¬ 
tics  X-  with  i  M,  where  M  is  any  block  of 
the  production  line; 

b  (M)>0:  benefit  of  one  renewal  of  component  i  M, 

1  where  M  is  any  block  of  the  production 

line. 

We  assume  that  for  any  block  M  the  costs  of. 
sampling  n  items  and  analyzing  them  with  res¬ 
pect  to  all  the  Xj's  with  i  M  are  proportional 

to  the  sample  size  n,  i.e.  are  equal  to  a(M)n. 

In  order  to  reduce  the,  number  of  input-para¬ 
meters,  we  maximize  the  standardized  objecti¬ 
ve  function 

•  jt  SiSn(i))vA 

instead  of  **  itself.  With  the  standardized 
sampling  intervals  x(Mj):»  xh(Mj)  we  get  in 
analogy  to  Arnold  (1990),  where  the  case  of 
r»1  is  investigated: 

x(p; x(M1 ) , . . .  ,x(Mr) jnfMj) . n(Mr) ; c1 , . . .  ,cQ)  - 

*  2^  xTffTT  l  “  '  eiai“  a(M:)n(Mi)  + 

j-1  |  isMj  1  1  J  * 

\  diCb^Mpd-S^+e^pCexpCxiMj))-!)  | 
-  expCxi-Hjn^ - — }* 

Here  the  o^'s  and  the  fc^s  are  the  probabili¬ 
ties  of  the  errors  of  Type  I  and  Type  II  res¬ 
pectively: 

aj  *  2i(-e»); 

where  icMj  and  *  is  the  standardized  normal 
distribution  function. 


IV.  THE  SOLUTION  TECHNIQUE  AND  AN  EXAMPLE 

Obviously  the  standardized  objective  function 
can  be  separated  into  r  terms,  where  r  is  the 
number  of  blocks  of  the  paitition  of  the  pro¬ 
duction  line.  These  terms  may  be  maximized 
separately  and  thus,  the  method  of  dynamic 
programming  can  be  applied:  Considering  the 
production  line  (1 ,2, ...,i,i*l, .. . ,o)and  assu¬ 


ming  that  to  all  the  "partial"  production 

lines  {ml  ,{m-1  ,mK  . . . ,{ i+1 . ml  the  optimal 

partitions  are  already  calculated,  then  the 
optimal  partition  of  the  "partial"  production 
line  {i,...,ml  is  easily  determined  as  the 
optimum  of  the  partitions  beginning  with 
i|;. ;i.i+1  j and  ending  at  the  block  {1,2, 
where  (..  indicates  the  optimal  partitions  of 
the  "partial"  production  lines  (i+l,..,m)  and 
(i+2,..,ffl)  respectively. 

To  give  a  numerical  example  we  consider  aT pro¬ 
duction  line  consisting  of  six  serially  arran¬ 
ged,  components  numbered  J,..,6.  The  probabili¬ 
ties  q^  are  assumed- to  be  q^Oll,  q2»q.j«q4*q5“0.2, 
q6»0.1.  The  numerical  values  of  the  shift  pa¬ 
rameters  6.  are  6^-J,  Oj-1.5,  2, 

6s«i,  66«1.5.  The  cost  parameters^  are 
ej«2,  C2"e3"^»  e4??»  eS*e6"1,  The  sampling 
costs  per  item  are^-  assumed  to  consist  of  the 
cost  of  0.001  for  taking  one  items,  and  the 
cost  of  0.0005  per  quality  characteristic  for 
analyzing  one  item.  For  instance  we  obtain 
a({3))-0.001S,  a({3,4))»0.002  and  a( (2,3.4) )-0.0025. 
Furthermore,  let  be  bj-400,  b2«'500,  bj-300, 
b4-200,  b5?200  and  b^-300.  The  benent  bj(M) 
with  i6M  is  assumed  to  be  equal  to  b^  dimi¬ 
nished  by  0.5  \  per  component  between  i  and  the 
sampling  point  determined  by  M.  For  instance 
we  get  b3({3))-300,  b3( C 3 , 4 > )- 298 - 5  and 
b3( < 3', 4 , S)  )»297.  This  approach  of  calculating 
b.(M)  reflects  the  fact  that  it  is  desirable 
to  detect  failures  in  the  production, line  as 
soon  as  possible.  By  dynamic  programming  and 
with  x(M)-Xh(M)  we  obtain  the  optimal  control 
policy  (pjxCM,) . x(Mr);n(M|) , . . . ,n(Mr); 

c1,...,c6)-(1 1 2 1 34 \ S6;0. 03519, 0.01160,  0.01564, 
0.02563,  18,5,7,13;  3.102,3.300,3.118,3.760, 
2.870,3.551)  with  t-301.80  as  the  correspon¬ 
ding  maximum  value  of  the  standardized  objec¬ 
tive  function.  Thus,  it  is  best  to  monitor 
the  components  1  and  2  separately  and  to  moni¬ 
tor  simultaneously  the  components  3  and  4  on 
the  one  hand  and  the  .components  5  and  6  on  the 
other  hand. 
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Abstract 

I  develop  an  efficient  coaputational  algoritha 
for  the  statistical  design  of' fraction  defective 
charts  when  there  are  up  to  ten  different  shifts  for 
which  protection  is  desired.  This  algoritha  allows 
the  enumeration  of  all  feasible  statistical  designs 
and  thus  can  be  extended  to  allow  for  the  economic 
statistical  design  of  fraction  defective  charts.  A 
third- use  of' this  algoritha  is  to  design  single 
saaple  acceptance  sampling  plans  with  specified  AQL 
and  LTPD  under  a  Type  B  scenario. 

Statistical. Design  of .Fraction  Defective 
Control  Charts  With  Some  Extensions 

I.  Introduction 

In  this  paper  I  develop  an  exact, 
computationally  efficient  algoritha  to  statistically 
design  fraction  defective  charts.  This  algoritha  can 
be  used  to  give  protection  for  up  to  ten  shifts  in 
the  process  parameter.  A  slight  modification  in 
Input  allows  it  to  be  used  to  design  fraction 
defective  charts  for  the  situation  in  which  saall 
shifts  in  tho  process 

parameter  are  to  be  disregarded,  a  situation  explored 
by  Voodall<1985). 

While  this  algoritha  can  be  used  on  a  stand 
alone  basis,  it  can  Also  be  used  as  the  basis  for 
economic  statistical  design  of  fraction  defective 
charts  since  it  allows  enumeration  of  all  feasible 
statistical  designs;  these  Jointly  define  the 
feasible  region  for  an  economic  statistical  design. 

In  section  II  I  discuss  the  development  of  the 
statistical  design  algoritha.  Its  use  in  its  various 
forms  for  statistical  design  is  illustrated  with 
several  examples.  An  example  of  its  use  in  the 
design  of  a  single  saaple  fraction  defective 
acceptance  sampling  plan  with  specified  AQL  and  LTPD 
in  a  Type  B  situation  is  also  illustrated. 

In  section  III  I  use  the  algoritha  to  define 
the  feasible  region  for  economic  statistical  design 
of  fraction  defective  control  charts.  A  pattern 
search  algoritha  is  developed  to  solve  the  economic 
statistical  design  problem  and  the  results  are 
compared  to  economic  designs,  A  brief  summary  is 
drawn  in  section  IV. 

II.  Statistical  design 

A  statistical  design  of  a  fraction  defective 
control  chart  involves  the  selection  of  the  saaple 
size  n  and  the  acceptance  number  c  such  that 

ARLj  >  ARLBj 

and  ARLj  <  ARLB*  i-2,3,,„,k 

<1> 


where  ARL  Is  the  Average  Run  Length  of  the  control 
chart  when-the  process  is  in, control ’with  process 
proportion  defective  pj  and  ARL^,  i-2 , 3 , . . .  k  Is  the 
ARL  when  of  shift -of  a  specified  magnitude  p4  occurs 

ARLB  is  defined  as  the  desired  bound  on  ARL.  An 
alternative  to  this  type  of  statistical  design  is  one 
in  which  it  is  desired  that  the  control  chart  does 
not  signal  a  shift  when  the  shift  is  saall.  This 
type  of  design,  which  was  proposed’ by  Woodall  can  be 
defined  as  (1)  above  with  the  exception  that  for  1-2, 
the  constraint  is  replaced  with 

ARLj  >  ARLBj. 

We  can  define  ARL  in%  terms  of  a  and  8 , 
respectively. tho'  Type  I  and  Type  XI  error 
probabilities  of  the  control  chart.  Specifically, 

ARL  -  1/a 

and  ARLj  -  l/<i-p,).  <2> 

For  fraction  dofectlve  control  charts  we  can  define 
a  and  8  as 

•■i-g  (5)  ’ 


trt  (5) p/(»-w>w 


(3) 

Machine  calculation  of  a  and  p  can  be 
difficult  for  practical  values  of  n  because  of  the 
factorial  terms  and  powors  of  p  in  (3)  above 
Recently,  an  efficient  exact  algorithm  for  this 
purpose  was  presented  by  Cehrlein,  Ord  and 
Fishburn(l986)  and  a  similar  algorithm  is  employed  in 
my  computations. 

Statistical  design  of  fraction  defective  charts 
can  be  accomplished  by  an  enumeration  of  all  n  and  c 
values  until  the  constraints  in  (1)  are  all 
satisfied.  A  more  efficient  algoritha  can  be 
developed  by  using  knowledge  of  the  shape  of  the 
operating  characteristic  <O.C.)  curves  as  n  and  c 
varies.  Operating  characteristic  curves  for  various 
n  and  c  are  presented  in  Figure  I.  Specifically,  as 
c  increases  for  a  fixed  n  the  O.C.  curve  shifts  to 
the  right.  As  n  Increases  for  a  fixed  c  the  0  0 
curve’s  steepness  increases  One  can  also  summarize 
these  changes  In  terms  of  of  Type  I  and  Type  II  error 
probabilities.  In  particular,  as  n  increases  a 
increases  and  8  decreases  for  fixed  c.  As  c 
Increases  «  decreases  and  p  increases  for  fixed  n. 

Using  these  facts  I  develop  an  algorithm  more 
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efficient  than  enumeration.  The  steps  in  this- 
algorithm  are  as  follows.  First,  set. c-0; and  then 
check  whether  the  first. constraint  is  statisfied.  If 
it  is  not  then  c  can  be  increased  until  the  first 
constraint  is  satisfied  because  any  increase  in  n 
will  further . lower  the  first  ARL.  Next,  n  Is 
Increased  until  the  remaining  fc-1  constraints  are 
satisfied  or  until  c  is  again  Increased. 

I  emphasize  that  there  are  a  number  of 
solutions  to  (1)  and  one  must  chose.between  these  on 
the  basis  of  some  criteria.  In  the  examples  I 
solved,  the  criterion  is  the  design  satisfying  (1) 
with  the 'smallest  n  for- the  saallest-c.  The 
algorithm- I  develop  allows  enumeration ‘of  the 
feasible  statistical  designs  within  a  finite, range  of 
n  and  c  and  an  option  iKl’the  coded  version  of  the 
algorithm  that  Is  attached  allows  one  to  enumerate 
all  feasible  statistical  designs. 

III.  Economic  statistical  design 

In  economic  statistical  design  we  wish  to 
minimize  the  cost  of  a  process  or  maximize  the  profit 
of  a  process  when  the  costs  of  statistical  process 
control  and  the  costs  of  operating  out  of  control  are 
considered' and  statistical  constraints  are  placed 
upon  the  model.  Sanlga(1989)  developed  the  idea  of 
economic  statistical  design  and  shows  that  this  type 
of  design  can  be  more  satisfactory  for  decision 
makers  at  all  levels  of  a  firm. 

Mathematically,  the  economic  statistical  design 
problem  can  bo  expressed  as 

max  P(n,c,h) 

S.t,  ARLj  >ARLBj 

ARL.  <ARLB1  1-2,3,... ,k 

<5> 

where  P  is  the  relative  profit  per  item  produced  and 
h  is  the  sampling  frequency.  In  this  paper,  X  use  a 
model  for  P  developed  by  Hrynlewicz(1989)  who 
explored  the  performance  of  economically  designed 
fraction  defective  charts  when  shifts  apart  from 
expected  shifts  occured. 

The  function  P  is  nonlinear  and  n  and  c  are 
integer  variables  which  complicates  the  solution 
procedure.  In  this  paper,  I  take  advantage  of  the 
statistical  design  algorithm  I  describe  in  the  last 
section  to  facilitate  the  solution  to  (5).  In 
particular,  the  algorithm  I  use  to  solve  (5)  is 
comprised  of  two  stages.  In  the  first  stage,  I  use 
the  statistical  design  algorthla  to  find  a  feasible 
statistical  solution,  Next,  the  optimal  sampling 
frequency  is  found  for  this  solution  by  pattern 
search.  The  same  process  continues  for  each  feasible 
statistical  solution.  Finally,  the  feasible 
statistical  solutions  (with  optimal  h)  are  compared. 

IV,  Summary 

I  h*ve  designed  an  algorithm  to  allow  the  exact 
anC  efficient  calculation  of  the  design  parameters 
for  a  fraction  defective  control  chart  under  a 
statistical  criterion.  This  algorithm,  can  also  be 
used  to  design  Type  ft  single  sample  acceptance 
sampling  plans  with  specified  AQL,  and  LTPD.  The 
algorithm  allows  enumeration  of  the  feasible  region 
for  the  integer  parameters  in  an  economic  statistical 


design.  This  use  is  demonstrated  in  the  last 
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ECONOMICALLY  OPTIMAL  CONTROL  OF  PROCESS  VARIABILITY 
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Abstract .  The  derivation  of .  economically 
optimal;  procedures  to  .  monitor  arid  control 
production  process /Variability  is  considered. 
The  objective'  is  to -maximise  average  profit 
per  item , produced . 


INTRODUCTION 

Production  processes  are  operated  in  order  to 
generate  profit.  The  level  of  profit  is 
clearly  a  function  of  the  quality, of  the  items 
produced,  which'  in  turn  depends  on  the 
operational  state  of  the  process.  SPC 
procedures,  typically  x,R  charts  and  Cusum 
charts,  are  relatively  widely  used  to  control 
quantitative  characteristics  of  produced  items 
(and  the  process  producing  them).  These 
procedures  have  traditionally  been  designed 
using  statistical  criteria  only  e.g.to  satisfy 
specific  ARL  or  OC-curve  requirements:  the 
costs  and  economic  benefits  associated  with 
operating  them  being  largely  ignored... The,  idea 
of  designing  SPC  procedures  so  as  to  optimise 
the  economic  performance- of -the- process  being 
monitored  has  attracted .steady  academic 
attention  since  the  1950s,  but  little 
practical  implementation.  Mathematical  models 
containing  too  many  parameters  and  complex 
optimization  procedures  are  cited  as  prime 
reasons  for  this  lack  of  acceptance ( 1] . 

Recent  work,  particularly  by  von  Collani  and 
others  in  Wvirzburg,  shows  that  when  the  chosen 
objective  is  to  maximise  long-run  profit  per 
produced  item,  the  resulting  objective 
function  is  simple  in  formulation  and  allows 
easy  determination  of  optimal  control 
procedures,  see,  for  example,  the  tabular 
procedure  for  designing  optimal  5?  charts  in 
(2]  and  nomograms  for  Y,np  and  c  charts  in 
-  This  paper  shows  how  the  objective 
function  is  obtained  and  used  to  generate 
economically  optimal  s  charts  for  controlling 
process  variability. 


THE  PROCESS  MODEL 

Let  X  be  the  quality  characteristic  of 
interest.  Assume  X  -  N(p,o2).  The  process 
operates  in  one  of  two  possible  states:  State 

I,  the  'in-control*  state  with  o  =  oo  or  State 

II,  the  'out-of-control'  state  with  o  =  8oo, 
where  6>1,  The  process  starts  in  State  I  and 
can  slip  to  State  II  as  the  result  of  a  single 
assignable  cause.  The  current  state  is 
determinable  only  by  means  of  a  process 
examination.  A  return  from  State  II  to  State  I 
requires  a  repair/renewal  action.  State  I 
lifetime  T,  is  exponentially  distributed  with 
mean  E(T)  =  1/e.  The  average  number  of  items 
produced  per  hour  is  v  and  6,v,9,o©  are  known. 


THE  CONTROL  PROCEDURE 
Samples  of  n  items  aro  drawn  from  current 
production  at  time  intervals  of  length  h.  The 
standard  deviation  s  is  determined  and  plotted 
on  a  control  chart  with  a  single  action  limit 
at  ko©.  A  process  investigation,  followed  if 


necessary  by,  a  renewal,  is  undertaken- if  s 
plots  outside  this  limit. 

Clearly,  (n-l)s?/o2  -  X?„-i  (Chi-square 
distribution,  with  n-1  degrees-of -freedom) . 
Hence,  the  probability  of  a  'false; alarm'  is 
given  by 

a  =  Prob ( s >ko© | state  I)  =  1  -  F([n-l]k2),  (1) 

the  probability  of  not  detecting  State  II  is 

P  =  Prob ( s£ko„ |  state  II)  =.P([n-l]k2/62)  (2) 

and  the  average  run  lengths  in  state  I,  II  are 
ARLi  s  l/a,i  ARLxx  =  l/(l-p)  respectively. 

F  is  the^c.d.f .  of  a  X2„.i  random  variable. 

The  parameters  n,h,k  are  chosen v so  as  to 
maximise  the  long-run  average  profit  per  item. 


THE  OBJECTIVE  FUNCTION 
Let  a*  =  cost  of  sampling  a  single  item, 

e*  =  cost  of  Investigating  a- false  alarm, 
b*  «  (expected)  benefit  per  renewal  viz, 
the  extra  profit  resulting  from  a 
renewal/repair  minus  the  cost  of 
locating/repairing  the  assignable 
cause. 

g  =  (average)  profit  per  item  when  in 
state  II 


Further,  for  each  renewal  cycle,  let  Ax,  An 
be  the  number  of  samples  taken  while  the 


process  is  in  State  I,  II  ;  let  Ar  be  the 
number  of  false  alarms  and  A  =  Ai  +  An  the 


total  number  of  samples  drawn.  If  L,  N,  G  are 
the  cycle  length,  number  of  items  produced  and 
the  profit/gain  achieved  In  a  given  cycle, 
then 


E(N)  3  hE(L)  3  hvE(A) ,  (3) 

E(G)  =  b*  4  gE(N)  -  e*E(Af)  -a*nE(A).  (4) 

Since  T  is  exponential,  it  can  be  shown  that 

E(AX)  =  l/(edh-l)  and  E(Af)  =  a/fe^-l).  (5) 

Clearly, 

E(An)  =  ARLn  =  l/(l-p).  (6) 

Now,  defining  long-run  average  profit  per  item 
as  P*  =  E(G)/E(N),  substituting  from  equations 
(3)-(6)  and  re-writing,  we  obtain  P*(h,n,k)  = 


e*  r  (b*/o*)(e®h-l)  -  a 
vh^  e^p 


(1-P) 


(a*/e*)nj  +  g 


This  is  further  simplified  by  noting  that  P* 
is  maximised  with  respect  to  h,n,k  only  by 
maximising  P(h,n,k) 


lr(b*/o*)(eeh-l)-a 


(1-p)  -  (a*/e*)nj 


Finally,  setting  x  =  6h  (viz.  h*rl/0),  b  = 
b*/e*  and  a  =  a */e*,  we  obtain  the  objective 
function: 
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1  r  b(ex-l)-a  n 

P(x,nfk)  =  -  - — - (1-p)  -  an  „  (7) 

x  L  e*-p  J 

where. a,  p  are  given  by  (1),  (2)  respectively. 
This  objective  function  is  very  much  simpler 
than  earlier  alternatives  which  were  generally 
based  oh  minimising  the  loss  per  unit  of  time , 
see  [4],  [5] -for  example.  The' parameters  a,b 
are  interpre table  as  the -relative  sampling 
cost  per- item  and  relative' benefit  per 
renewal,  while  x  is  the  sampling  interval 
expressed  in  average  State  Z  lifetime  -*a 
process  characteristic.  In  practice,  b  will  be 
large ( while  a  and  x  will  both  be  small 
-  typically  «1. 


THE  NO^SAMPLING  PROCEDURE 
When  unit  sampling  cost (a*)  is  high,  relative 
to  the  cost  of* a  process- investigation (e* )  and 
the  benefit(b*)»  itis  clear  that  frcm  an 
economic  viewpoint,  it  may  be  better  not  to 
sample/operate  the  control  chart,  but  rather 
to’perfonn  routine  process  investigations  (and 
if  necessary,  renewals)  at  regular  intervals 
of  length  h.  This  procedure  is  described  in 
terms  of  the  s-chart  parameter  set  (h,n,k)  by 
setting. n=0,  k=0.  In  this  case  a**lj  p=0  and 
the  objective  fonction(7)  takes  the  form 

P(x,n,k)  *  [b(e*-l)-l]/(xez)  (8) 


OPTIMAL  CONTROL  PROCEDURES 
The  task  of- finding. the  optimal  s-chart  design 
has  been  reduced  to  that  of  maximising  P  with 
respect  to  x,n,k  (given  that  values  for  the 
economic  parameters  a*  ,b*,  e*  are  available). 
Despite  the  relatively  simple  appearance  of 
(7),  this  is  not  a  trivial  task  and  will 
hardly  be  undertaken  by  the  Industrial 
practitioner.  The  report  by  von  collani  and 
Sheil(6],  contains  extensive  tables  of  optimal 
designs,  indexed  by  values  of  6,a  and  b:  Table 
1  contains  a  representative  sample  of  these. 


II uiu  1  I  COMO  OptlMl  De.iqm* 


I  *  b 

1 

«  -  .0005 

•  -  .005 

»  -  .05  | 

s 

k 

J 

S 

k 

5 

. 

k 

5  i 

|1.S  20 

34 

1.3)) 

.0)47 

1* 

1.304 

.09)0 

0 

.000 

.3457  | 

i  v> 

31 

1.330 

.0247 

19 

1.309 

.0)49 

0 

.000 

.2124  | 

1  1M 

31 

1.3)0 

.0142 

19 

1.311 

.0)21 

0 

.000 

.1197  1 

|  500 

31 

1.3)0 

.00)7 

19 

1.312 

.0174 

0 

.000 

.0444  | 

1 2.0  20 

14 

1.430 

.02)1 

9 

1.594 

.0592 

5 

1.395 

.1410  | 

1  M 

14 

1.430 

.0145 

9 

1.599 

.0347 

5 

1.412 

.0954  | 

|  ISO 

14 

1.440 

.004) 

10 

1.570 

.0227 

5 

1.423 

.05)0  \ 

j  500 

14 

1.440 

.0044 

10 

1.571 

.0124 

5 

1.430 

.024)  j 

|2.5  20 

9 

1.M2 

.0144 

4 

1.444 

.0447 

4 

1.40) 

.1344  | 

1  » 

9 

1.442 

.0114 

4 

1.444 

.0291 

4 

1.417 

.0412  | 

]  150 

9 

1.442 

.0047 

7 

1.74 

.0149 

4 

1.424 

.0454  | 

j  500 

9 

1.441 

.0037 

7 

1.741 

.0103 

4 

1.590 

.0271  | 

(3.0  20 

7 

2.010 

.0144 

5 

2.010 

.0431 

4 

1.110 

.1340  { 

|  50 

1 

2.011 

.0103 

5 

2.01) 

.0249 

4 

1.722 

.0429  \ 

j  150 

1 

2.071 

.0040 

5 

2.014 

.0154 

4 

1.730 

.0444  | 

j  500 

1. . . 

-JL 

1 

2.07) 

.00)2 

5 

2.015 

.0044 

4 

1.735 

.0251  | 

- » 

A  significant  feature  of  the  optimal  designs, 
which  is>also  apparent  in  Table  l,  is  that  for 
each  value  for. 5,  the  values  for  n  and-k  are 
almost-  independent  of  b.  This  feature  was  used 
in.  [7]-' to  ^provide  tables  of  7  at  least 
approximately  -  optimal  n,k  pairs  (and 
associated  values  for.a,p)  for  a  wide  range  of 
6,  a  values.  As "anticipated,  values  for  the 
' standardised '  sampling  interval  x *  are - smal 1 . 
This,,  fact' 'leads’"  to*  a  simple  method  for 
computing  a  close  approximation  to ‘ x . 
Differentiating  (7)  with  respect  to  x  and 
equating  to  0,  yields  the  identity 

(ex-p+xex)  ( 1-p )  [b(  1-p  )+a] , 

=  (eVp)2[b(l-p)-an]. 

Taking  the  usual. series  expansion  for  ex  and 
neglecting  0(x2)  terms,  yields  the 
approximation 

r  2(l-p)a(an*a)  -i* 

*  »  -  (9) 

L  (i+p)[b(l-p)+a]  J 

When  the  no-sampling  procedure (n=0,k=0)  is 
optimal,  (9)  -  becomes 

*  «  /(2/<b+l)].  (10) 

In  this  case  however,  the  exact  value  for  x  is 
easily  obtained  by  iteration[7] .  Thus,  the 
problem  of  obtaining  economically  optimal  s- 
charts  to  monitor  process  variability  is 
greatly  simplified  for  the  potential  user.  The 
approximate  approach  outlined  above  -  and 
detailed  in  [7)  -  has  the  disadvantage  that 
the  precise  values  for  a, 8  for  which  the 
control  churt/procedure  design  is  required  may 
not  appear  in  the  table,  so  that  some 
interpolation  is  required  and  a ^ consequent 
decrease  in  accuracy  likely  results.  The  ideal 
would  be  to  have  nomograms  available  for  the 
three  design  parameters;  the  author  is 
currently  working  on  the  production  of  such 
nomograms . 
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Abstract  -  It  Is  well  k'nown  that  the  sampling  plan  with 
given, producer’s  and  consumer’s  risks  Is  the  most  basic 
acceptance  inspection.  As  It  is,  no  sooner  have  we  tried  to 
use  this  plan  than  we  are  confronted  by  a  difficulty,  that 
is,  how  we  should  specify  the  four  parameters  determining 
the  sampling  plan.  The  four  parameters  are  p„  p,,  a  and 
0,  which  specify  the  two  points  on  the  00-  curve. 

However,  there  are  few  studies  on  this  topic.  In  this  paper 
a  reasonable  specification  of  these  parameters  is  discussed 
from  an  economic  viewpoint. 


1.  INTRODUCTION 

The  sampling  plan  with  given  producer’s  and  con¬ 
sumer’s  risks  tends  to*  be  avoided  to  use  because  of  the 
difficulty  of  specifying  the  two  points  on  the  00-  curve, 
that  Is  (  p„  1-a  )  and  (  p„  0  ),  where 

p,«  fraction  defective  corresponding  to  an  acceptable 
quality  level, 

p,»  fraction  defective  corresponding  to  an  unsatis¬ 
factory  quality  level, 

a  ■  specified  value  of  producer’s  risk,  and 
0  *  specified  value  of  consumer’s  risk. 

From  this  reason,  Deming  (1986)  recommended  to  use  the 
Dodge-Romlg  and  Mll-STD  sampling  plans  (ISO  2859). 
However,  the  sampling  plan  with  given  producer’s  and 
consumer’s  risks  Is  most  basic  and  useful  If  the  difficulty 
of  specifying  the  four  parameters,  p„  p„  a  and  0  Is 
dissolved.  The  purpose  of  this  paper ‘Is  to  analyze  the 
criterion  of  specifying  the  producer’s  and  consumer’s 
risks  and  to  give  a  guidance  for  reasonable  specification  of 
these  four  parameters  from  an  economic  viewpoint. 

2.  INTERPRETATION  OF  SPECIFYING  TWO-POINTS  ON JfHE 

QQzmyj; 

The  criterion  for  specifying  two  points  on  the  OC  - 
curve,  usually  (p„l-a  )  and  (p,,  0  ),  means  that 

1)  To  specify  the  probability  that  a  good  lot  (p  «p,)  Is 
erroneously  rejected  Is  a. 

2)  To  specify  the  probability  that  a  nonconforming  lot 
(p  *P*)  Is  erroneously  accepted  Is  0. 

Actual  risks  a*  and  0 ‘  are  calculated  as  l-£4p,)and  UPt) 
respectively,  where  Up )  Is  the  probability  of  accepting  a 
lot  with  fraction  p  defective. 

From  the  producer’s  standpoint,  the  costs  of  accept¬ 
ing  and  rejecting  a  lot  with  fraction  p  defective  without 
sampling  Inspection  are 

C<P><p)  =  tfA<P>.  WP> 

C(RP\p)  =  WS<P).  RpR(P>  (I> 

where 

N  *  lot  size 


A  (.P)*  cost  of  accepting  an  item  without  regard  to 
quality  from  the  producer’s  standpoint 

R cost  of  rejecting  an  Item  without  regard  to 
quality  from  the  producer’s  standpoint 

A^a  cost  of  accepting  a  nonconforming  item  from 
the  producer’s  standpoint 

cost  of  rejecting  a  nonconforming  item  from 
c  the  producer’s  standpoint. 

For  details  about  a  liner  cost  model,  Raid’s  (1981)  work  can 
be  x-eferred  to. 

Hence,  the  following  policy  leads  to  a  more  profitable 
state: 

If  C<P>  >  ,  the  lot  is  rejected  (Case-R^>) 

If  C<P><  C<P>  ,  the lotis accepted  (Case-A,»*>). 

When  the  lot  Is  erroneously  rejected  although  Case-A<r> 
occux*s,  the  extra  cost  Is 

c<;>-  c<;)=,v(R(p)-A(p)+//p>p  -a<2!v>  (2) 

s  W-F(p). 

The  expected  value  of  this  oppoi'tunity  loss  per  lot,  that  Is 
4  Regret  ’  ,  is  given  as 

«g,(p)=  N-F(p){l  -  UP)).  (3) 

On  the  other  hand,  from  the  consumer’s  standpoint,  the 
costs  of  accepting  and  rejecting  a  lot  with  fraction  p 
defective  without  sampling  Inspection  are 

C(^(p)= 

C  (G)(p  )  a  w/jG)  +  A’p  S(2C)  , 

where  the  cost  parameters  from  the  consumer's  standpoint 
correspond  to  those  from  the  producer’s  standpoint. 

In  the  same  manner  as  the  case  for  the  producer,  when  the 
lot  is  erroneously  accepted,  the  extra  cost  Is 

CW-e<«- «>,<«■.  (6) 

=  A’-0(p). 

The  Regret  In  this  case  is 

Rgt(p)  =  N-G{p)Up).  (6) 

Thus  the  criterion  for  specifying  the  two  points  on  the  00- 
curvo  (he.  (p„l-a  )  ,  (p,  ,  0  ))  may  be  interpreted  as  a 
criterion  having  the  following  two  regrets : 

1)  To  specify  the  regret :  RgyiPt)  =  N'F(pk)  a 
when  a  good  lot  (pap,)  Is  erroneously  rejected. 

2)  To  specify  the  regret :  Rgtipt)  =  N’G(pa)  0 
when  a  nonconforming  lot  (p=Pj)  Is  erroneously 
accepted. 

From  the  above  Interpretation  of  the  criterion,  we  may 
introduce  the  following  consti'alnts  from  the  economic 
viewpoint : 
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Kg,(p.)  £  «(P>  m 

fete)  S  «(C).  ‘ 

where  M<r>  andStf <c>  Imply  the  allowable  expected  extra¬ 
loss  for  the  producer  an  i  the  consumer,  respectively. 

In  the  actual  case,  that  acceptance  or  rejection  of  lots 
is  judged  with  sampling  fnspction,  the  Inspection  cost- 
should  be  additionally  considered.  The  inspection  cost 
greatly  depends  on  ASN  (Average  Sample  Humber ). 

Assuming  that  the  actual  Inspection  cost  can  be 
represented  as 

dx  +  dxs, 

where  d,  ■  fixed  cost  for  Inspection,  d,  *  sampling  cost 
per  item,  and  s  =*  ASN.  Then,  in  this  case  the  respective 
regrets  for  the  producer  and  the'consumer  are 

Rg,Up)  *  (N-  s)  F(p)(l  -  Up))  +  *  <<*«  +  d*s) 

(8) 

/te,’(p)  =  (N-s)  G(p)Up)  +  (l-£)(d,  +  dts), 
where  C  denotes  the  proportion  of  sharing  the  Inspection 
cost  by  the  producer.  From  Eqs.(7)  and  (8),  we  have 

Kg, -ip,)  £  M(p)+  e  I  (9) 

Kg,'( P.)  £  M(C)+(1-«H, 


where  I  *  allowable  Inspection  cost  which  Is  determined  by 
the  mutual  agreement  between  the  producer  and  the 
consumer.  From  Eqs.(3)(6)  and  (0),  we  have 

,  „  x  tfw+e</rd,-<*,g) 

(N-s  )  F(px) 

..  .  .  M"»*{l-e)(I-drd,s) 

Upt)  s ,  — 

From  the  definition  of  7,  we  have 
7  b  dx  +  dts , 
then  by  substituting  Eq.(U)  Into  Eq.(10),  wo  can  determine 
the  risks  a  and  ,6  as  follows  : 

1  "  Wp,)  S  <R-(/-d,)/d,>  F <p,)  2  “ 


<io> 


(II) 


Up,) 


s  (d-(i-d,)/d,)  C(P.)  “  B 


(12) 


Then,  p,  and  p,  are  obtained  by  solving  Eq.(12),  that  Is 
HP* 


Pi  E  A,  -  R,  + 


(K,l''-A,i',)(N-lI-d,)/d,)  a 


(13) 

P,=  K,  -A,  g  , 


3.  PROCEDURE  FOR  SPECIFYING  FOUR  PARAMETERS 

It  Is  said  that  the  choice  of  (p,,  p,)  should  be  based 
on  the  technological  requirements  for  usage  of  products 
and  the  economical  consideration  about-',  the  pxxxiuctlon 
cost,  the  inspection  cost  and  so  on.  It  is,  however, 
difficult  to  specify  (p„  p,)  In  actuality.  Even  though  (p„ 
p4)  can  be  specified,  the  specification  of  (  a ,  0  )  is  also 
difficult.  As  a  matter  of  fact,  most  sampling  inspection 
tables  and  design  procedures  are  based  on  (  a,  0  ),  which 
are  some  conventional  values,  c.g.  IX,  5X,  10X,  etc.  So  the 
user  of  the  sampling  plan  is  obliged  to  specify  (  a,  0  ) 
beforehand.  By  using  the  tentative  (  a,  0  )  specified  as 
the  conventional  values,  we  obtain  (p,,  p,)  from  Eq.(13), 
and  then,  a  design  of  the  single  sampling  attribute  plan  ( 


n  ,  c  j  is  possible.  Note  that  the  ASN  is  derived  after  the 
sampling  plan  (  ,n  ,  c  )  was  determined.  Hence,  it  is  neces¬ 
sary  to  ascertain  whether  Eqs.(10)  and  (11)  are  satisfied  or 
not  after  determining  the  sampling  plan  (  n  ,  c  ). 

Eq.(ll)  gives 


1  -  Up>)  S 


W<r>*  e(I-dx-dts) 
(N-s)F(p,) 


Up*) 


rt{CMl-g)(7-<W,s) 
s  (N-s)G(Pi) 


S  0. 


(14) 


Then,  If  Eq.(ll)  Is  satisfied,  Eq.(10)  (equivalently,  Eq.(9)) 
is. satisfied.  Even  if  Eq.(ll)  is  not. satisfied,  it  does  not 
necessarily  follow  that  Eq.(10)  is  not  satisfied. 

Table  1  shows  the  possible  cases. 


TABLE  1.  Relationship  Between  Eqs.(10),(U)  and 

User’s  Satisfaction  With  Their  Requirements. 


Case 

Is  Eq.(ll) 
satisfied  ? 

Is  Eq.\10) 
satisfied  ? 

Are  the  user's 
requirements 
satisfied  ° 

1 

Yes 

_ 

Yes 

2 

No 

Yes 

Yes 

3 

No 

No 

No 

In  the  cases-1  or  -2  in  Table  1,  the  present  sampling  plan 
will  be  adopted.  If  the  case-3  occures,  it  follows  that 


l  -  Up,)  £  a  £ 

«p.)  s  e  £ 


eU-d,-d,s) 
<.v2S)>-(p,)  =  “ 

tfc>t  a-c)u-d, -d,s)_ 

(R-s)G(p,) 


(15) 


Accordingly,  there  are  following  steps  to  get  a  sampling 
plan  which  satisfies  Eq.(10),  that  Is 


Step-1 :  To  replace  a. and  0  with  a’  and  0\  x'espectively, 
and  renew  them  a  and  0,  respectively. 

The  case-3  indicates  that  the  initial  specified  risks 
were  needlessly  small.  Sampling  plan  based  on  (a* , 
0 ’)  Is  'reduced’  inspection  In  compax’lson  with  It 
based  on  old  (  a,  0  ).  So  the  now  plan  based  on 
(a* ,  0')  will  lessen  its  ASN. 

Step-2  :  To  alter  the  type  of  Inspection  to  that  whose  ASN 
can  be  reduced.  For  example,  changing  the  sampling 
plan  flora  single  to  double,  or  double  to  sequential 
Enell(I984)  discussed  which  sampling  plan  should 
be  chosen  In  various  situations. 

Step-3  :  To  raise  the  allowable  inspection  cost  7,  through 
the  mutual  negotiation  between  the  producer  and  the 
consumer. 
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TV  tiftne  fatrcl  Earsire  ilpritk  d  fo-lber  Emcc  Ssta 
fee  Zilsrg  fes  Yaari 
bstitrte  cf  tel  id  fetisaslks 
Seilugjhg  Krersity,  feiin  Gda 


fetrafcla  this  paper,  a  adaptive  control  alsritk  cf 
nen-lbear  exnaicsjstafestea  iitredred .  Bis 
aterifa  is  &  da!  cf  apaaeter  estiEtcd  aferi- 
tk ,  ad  if  has  tie  reorsite  fen .  Tie  mi  fen  sell 
Excr  of  esccfix-i-aai  be  oblaised  via  this  aferiik-fer 
<x,  it  is  satisfactory  tie  afplfetim  cf  sank  sis- 
tes. 

1.  L'ltrodotim 

There  ere  a  iot  of  ccoraic  sjstns ,  dose  Bihea- 
tic  rafels  are  non-linear .  For  erapie ,  ve  oosidr 
the  sjstea  vhich  ensists  of  it  eoeicaic  sectors  -fcexr- 
dirg  to  tie  &KrDcusliiSfmkti(o  faction,  ve 
can  vrite  the  Bthsatic  nodal  of  ttecaaalcsjsta 

Q(t)  sgJ,(t)li(t)*'Ii(t)M  (1) 
there  t  is  a  tiie ,  Q(t)  is  output,  Li  is  labor  of  ith 
sector ,  Sift)  is  capital  of  the  ith  sector.  Sift)  is 
■technical  progress",  at  aadbi  are  elasticity  of  labor 
and  capital  of  the  ith  soda- ,  respectively . 

It  is  very  itportant  for  fte  arauis  of  the  ecsncsic  systa 
above  to  solve  the  following  trebles:  Sjpcse  Cb(t)  vas  given, 

how  can  (Mine  li°(t),  Sic(t),  i  =1 ,  2 . H,  such  that: 

2  iift)Li°(t)*Mt)M  -  (bft)  ft) 
where  «  assure  that  Sift),  ai  and  hi,  are  know.  This 
trebles  can  be  trarsfered  into  the  trebles  of  designing 
control  sysla.  Us  general  fen  of  this  trebles  is  : 

Consider  the  non-linear  eooncsic  sjstfj  S.  Supose  its 
axfel  has  been  vritten  in  the  fen  of  prediction  no¬ 
dal  : 

yfthftWMJt1-,  9  ft).  t)  (3) 

where  >(t)  is  an  cne-dirensicna'  output,  uft)  is  a  input 
vector ,  8  ft)  is  an  irinow  randos  tise-varyirg  paraae- 


ter  verfrr,  t  is  discrete  tire,  red 

Tt-r^  jft-t),ift-n*l), . ,yft-l) } 

!  { nft-i),nft-t*l) . eft) } 

If  tie  yaft)  vas  given,  tie  trebles  is  that  hw  k  can 
(Mine  efft),  sua  that 

flYt-i1’.  Ot*"*,®®.!] 

SfraiEtrs  eozi  to  ysft),  rare  §  ft)  is  an  predic- 
tkn  value  of  8(0. 

Tbs  Easing  cf  appreaiEtim  is  fet  tie  octroi  ateritis 
satisfies  tie  criteria  cf  ccctrol  errer  being  mifen  ssall . 
ftdiirtien :  Sspuse  { C?,  f  (B )  is  a  grot?  cf  control 
las  of  thesetes  S  which  cottars  peraeter  ?.  If  fer 
®j  £  >  0 ,  tire  exist  8°f  B  and  K  >  0 ,  such  that 
tie  ccctrol  variable  aft)  ebtained  with  alaeriti*  C?°fcr 
t  >  S  satisfies  : 

I  Vo(t)  -  f(  Yw1’,  Dt”,  8  ft)  .1  ]  I  <  E  a-s 
Are  roft)  is  expected  output  at  tie  t .  6  ft)  is  tie 
prediction  esliBtim  of  8  ft) ,  re  sav  that  the  ccr.trol 
laws  { C?,  ?  f  B )  satisfy  tie  criteria  of  cmtrol  er¬ 
rer  being  inifcn  ssall. 

2  teptive  catrol  algeritia 

Fa  convenient,  the  rxfcl  (3)  of  tie  systes  is  charged 
into  tie  follovirg  fen : 

yft)=f[Yt-,w,,i:v,Ut-it’.  8ft)  ,t]  (0 

In  general  case ,  the  adaptive  catrol  consists  of  t» 
parts  ,  which  are  the  algorithi  of  paraieter  cstiralic'i 
sd  the  algcrilhi  of  catrol  lav .  But ,  in  this  paper , 
the  parsaeter  8  ft)  of  noefcl  (4)  say  be  tisrraryirs  and 
the  estiBieu  value  8  ft)  of  8  ft)  is  nxessary  fa  Min¬ 
ing  uft) .  Oily  can  ve  obtain 

8(0),  8(0,  8(2) .  8ft-l), 

via  general  cstiBted  algoritha ,  so  ve  nust  calculate 
prediction  value  8  ft)  of  8  ft).  This  nears  that  the 
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adsi'ne  cadre!  a&riti*  ist  resist.  ef  tins  pads , 
vai-a  are  tie  jsra&z  estistad  a’srifia.  tie  sear 
tar  preSatfca  aisritii  ad  tie  zgtha  octroi  !ar. 
gtc  details  are  as  folforis : 

(1)  Ire  paaretir  estiztad  aferifis 
a  tie  paper  [\]  re  fee  given  estireted  alsriik  cf 
pararetir  8(0  iasdsl  (0  : 

S 

3(0=§(i-i)* — : - Ve^.)f[8(t-1),  t] 

ItWMtt-i)]*2 

*b(0-fSr««,a(O.Oe.lt’.f(t-l).t3}  0 

ibere  8(0  isaaestiEtedHlueof  8(0,  o  is  a  sui¬ 
table  ccstat  and 

78c«-afit,8(l-i)>^fCft-»t’. 

aW.W^S.tlU.fct-i) 

7re  cilicKJ  of  tiiis  algoritia  res  ban  dsrstraled  b 
practical  application  ard  theoretical  araljsis 

(2)  The  parsretcr  prediction  algoritia 
If  l-l  is  present  tire  ,<sirg  stinted  algcritia  0, 
re  aily  ftain  cstiiatire  avxrrt 

8  (0),8(i) .  8(H) 

“Bis  estireted  value  8  ft)  can  only  obtained  by  sore 
prediiion  alfjritia.  Here,  re  sissst  isine  reli-level 
rewsive  prediction  rethod  vhidi  details  caa  be  fond 
in  pacer  [2] . 

(3)  Tie  ahplire  control  lav 
Because  the  ■positkn"  of  control  variable  u(t)and 
iriron  joareter  8  ft)  in  axfel  (4)  m  fe  considered 
as  the  sane,  as  tag  as  re  axsidr  the  control  variable 
aft)  as  tine-varying  peraacter  of  the  sjsten ,  then  re  can 
retenine  control  lav  oft)  by  estinated  algorithm  of  ur 
Imvn  pararetcr ;  So  re  haw 
? 

uWft-l)* - V„(,.i>f[o0(t-]),t] 

IVoCMifft.tf’ft-OJF 

‘froW-fSt-^/ft-l),!!.-,*-*,  8ft),t3} 
vherc  P  is  a  suitable  ccnstant,  yoft)  is  expected 
output.  8  ft)  is  (he  prediction  value  of  6  ft),  and 
Vu  (t-u  f  (t,u°  ft-l)3=-guf  Dft-i* 

u,i!t-it",,8ft),t]u«u«(t-i) 

Ifcir  the  proper  condition,  it  can  he  prove  that  this 
adbplive  control  algoritia  satisfies  the  criterion  of 


3feplkatkn 

fe:4r  rdelft)  a pis,  re  swse  A  vhitioA  {sag 
crerellitr ,  then 

o(o^(oi,(o*!i,(Oi,*Moyo*%(fla 

fesae  ostiretad  values  ?i(0,  raft),  4,  4,  Si  and  h 
rere  retabed.  Fcr  tie  systa ,  re  have  : 

uftM,(o.u(o,i,(o,yo)T 

Lot* 

A(0=81t(0Ll°(t-lh,o(t-liSl 
d*(o  ^(oyft-uVft-i)^ 

4ft)  4i?i(0Li°(t-l)1,Ii°(t-l)t1 

4ft)  ^(oyft-i^ft-ip 

then 

Vo<t-nf6f,ft-l),tMdi(0.4(0,4(0  A(0)T 
If  the  expected  output  Cb!0  vas  given  ,re  tale 
that: 


rU°(0-| 

rlAt-D] 

rdifth 

LAt) 

=  . 

IAH) 

ft 

4ft) 

IA0 

LAW) 

4ft) 

LIAi)J 

k°(t-l)J 

L4ft)J 

*!0)(t)-J.(0l1<,(t-l)1,l1o(t-l)4,4(0L!o(t-l)^2o(t-lH 
vtae  LAW),  Lz°(t-1)  ,l.°(W)  and  fc°(W)  ae  taxvis 
and  captials  at  lire  W  of  tvo  canosic  sectors  respe¬ 
ctively.  For  any  £  >  0,  as  long  as  re  tale  the  suitable 
value  of  P,  fcr  t>N ,  re  have 

!W0-|i.(0L.o(0i,Il°(0S,:<£  a.s 

Osins  this  rethod,  re  can  obtain  a  satisfied  input  of 
capital  and  Itiox  in  cadi  eoccvoaic  sector. 
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/csiract-DeveJop*Et  of  economy.  growth  of  population.  eaploita- 
tfcn  of  resexrce,  progress  of  science  and  technology  zod 
wrtBBMtOT  iatarefepeodect  and  Kttally constrained  so  that 
there  b  a  relaitoasfclp  of  tie  mity  of  opjesites  zne 
therefore,  tie  stair  on  eaviroomestol  probk*  wold  hai*  better 
uxferstaodbe  to  octroi  it  cely  ttra  th:;  c*e2t  system  cf  mity 
of  opposites  b  taken  bto  accoct,  lib  stair  a  tiae  control 
oil  of  total  anstof  pollutant  u  environment  is  just  based 
co  tib  vingobt  to  establish  an  grriraaestal  ecjajmctric 
■xfcl  forstohrbg  tie  pressure  cf  economic  cfevelopmect  and 
population  growth  as  veil  as  the  fmctico  cf  technical  progress 
on  environmental  protection,  analyzing  and  evaluating  Lie  states 
cf  environmental  pollution.  forecasting  tie  developing  ferofexy 
ia  fctcre.  and  appraising  bteratively  the  different  controlling 
alternatives  to  sene  the  tfecbico  aeking  of  ecvircaestal 
management. 

1 .  Overall  desist  of  ncrocxtrol  model  of  total  amount  of 
pollutant  for  regiocal  environmental  pol lotion 
Thb  aacrococtrol  model  involves  the  integrative  study  on  tic 
generation,  discharge,  control  or  reduction  and  comprehensive 
utilization  of  the  pol lotar.ts  fro*  economic  sectors,  resident's 
daily  living  and  social  cocstaplion  as  wll  as  the  investment 
needed  vithia  the  area  of  jurisdiction  ( province.  autenovus 
region,  ■clct polity  directly  inder  the  antral  govormst.  city 
and  prefecture)  .The  overall  structure  designed  as  show  ia  fig.1 
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cocsbts  of  three  pert ss  the  model  of  economy  acd  popebtioo. 
the  aacrococtrol  model  of  total  ascent  of  polluted  discharge 
b  wdewat/r,  waste  gas  and  solid  taste,  and  the  model  cf 
integrated  acalrsb  of  csviroomect  aad  economy. 

A. £ccnxy  and  pcpcUiico 

Economic  development  and  growth  of  population  are  the  storting 
points  cf  ecvimental  pollctioo.  The  cstablcfarst  of  models 
of  regional  economic  dyykpmst  and  population  control  provides 
favorable  condition  rot  only  fer  dawbg  cs>  plan  or  pmcra*. 
forecast  tag  and  analyzing  the  si  tat  ion  of  economic  and  social 
tfevekpRct,  bet  also  for  stohias  the  relation  of  environmental 
pollution  to  economic  and  social  detfloptcaLlidr  the  addition 
with  these  mod»b  established. the  models  on  be  csed  in  eombisr 
tioo  with  the  mc&l  of  total  aaxnt  octroi  to  get  information 
about  the  ugect  of  cconccic  development  and  population  growth 
on  environment  and  also  the  constraint  of  environmental  polio* 
ticn  on  economic  development.  of  which  the  quantitative  analysis 
■ay  provide  the  fcasb  for  dxbico  making  of  the  coordin&tive 
development  of  cccncwy  and  environment.  fa  case  these  mxfcls  are 
not  established,  the  indexes  of  economy  and  population  may  be 
used  as  external  variables  for  the  model  cf  total  amount  control 
with  the  following  requirements: 

(1)  Collection  of  information  about  the  economy  and  urban 
peculation  of  the  refoincc  year  according  to  the  economic  index 
of  the  "model-. 

(2)  Collection  of  the  similar  infarction  of  the  level  year 
from  related  sectors  in  the  area  of  Jurisdiction. 

The  sectoral  economy  consists  of  two  categories:  that  of 
province.autcnomous  region, large  and  aodits  cities  b  classified 
into  30  sectors  according  to  the  rational  statistic  yearbook  and 
the  regolrements  of  the  statistic  department  of  KEPA^nd  that  of 
small  cities  lists  alt  main  pollution  soirees  and  controls  more 
than  83  of  pollutants  in  the  area  of  jrisdiction.  Based  on  the 
characteristics  studied  on  the  total  aaouit  of  pollutants,  the 
related  indexes  of  corresponding  economic  sectors  are  selected 
as  external  variables  for  the  nodal  of  total  axxnt  control  to 
daw  ip  planing  alternatives  with  similation  of  the  reference 
year. 

B. Tbe  sxrocontrol  model  of  total  amxnt  for  waste  water 

This  model  b  composed  of  tw>  ports:  the  model  of  waste  water 

from  economic  sector  and  the  model  cf  irban  sewage.  The  poll«r 
tacts  are  CCD,  oils  and  others  selected  according  to  the  charac¬ 
teristics  of  the  region.  With  the  amount  of  generation  of  waste 
inter  and  OX)  and  also  the  output  value  of  the  sector  in 
reference  year, the  generation  coefficient  of  waste  water  per  ten 
thousand  dollar  of  output  value  and  that  of  ODD  per  ten  waste 
water  may  be  calculated  icing  the  node!  established.  Considering 
the  progress  of  science  and  technology,  the  coefficient  of 
generation  of  pollutant  in  level  year  cay  be  modified:  and  the 
amoxit  of  pollutant  generated  by  the  sector  can  be  calculated 
with  the  output  value  in  the  level  year  (  of  the  planning  period 
or  forecasting  period). Using  the  control  rate  of  waste  water  for 
three  ( high,  medium  and  low  )  alternatives,  the  corresponding 
investment  for  the  achievement  of  environmental  objective  of 
total  arnouit  control  can  then  be  derived  through  optinization. 
As  to  the  irfcen  sewage,  the  generation  coefficient  of  domestic 
sewage  per  capita  can  be  cbtaind  similarly  from  the  urion 
population  and  asoent  of  sewage  generated:  and  also  the  amxnt 
of  sewage  and  (HO  based  on  specific  conditions  in  the  area  of 
jurisdiction.  Again,  the  control  rate  is  used  as  variable  to 
derive  the  investment  for  control  land  the  benefits  of  sewage  re¬ 
cycle  may  be  obtained  according  to  the  amxnt  of  sewage  recyccd. 

C.  The  eacrocontrol  model  of  total  amount  for  atmospheric 
pollutonts 

This  model  cents  ire  three  parts:  (1)  combustion  process  of 
economic  sector  generating  waste  gas,  smoke  and  dust  S02  and 
Hfe;  (2)  technological  process  of  economic  sector  generating 
waste  gas,  industrial  dust,  S02  and  HOx?  and  (3)  fuel  burning 
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process  of  resided'*  deily  living  and  social  cassKtico  with 
pollst ants  sr  as  (1).  bcla dirg  the  energy  soirees  csed  by 
restctectsfexa  as  coal,  trivet,  liquidized  tzs.  lateral  gas  and 
electric  stove).  cxstrelizzd  haetswoly  and  tie  energy  for 
enterprises  zed  ergmizztiens.  lie  2»®t  of  atmospheric  pollu¬ 
tants  generated  ia  cystica  process  aa  be  calculated  on  tie 
tests  cf  different  fael,  type  of  cnrfxstkn  and  discharge  factor 
of  pollctact:  aad  that  generated  ia  technological  process  cao  be 
calculated  similarly  as  for  waste  wter.  i.e.  based  oo  tie 
general  toe  coefficla*  of  pollutant  per  ten  thousand  dollar  of 
output  value.  The  control  variables  for  combustion  process  are 
tie  rates  cf  sack?  elimination  and  dedtstiag.  etc.;  those  for 
technological  process  are  control  rate  of  waste  gas.  recovery 
rate  of  industrial  dbst  and  rate  of  recti Iizatico  of  S02,  JCfec 
and  others:  those  for  energy  used  ia  daily  living  are  rate  of 
gasification,  percentage  of  briquet  ia  tie  coel  used  (or  tie 
coefficient  of  energy  stnetsre  for  domestic  usage)  ^nd  that  for 
central  izrd  beet  supply  is  tie  area  of  heat  strply.  Vith  these 
variables  mentioned  above  and  tie  objective  of  total  anxt 
control  of  atmospheric  pollutants,  tie  aaar.ts  of  various  pollu¬ 
tants  discharged,  tie  amxcts  controlled,  reduced  and  cowrehor 
sirely  utilized  wit*  various  facilities  and  tie  corresponding 
investments  of  ecviramental  protection  can  be  calculated  for 
three  alternatives  (high,  maditmi  ad  lev).  Ia  order  to  achieve 
tie  larcKootrol  objective  vith  effective  and  rational  inrest- 
Knt  of  environmental  protection,  according  to  tie  results  of 
calculation  for  tie  analysis  and  evaluation  are  carried  out  vith 
criteria  of  value:  all  control  variables  are  vodified:  and  nev 
alternative  is  established  ictil  it  is  satisfactory. 

D. Tne  Batccctrol  model  of  total  amowt  for  solid  waste 

This  wfel  coexists  of  tw>  parts1  the  control  model  of 

industrial  solid  teste  of  caxrmic  sector  and  the  control  mxfel 
of  trbaa  dsmestic  refuse  and  maore.  Tie  mistrial  sol  id  waste 
includes  smelting  residue.coal  ash. slag. fargue.cbemical  residue, 
tailing  and  other  residue:  and  its  amxat  of  generation  is  also 
drived  frev  its  generation  coefficient  per  ten  thousand  dollar 
of  output  value,  while  that  of  coal  ash  or  slag  is  calculated 
vith  the  ccraaption  of  energy,  that  of  gangue  vith  its  genera¬ 
tion  coefficient  per  ten  tkxsand  of  raw  coal,  and  that  of 
doaestic  refuge  and  wtre  vith  their  generation  coefficient  per 
capita  of  urban  population  in  the  area  of  jurisdiction.  The 
control  variables  for  industrial  solid  vaste  are  the  rate  of 
comprehensive  utilization  ard  the  rate  pf  treatment  and  disposal: 
and  those  for  domestic  sol  id  waste  are  the  rate  cf  mechanized 
clean**  and  transport  and  the  rate  of  detoxification.  These 
variables  are  used  to  derive  the  aacunt  of  cowrehensive 
utilization,  treatment  and  disposal  solid  vaste  for  three  alter- 
natives  (high,  medium  and  lov),  and  also  the  corresponding  invest¬ 
ment  of  environmental  protection.  Siiilarly,  the  criteria  of 
value  is  used  to  ftrther  modify  the  control  variables  to  select 
rational  investment  for  achieving  the  objective  of  environmental 
protection  planning  or  forecasting  period.  The  structure  of 
macroccntrol  model  of  total  amount  fer  vaste  water  as  show  in 
Fig.2  demonstrates  also  the  structure  of  the  model  for  atmosphe¬ 
ric  pollutants  and  solid  vaste. 

E. The  integrated  analysis  of  environment  and  economy 

The  model  of  integrated  analysis  is  established  on  the  basis 
of  Bcrocontrol  model  of  total  amount  for  vaste  vater,  atmosphe¬ 
ric  pollutants  and  solid  vaste:  and  focused  on  tho  analysis  and 
evaluation  of:  (1)  the  total  investments  for  controlling  various 
pollutants  from  different  industries  or  enterprises  and  their 
ratio;  (2)  the  total  investment  for  controlling  the  pollutants 
from  resi&nts'daily  living  and  social  ccsimption:  (3)  the 
percentage  of  environmental  protection  investment  of  production 
sector  to  the  investment  of  capital  ccnsructicn  or  renovation 
of  the  indsties  cr  enterprises  of  the  sea  or!  (4)  the  percentage 
of  investment  fer  environmental  protection  planning  to  the  GKP 
in  the  planning  period:  and  (5)  the  analysis  of  benefits  of 
control  .recovery  and  comprehensive  utilization  of  various  pollu¬ 
tants.  In  case  modification  is  needed,  users  can  base  on  the 
results  of  analysis  and  evalution  to  irput  nev  information  to 
the  control  variables  of  related  pollutant  to  get  nev  solution 
through  calculation  vntil  it  is  satisfactory. 


H.  function  cf  technical  progress  ia  environmental  protection- 
modificztioQ  cf  generation  coeff iciest  of  pollutant 

the  macrooctroi  model  of  total  amort  cf  pollutant  is  dynamic 
vith  ispcl  of  variables  (  sai  as  the  generation  coeff  iciest  of 
pollutant)  dang  tag  mil  the  time  because  of  extbuxs  progress 
of  science  and  technology  and  enhancement  of  management  level. 
The  charge  of  generation  coefficiect  of  pollctact  may  influence 
directly  the  amort  of  pollotact  generated,  discharged  cr 
controlled  ms  well  as  the  isvestment  for  pollution  control. 

__  fere,  the  modification  of  generation  coeffciect  of  pollutant 
is  the  key  to  successful  study  of  the  macrcoctrol  idel  of 
total  amoixtt  of  pollutant,  the  charge  of  vhich  is  related  to 
following  factors:  __ 

I.  The  technological  renovation  and  the  adoption  of  nev 
technique,  equipment  s  material  ia  prediction  sector  increase 
the  rate  of  utilization  of  energy  and  resou-ce  as  veil  as  econo¬ 
mic  benefits,  thus  lowering  the  generation  coefficient  of  pollu¬ 
tant  per  ten  thousand  dollar  of  output  value.  In  the  years  from 
1384  to  138$  in  China  the  discharge  coeff iciert  of  industrial 
sewage  was  decreased  by  95  each  yesr:  but  the  output  value 
rapidly  increased  by  11.3  and  the  elasticity  coefficient  of  the 
amort  of  sewage  discharge  was  0.2.  This  means  technological 
progress  has  very  great  influence  on  environment. 

2.  Enhancement  of  comprehensive  utilization  of  pollutant  mod 
development  of  technology  generating  less  or  no  waste  enable 
full  utilzation  of  resource  and  decrease  the  generation  coe- 
ffeient  of  pollutant. 

3.  Development  of  large-scale  enterprise  advanced  in  techno¬ 
logy  and  more  effective  in  scale  may  lower  the  consumption 
indexes  of  energy  and  rcsoirees  in  amparison  with  small-scale 
enterprise,  because  difference  of  the  structure  of  large,  medium 
and  small  enterprise,  has  marked  influence  on  the  generation 
coefficient  of  pollutant.  For  exa*>!e.  small  cement  plant 
generates  more  dust  per  uiit  output,  about  6-8  times  of  that 
generated  by  madium  cement  plant. 

4.  Enhancement  of  management  level  of  enterprise  and  implemen¬ 
tation  of  the  responsibility  srste*  for  working  post  to  reduce 
spattering  or  leaking  and  generation  of  pollutant  in  production 


To  sum  ip,  the  modification  of  generation  coeffcient  of  pollu¬ 
tant  is  of  great  significance. 
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aged  4i.  Then  the  unified  Zjt  defined  as 


In  this  paper,  we  ate  pattern  clattering  and  multivari¬ 
ate  analysts  of  variance  method*  to  analyze  the  daily  load 
patterns  in  Ontario,  Canada,  in  order  to  determine  the 
impact  of  the  Time  Of  U*e  rate  implementation.  Our 
finding*  show,  that  in  the  first  year  after  the  implemen¬ 
tation,  there  were  no  significant  changes  in  the  electricity 
conuuaption  behaviors  of  the  customers  selected  for  this 
study. 

1  Introduction 

Time-of-Use  (TOU)  rate  has  been  proven  to  be  effective  in  achiev¬ 
ing  desirable  electricity  load  patterns  [2}t  {3J  In  this  paper,  we 
analyze  the  DLPs  in  order  to  detect  impact  of  the  TOU  rate  on 
electricity  demand  patterns.  Two  approaches  are  employed  The 
first  one  is  the  pattern  clustering  method,  which  partitions  the 
monthly  averaged  DLPs  for  a  number  of  years  into  different  clus¬ 
ters  (groups)  according  to  some  similarity  measures.  The  resulting 
dusters  are  then  analyzed  to  detect  the  potential  pattern  changes, 
and  measure  their  significance.  The  second  approach  adopted 
is  the  well-known  multivariate  analysts  of  variance  (MANOVA) 
method  II).  Il^ compares  the  unified  DLPs  of  different  years  in 
order  to  determine  if  they  are  statistically  different. 

The  two  methods  are  used  to  analyze  the  real  electricity  con¬ 
sumption  data  in  Ontario,  Canada. 


where 


Since  we  only  use  unified  DLPs  in  the  rest  of  this  paper,  the 1  will 
be  suppressed. 

2.1.2  Step  2  :  Clustering 

The  classical  clustering  method  can  be  formally  described  as  the 
following.  Let  Af  be  an  integer  set,  {dj},j  6  M  a  set  of  objects 
{</,},  and  n,  1  <  n  <  M  an  integer  where  M  is  the  number  of 
objects  in  M.  The  dustenng  method  solves  the  following  opti¬ 
mization  problem : 

J  =  m\nY,  D(c 0  (1) 

V  Sal 

where  C  =  (ci.cj,  ,c,)  is  a  partition  of  M  with  eg  (J-  •  Uc*»  ~ 
M  and  CiflCj  =  £,»  £j. 

DM  =  m (2) 

">  >o. 


£<4. 


2  Methodology 

2.1  Pattern  Clustering  Method 

The  pattern  cluttering  method  proposed  conritls  of  three  steps 
The  first  step  is  data  reduction  and  unification.  We  lake  this 
step  to  compress  the  data,  reduce  random  interference  and  re¬ 
move  the  effects  of  economical  or  weather  conditions.  The  second 
Step  is  clustering  The  unified  average  DLPs  produced  by  the  first 
step  are  processed  by  the  so-called  hierarchical  clustering  method 
which  partitions  the  DLPs  into  2f  clusters,  l  =  The  clus¬ 

tering  results  are  presented  as  tables  of  graphic  patterns  for  visual 
inspection.  The  last  step  makes  use  oi  quantitative  measures  to 
assess  the  significance  of  the  TOU  impact. 

2.1.1  Step  1:  Data  Reduction  and  Unification 

For  a  particular  year,  denoting  the  DLP  of  day  j  e  {1,  -  -  -  ,365} 
as  dj  =  (cfj  dj  •  -  •  dj4j,  then  the  monthly  average  d„,  =  (d^,  <P„ 
of  month  i  will  be 


dm  i  ~  - 


*»«»-* 

:  £ 

1 


•**,12 


where  mi  is  the  number  of  days  belween  Jnnunry  1  and  the  first 
day  of  the  srt  month  plus  1, 

..I, °"ly  >"<««<'£  !».DlP’*  ,h"P'  l>»*  not  its  absolute 
value.  So,  we  apply  a  unification  approach  to  the  monthly  aver- 

•Tin  research  is  parity  supported  by  a  {rant  from  Ontario  ltydio  and  a 
grwl  from  NSERC, 


and  I]  *  |]  denotes  the  Euclidean  distance  Vectors  dj  and  partition 
C.  attuning  the  above  minimum  are  called  the  cluster  centroids 
and  optimal  partition  respectively.  It  is  not  difficult  to  show,  that 
dj  =  where  nk  is  the  number  of  objects  in  cluster  k. 

To  solve  (1),  several  iterative  algorithms  are  available  |4) 

In  this  paper,  a  sanation  of  the  classical  clustering  method 
called  hie raixhical  clustering  is  used  to  generate  2(  clusters  It  first 
divides  all  objects  into  two  partitions  (dusters)  using  Algorithm 
I  These  two  partitions  are  called  the  first  level  partitions  Then 
each  partition  at  the  first  level  is  further  divided  into  two  smaller 
partitions  called  second  level  partitions,  and  so  forth  So,  at  level 
l,  there  will  be  2f  partitions  (dusters) 

The  hierarchical  clustering  results  are  presented  as  graphic 
pattern  tables.  An  example  in  which  four  years’  48  average  DLPs 
axe  analyzed  is  given  in  Fig  I.  In  this  table,  DLPs  belonging  to 
one  duster  are  represented  by  one  graphic  pattern  Dy  visually 
m*pecling  the  table,  one  can  easily  observe  three  features.  First, 
,  there  are  seasom  1  variations  For  each  year,  the  average  DLPs  of 
d’MMay  to  September  (summer  season)  belong  to  one  duster  while 
that  of  October  to  April  (winter  season)  belong  to  another  at  the 
second  level.  Second,  years  86  to  88  seem  to  be  very  dose  to  each 
other  while  year  89  looks  very  different.  Third,  the  difference 
belween  86  to  88  and  89  seems  to  be  much  larger  than  seasonal 
variations. 


2.1.3  Step  4  s  Using  Quantitative  Measures 
We  define  the  two  distances  ; 

/.  The  Centroid  Distance  CD,j  ts  defintd  as  the  distance  be¬ 
tween  the  centroids  of  cluster  t  and  cluster  j 
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2.  The  Upper-Bound  diiianct^bttetin  clusters  t  . and  j  jis  de¬ 
fined  as  the  sum  of  the  centroid  distance  and  the  objective 
function ,  and  denoted  by  UD,j.  Thus  UD,j  =  CDtJ  +  DBJ. 

If  we  denote  the  maxi  mam  distance  between  the  element*  of 
two  cluster*  as 

d,j  —  vnx.ec.jeCj  1  dm  —  dj  jj,  then  we  can  *ho*7  that  CD,j  and 
UDtj  pro-ride*  the  upper  and  lower  bound*  of  d,/,  i.c.,  CDy  < 

<hj  <  UD.f.  , 

We  use  the  previous  example  to  show  that  these  two  quantities 
can  be  used  to  measure  the  difference  between  two  cluster*. 

We  hare  mentioned  that  visual  inspection  indicated  that  the 
difference  between  8S-88  and  89  i*  a  dominant  factor.  This  state¬ 
ment  can  be  quantified  as  follow*.  Let  u*  turn  to  Fig.  2  which 
show*  the  distance*  between  centroid*  and  objective  function*  of 
the  hierarchical  clustering.  Check  the  first  level  of  Fig.  1  and  2. 
The  difference  between  86-88  (a  cluster  denoted  by  0)  and  89  (A) 
can  be  characterized  by  CDqajs  0.3189  and  UDqa  =  03189  + 
0.7336  =  1.0525.  The  seasonal  changes  can  be  seen  from  the  sec¬ 
ond  level.  For  year*  86-88,  it  is  the  distance  between  0  and  and 
GDpo  ~  0.2224  and  UDoo  -  0.2224  -f  0  2261  =  0.4485.  For  year 
89,  it  is  the  distance  between  A  and  A,  and  CD**  =  0.1237  and 
VDaa  =  0  1237  +  0.0234  =  0.1471.  Apparently,  CD^a  and  UDoa 
are  much  larger  than  their  counterparts  C7?oo,  UDqqiCDaa  and 
UDaa  Thus,  the  difference  between  86-88  and  89  is  indeed  bigger 
than  seasonal  changes. 

Lei  us  use  the  same  example  to  show  how  the  difference  be¬ 
tween  year*  88  and  89  is  quantified.  Since  the  difference  between 
two  years  is  defined  as  the  average  of  the  difference*  between  their 
corresponding  months,  we  first  calculate  the  difference  for  each 
month  Let  us  start  from  January.  At  the  third  level,  January  89 
and  January  88  belong  to  /j(A)  and  C|(0)  respectively  (see  Fig. 

Since  the  approximation  approach  using  two  bounds  cannot 
be  directly  calculated  for  /,  and  c,,  we  go  up  2  level*  on  F,g>  2  to 
the  first  level.  Because  January  89  is  a  member  of  a3  and  January 
88  a  member  of  a j,  we  use  the  difference  between  a i  and  a3  to 
approximate  the  distance  between  the  two  elements.  We  see  from 
Fig.  2  that  CDstm,  =  0.3189  and  OBJ,I0,  -  0  7336.  Therefore 
UDn;  =  0  3189  +  0.7336  =  1.0525.  Since  February  and  March 
data  are  idenlicaf  to  January,  we  turn  to  April,  and  find  that  April 
88  belongs  to  /j(O)  and  April  89  belongs  to  flj(oo).  Checking  Fig. 


2,  we  learn  that  the  difference  between  e*  and  ej  can  be  used  to 
approximate  that  between-/;  and  Oj.  Simple  computation  shows 
that  CD.,,,  =  0.1237  and  UD<it,  =  0.1471. 

Repealing  this  process  for.tne  remaining 'months,  we  finally 
obtain 

CDn„  =  (cB&>  +CD^+-  +  C0gS)/12 

=  0.286 

and  U Du#  =  0.902. 

Thus,  roughly  speaking*  the  difference  between  88  and  89  he* 
between  0.286  and  0.902. 


2.2  Multivariate  Analysis  of  Variance 
Method  (MANOVA)  (1) 

3  Experimental  Results  and  Conclu¬ 
sions 

Both  pattern  clustering  and  MANOVA  method*  were  used  to  ana¬ 
lyze  the  DPI*  of  the  Ontario  Western  and  Eastern  Systems.  None 
of  these  methods  detected  any  statistically  significant  difference 
in  the  DLP’*  before  and  after  the  TOU  rate  implementation. 
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Figure  1:  Graphic  Patterns  for  Si 
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Abstract-Tbe  cstimabiWy  of  linear  parametric  function  In  linear  models  have 
been  studied  by  Baksalary  and  kala  [l],  Wang  Songful  [2]  and  bu 
S3tti»Tqypi*n  [33.  and  some  neceaory  and  sufficient  codition*  for  the  estlmibili- 
ty  of  linear  parametlc  function*  have  abo  been  estibflnhed  la  this  paper,  .we 
give  another  necewary  and  sufficient  condition  for  the  ettunib&ty  of  linear 
parametric  funcoccs  in  liear  model.  And  the  result  In  th*  paper  Led  odes  those 
which  have  been  given  In  [2]  and  [33- 

1.  latroducrioo 

Fee  several  ten  years,  linear  models  have  been  used  In  many  fields  of  na¬ 
tion  economy  In  our  courtry.  such  as  In  Industry,  agriculture,  mctccroJgy,  e- 
ccnnomy  managemen*,  medical  and  health  work,  education  and  psychology,  c. 
c.  And  extended  the  study  of  their  theories.  In  ccder  to  use  the  linear  model  to 
dove  practical  problems.  It  I* oesiesary  to  give  *  further  study  of  the  estifflihai- 
ty  of  linear  parametric  function  la  linear  models. 

A  general  Lnear  model  is( 

Y-X0  +  e.E(e)-O,cov(e) -c^V  (1) 

Where  Y  isannXl  vector  of  observations,  Xb  an  nXp  known  matrix, 
6  h  a  pX  1  vector  of  eskown  parameters,  and  e  b  an  nX  1  random  vector 
which  with  expectation  zero  and  covariance  e*V,  where  Vba  nonnegative  def¬ 
inite  matrix. 

IF  In  model  (1)  0  b  restricted  as  Hfl-b,  then  the  model  b  writea  as  toi- 
!owi| 

Y-Xfl  +  e,  H3-b,  E(e)-0,  cov(c)-o*V  (2) 

Where  H  b an  IXp  known  matrix,  b ban  1X1  known  vector,  the  others 
are  the  same  as  those  in  (1). 

We  know  that  the  parametric  functions  A0  of  0  b  estimable  If  there  b  an 
mXamaticsB,  such  that  E(0Y)— A0.  For  all  0,  where  A  ban  mXp  matrix. 
We  denote  $  as  the  least  square  estimator  of  0,  then  for  estimable  function  AS, 
we  have  Ad  b  the  best  linear  unbiased  estimator  of  A0  from  the  Gauss- Markov 
thorem  when  1 V | ^ 0  where  jv|  denote  the  determinant  of  the  matrix  V,  In 
partctice,  not  all  the  linear  functions  of  0  are  estimable.  So  some  necessary  and 
sufficient  conditions  for  the  estimabiUty  are  given  previously.  Searle  (I960) 
states  the  condition  tsi  the  linear  combinations  A0  are  estimable  If  and  only  If 
A(X*X)"X’X—A  where  (X'X)“  b  any  matrix  satisfying  the  matrix  equation 
X*X(X’X)-X'X-X’X.  (The  x-  b  called  the  g-Invene  of  Matrix  X).  Buk- 
salary  and  Kala  (197$)  CO  «*te  the  condition  as,  r(X(Ip— A"A))—r(X) 
— r(A),  where  r(X)  denote  the  rank  of  the  csllmabUlty  of  an  matrix.  Wang 
Scnggul  (1981)  [2]  give  another  condition  of  the  estlmabfiity  ot  Afl  and 
avoiding  the  computation  of  g-lnver*e  of  a  matrix.  What  to  do  In  hb  paper  b  to 
sieve  linear  homogenous  equation  and  compute  the  rank  of  a  matrix.  Uu 
Shuangqvan  ( 1988)  [3]  also  give  another  condition  which  Indude  that  In  [2]. 
We  now  abo  give  a  condition  which  Include  those  In  [2]  and  [33-  So  the  con¬ 
dition  In  thb  paper  b  the  extension  of  those  In  previous  papers. 

2.  Preliminary 

The  sequel  <2scus*  b  In  real  spaces.  Let  A,X,C,T,V  be  matrix  and  x.y, 
o«0. y,  be  vectors. 

Definition!  Let  and,— be s linear  Independent  vectors,  lhenL(a,, 


Ci.—#*)  b  ihe  space  spaned  by  the  vectors  <n,c,'~,o..  Chat  b  for  every 
'd€Wai»««»— •«•)•  there  exit  ti,t,,~,t.;  such  that, 

0  ■  tio,  + 1  j«i  +  •"+ t.(V> 

l^TTttnA  1 ,  Let  A  be  mXp  rnatix  R'.R*  be  puJtoeosfcmai  and  nwfrmensSonal 
spaces. 

A  |R’— -R*  be  projector,  then 

1. duE  (R»)  -  duaCR(A))  +  dim(N(A))  (3) 

2. Ra(A)-N(a')  (O 

Where  dim(  •  )  denote  the  dimensco  of  a  space , 

R(A)  -  {AXiX  e  R») 

N(A)  -  {a  1  Ax  -  O.x  6  R*) 

Ra(A)«  (x,xJ.R(A)> 

A*  b  the  transpose  of  matrix  A.  Form  lemma  1.  We  have( 
p  -  dun(R')  -  dim(R(A»  +  <Sim(N(A». 

So  the  dimendion  ot  solution  of  the  equation  Ax— 0  ba 

<ha(N(A»  -  p  -  d»m(R(A)>  ~  p  -  r(A)  (5) 

Lemma  2,  Let  A.B  be  mXp  and  pXq  matrices,  then 

r(AB)  -r(B)  -  dim(R(B)  0  N<A»  (6) 

-r(A)  -  6a(R(A*)  fl  N(B*))  (7) 


3.  Main  result 

From  [23  and  (33  wc  have 

Theorem  lt  In  linear  model  (1),  A0  b  estimate  If  and  only  If 

i(A)  +  r(XV)-.r(X)  (8) 

Where  Ah  mXp  matrix,  r(A)-k,  VbepX(p— k)  mrJx,  Whose  rov 
vectors  are  the  solution  of  A* AY  —  0- 

Theorem  2*  Let  T  be  pXq  matrix  such  that  N(A)—R(T).  Then  Afl  b  es¬ 
timable  in  linear  model  (1)  If  and  only  If 

r(A)+r(XT)-r(X)  (9) 

From  theorem  1  and  theorem  2  we  known  that  theorem  2  b  the  extension 
of  theorem  1  since  when  for  a  pXq  rnatix  T  such  that  N(A)  — R(T)  theorem  1 
b  also  bold.  But  there  b  no  answer  In  f  3]  about  the  search  of  T.  In  thb  paper 
we  give  another  condition  which  extended  the  matrix  C,  and  also  give  the  way 
of  how  to  search  C. 

Theorem  3i  LetCbepXq  matrix,  such  that 

dim(R((X’>  0  N«T»  -  <Hm(R(X’)  fl  P.(A)> 

Then  Afl  b  estimable  in  linear  model  (I)  if  and  only  If 

r(A)  +  r(XC)  -  f(X)  (10) 

Proof  1  Rom  lemma  2,  we  have 

r(XC)  -r(X)  -  <Ho(R(X')  0  N(Cf» 

So  *(XC)  -r(X)  -  dim(R(X*)  0  R(A')) 

From  the  definition  of  estlmabillty,  we  know  that  Afl  b  estimable  If  and 
only  If  there  esist  a  mxn  matrix  B  such  that  A— BX,  where  plx;  denote  the 
space  spaned  by  column  of  X.  But  p(A')Cu(X*)  b  equivalent  to> 
dim(R(x)  nR(A’))  -r(A) 

So  r(XC)  -  r(X)  -  dJm(R(X’)  f|  R(A’))  -  i(X)  -  f(A) 
r(A)+r(XC)-r(X) 
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Note  1 1  Free  theorem  2,  we  know  tha;  T  Is  used  as  C  in  theorem  3  but  the  *c- 
cepcabfe  spectrum  of  C  b  roach  greater  than  that  of  T,  Beam  use  N(A)*R(T) 
stale  Cat  N(A)  and  R(T)  have  the  same  maximum  linear  independent  vectors, 
but  C  is  different  from  that,  thb  may  be  seen  as  follows, 

Let  a,,a,,‘",£fc,<u*i»—»a«,»be  maximum  linear  independent  vector  of  R 
(X")«  where  li«r(X‘)"r(X)>*i  a4,a,,--;,ft,fUi,-**.ft1beroaIxroum  lin¬ 
ear  Independent  vectcrsof  R(A#).  where  ti™r(A')*»r(A)>s,o^, o„”*,a,5 
Y»*i»*"*Y*  be  maximum  linear  Independent  vectors  of  N(0  where  r«p— r 
CCT)“P— r(C»s,  and  aim,  a,,  ft,  Y«0“l,2.—.ti*  J-s+l,s+2,—.tn 
I  »*+!.*+ 2, •■*,r)  are  each  other  Independent  vectors.  From  the  suppose 
above,  we  have 

dimfROO  n  N«r»  -  * 

-d£a(R(X')  0  R(A') 

So  If  there  Is  abo  r(XC)  -  r(X)  -  r(A). 

We  have  that  AS  b  estimable  from  theorem  3.  But  under  this  circum¬ 
stance,  N(A)^R(C),  we  can  not  use  theorem  2  to  justify  if  AS  b  estimable 
or  not.  Even  If  r(XC)-r(X)-r(A)  bbc*L  When  N(A)-R(C)  theorem  3 
b  obviously  bold  since  from  lemma  1 ,  we  have 

N«f >  -  R1-  <C)  -  N*  CA)  m  R(A*) 
dim(R(X')  fl  N(C))  -  dim(R(X*)  fl  R(A*)) 

So  the  condition  N(A)  "> R (C)  b  only  a  particular  situation  of 
<Lm(R(X’)  0  N(C*))  -  dim(R(X’)  f|  R(A’) 

So  the  results  in  our  paper  b  the  extcnsJcn  of  those  in  £2]  and  [3} 

Note  2,  How  to  search  for  C? 

Since  X.x„  Amx,  are  also  known  xrutrics,  so  R(X’),  R(A')  are  also 
known 

Let  x(X')  ••  t,  <  mln(p,n) 


And  C  can  be  get  from  sieving  the  cqution. 

Note  3»If  we  ebese  do,*,,— ,c»as  c,,o,,— ,a.,y,,y„— ,r.as 
-A 

then 

M*  —(aa,ao,*-,<V  Yi,Yi.”’.Y.> 

—(a,  .a,. —.a,',  ft*., &♦„—,&,) 

Where  u  ■■  t,  —  s 
SoM-A 


And  MC**0  b  equivalent  to  AC~0.  Since  A*Ax~0  have  the  same  solu¬ 
tion  as  Ax>«0,  then  we  get  theorem  1. 

{the  solution  of  MC-  0)Z>  (t»Mt-  0,  t  €  R»}  fl  (Ct,t  €  R«> 

So  the  theorem  2  b  a  particular  situation  of  theorem  3.  For  I  .near  model 


O-GHI-  ■(:)-• :) 

Then  we  have, 

Corilary  1  >  Let  C  be  a  pXq  matrix,  such  that 

dim(R(X’  H')  0  N((T))  -  dlm(R(X*  H’)  0  R(A’)) 
then  AS  in  model  (2)  b  estimable  if  and  only,  if 

"xM-.ca, 


Where 


(X-H)-(X)' 


Let  a  multivariate  linear  model  b 

Y-XS  +  e  (II) 

Where  Y  b  an  nXm  random  matrix,  X  b  an  nXp  known  matrix,  and  0 
b  a  pXm  unknown  parameters,  e  ban  nXm  random  matrix,  let  U(X)  denote 


r(A’)  —  t,  <min(p,m) 

The  maximum  linear  independent  venters  of  R(X') ,  R(A’)  are  the  same 
as  those  supposed  in  Note  1  Since 

0<s<mJn(t|,t,) 

Then  dim(R(X')nR(A,))-s 

Form  theorem  3  If  there  b  also 

dlm(ROO  f|  N(CT))  -  • 

Then  the  estlmability  of  AS  In  linear  model  (1)  can  be  Jjstifkd.  For  If  C 
has  be  searched  for,  we  can  do  b  only  to  >*stify  If 

r(XC)  -  r(X)  -  r(A)  b  tn»  or  not 

The  ways  of  c hosing  N(C)  are  at  least  C^.  Beacause  we  can  just  chose  s 
vectors  from  the  maximum  linear  Independent  vectors  of  R(X’) ,  then  add  any 
vectors  which  are  independent  with  Oa,Oi,***,a^.  the  vectors  we  chosen  and 
the  added  vectors  form  a  new  linear  Independent  vectors  whwe  dimension  b 
smaller  than  the  whole  space  considered. 

Suppose  the  choen  vectors  are, Ok,, a,. 

Where  li,It'ii,“*J,is  a  permutation  of  l,2,-',ti  let  Yi»Yi,*“*y»  be  the 
adding  sect  on  which  are  Independent  with  o.i .  ,  o.then  a,,,og,  y,» 

Yi»*"»Y»  form  a  new  space  as  , 

Yi  »Yi  *  »Y«) 

LetN(C?>  •*  L«(oiii, ad, Yi.Y»»—.Y.) 

AndM’  «  («d,o.»< •••♦a..  Yi»Yi»*".Y.) 

Where  M'  b  a  matrix  formed  by  cti.Od, •••,«•.  YnYi»,,,*Y.. 

From  U«f)  -  {t,  -  0, 1 6  R») 

We  have  C (a,,, oid, —  ,o..  Y»*Yi,«",Y.)  “  0 

Cm*  -  o 

SoMC-0 


the  trace  of  matrix  X,  tr(AS)  b  estimable  If  there  exist  an  mXn  matrix  B  such 
that, 

E(tr(By))  -  tr(AS) 

From  the  definlton,  we  known  that  ir(AS)  b  estimable  if  and  only  If 
there  exist  a  mXn  matrix  B  such  that, 

A-BX 

Corollary  2,  Linear  model  (II)  If  there  exist  a  pXq  matrix  C,  such  that 
dim(R(X*)  f|  N(C))  -  dim(R(X‘)  fl  R(A*)) 
then  tr(AB)  b  estimable  if  an  only  if 

i(XC)  -  r(X)  -  r(A) 
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Abstfact— In  this  pi  per, the  Doubie- Level  Prediction  Model  is  raised 
and  by  using  it,  a  new  method  to  solve  prediction  problem  of  regional 
gross  output  v»  hie  of  Industry  is  presented.  The  results  of  practical 
checking  computation  are  satisfied. 

1 .  Introduction 

Itisc.  ^gmficance  both  in  theory  and  practice  to  analyse  and  predict 
regional  industry  economy  system.  It  Is  not  only  helpful  to  make  mid. 
dle-term  or  long-term  plan  scienti/iely.but  also  has  Important  guiding 
value  in  managing  and  adjusting, controlling  the  recent  industry  econo¬ 
my.  Formerly  general  Time  Series  Analysis  Method  was  mostly  used  to 
analyse  and  predict  regional  industry  economy  developing  state.  But  the 
regional  gross  industrial  output  value  Isn\  a  Stationary  Random  Se¬ 
quence,  therefore,  the  application  of  classic  Time  Series  analysis 
method  will  come  across  some  difficulties.  Sometimes  the  time  series  da¬ 
ta  have  to  be  resolved.  The  tendency  term  and  period  term  have  to  be 
separated.  In  case  the  other  terms  were  stationary ,  the  classic  Time  So- 
nes  Analysis  Method  was  used  to  predict  the  gross  Industrial  output  val- 
uc. 

By  using  the  Double-level  prediction  Model  and  Multi-Level  Recur¬ 
sive  Prediction  Method ,  It  w©n\  bo  necessary  to  resolve  the  time  series 
and  the  supposition  of  the  steady  lime  series  also  Isn’t  necessary.  And  the 
ecmputkm  amounts  of  prediction  will  bo  decreased  greatly. 

1 .  Double-Level  Prediction  Model  and 
Multi-Level  Recursive  Prediction  Method 

Thinking  over  the  changing  condition  of  gross  industrial  output  value 
of  one  atca,  Let  Y(k)  stand  for  the  gross  output  value  of  Industry  at  k 
time,  let  U(k)  stand  for  the  Input  of  Industry  system, Including  Invest¬ 
ment  in  fixed  assets  and  circulating  funds.  Generally  speaking,  the 
charging  law  of  Y(k)  can  be  described  by  the  model  of  the  form  below. 

Y<k)-fa,Y<k-l>-f-+a.Y(k-n)-b.U<k--p)+b,U(k-p- 

I)  +  -+b.U(k-p-m)+V(k)  <1) 

where  a, ,  ••  a. ,  b» ,  -b.  are  awaiting  parameters ,  V  (k  )  is  Random 
Noise  term,  interference,  n  and  m  are  ranks  of  the  model, and  p  is 
Time  Lag.  The  parameters  a,,— a,,  b*,— b«  of  the  model  are  Time 
Varying.  The  changes  of  them  show  some  relevant  changes  of  economy 
Inner  system.  And  the  literature  Cl)  Indicates  that  the  Random  Noise  V 
<k>  of  .:,xM  (1)  can  be  omitted.  But  at  this  time,  it  must  be  compen¬ 
sated  by  the  Parameter  Randanllamtion.  So  we  can  change  model  (1) 
Into  the  form  Wow 

Y<k)-fa,(k>Y(k-l)+«--fa.(k)Y(k-n)-b,(k)U<k-p)-fb, 
<k)U<fc-p-l)+-+b.(k)U(k-p-m)  (2) 


among  them,  a,(k),— a«(k),t«(k),— b„(k)  are  Random  Time 
Varying  parameters,  n  and  m  are  ranks  of  the  model  p  is  Time  Lag. 
They  have  certain  economy  meaning. 

If  0<k)-(-Y<k-l>,-,-Y<k-n),U<k-p),U(fc-p- 
1);-U(k— p-m»‘ 

«(k)  -  {a, (k) ,— a. (k)  ,b»(k) ,  -b.(k) }• 
then  model(2)  can  be  transformed  Into 

Y(k>«  0(k)f(k)  (3) 

where  8(k)  is  a  multi-dimension  random  series.  Model  (2)  or  Model 
(3)  is  called  First-Level  Prediction  Model. 

Since  fi(k)  Is  a  multi -dimension  variable, the  mathematics  model  M 
(0(k»  that  ®(k)  can  be  satisfied  can  be  created  according  to  itself 
changing  character.  At  this  place , 

e(k)-{8(k),»(k-l),  —  ,  «(k-q)> 
here  q  is  a  proper  constant.  Since  #(k)  hasn’t  had  any  demisions ,  model 
M(8(k>)  can  be  set  up  by  the  pure  mathematics  method.  According  to 
IiteratureC2),the  Model  AR  method,  Constant  Increment  method, Con¬ 
stant  Factor  method,  Period  Variable  method  ,  Period  Increment 
method ,  Average  approximate  method  and  etc.  are  usually  used  to  set  up 
model  M(8(k)>  on  prediction  problem.  Model  M(0(k))  is  called  Seo- 
cnd-LeYel  Prediction  Model. 

As  far  as  Double-Level  Prediction  Model  goes, The  base  steps  of  the 
multi- Level  Recursive  prediction  Method, as  reference  (3)  mentioned, 
go  as  follow. 

1)  According  to  observing  data,  we  use  proper  distinguishing  method 
to  estimate  the  unkown  parameter  fi(k)  of  the  model  (3). 

for  example,  the  following  Recursive  Algorithm  can  be  applied. 

«»)-i(k-l)+T^iyp0(k)(Y(k)-0(k)'{<k-l» 

Where  $(k)  expresses  the  estimating  value  of  ®(k) 

2)  According  to  the  estimated  value  sequence  1(0),  J  (!),■■■  5 
tN),we  can  set  op  the  Second- Level  Prediction  Model  $(k-f 

(k» , Where  N, fa  the  present  moment,  and 

4(k)»{J(k),J<k-l>,-5(k-q)} 

3)  Through  model  M(d  (k)),  the  prediction  value  ft  (N+l),  5  (N 
+  2),  —ft  (N+h)  of  the  time  varying  parameter  8(k)  can  be  ob¬ 
tained.  Where  h  is  prediction  step  length. 

4)  According  to  Input  value se.-JesO(N+I— p),  0(N4-2— p),—0 
(N+h—p),  which  were  given  before  and  using  the  First-level  Predic¬ 
tion  Model  tf<N+ h)«-0<N+b)'l<N+h),th*  prediction  value  ?(N 
+ 1 )  ,Y (N-f  2) ,  -f  <N+h>  ,can  be  obtained.  Where 

0<N-H)~{-Y<N>,-  -Y(N+l-h),0(N+l-p),U(N  - 
p),-U(N-p+l-m» 

0(N+2)-(-^(N+l),-Y(N),- — Y(N-fl-p-h),0(N 
+2— p),0<N+ 1— p),— U(N+2— m— p)) 
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I .  Application  Example* 

1 )  Tbe  pr eJiction  results  of  Helloes  >*Ag  province  gross  cu’put  value 
of  industry 

Tbe  following  numbers  are  known  historical  data,  which  were  com¬ 
puted  according  to  fixed  price.  Tbe  unit  Is  RMB  100  million  yuan. 


year 

gross  output 

value  of 

industry 

Investment 

in  fixed 

assets 

1971 

131.9 

5.94 

1972 

138.4 

6. 67 

1973 

149.2 

8.46 

1974 

159.9 

8.96 

1975 

177.7 

10.97 

1976 

187.9 

8.26 

1977 

204.3 

8. 89 

1978 

222.1 

13. 25 

1979 

236.7 

12.70 

1980 

249.1 

15.90 

1981 

256.5 

14.05 

1982 

274.1 

18.68 

1983 

295.6 

24.39 

1984 

325.0 

26.10 

1985 

360.8 

28.65 

198$ 

389.4 

34. 09 

1987 

432.4 

41.22 

1988 

482.1 

43.57 

1989 

511.3 

38.97 

1990 

519.7 

45,39 

According  to  the  formulas  which  were  given  above, the  recent  year* 
prediction  values  are  as  follow*. 


year 

true  value 

prediction 

value. 

error  rate(J^) 

1987 

432.4 

423.7 

-2.0 

1988 

482.1 

461.6 

-4.3 

1989 

511.3 

505.0 

-1.2 

1990 

519.7 

555.6 

+6.9 

2)  About  the  prediction  result*  of  Harbin  city  gross  output  value  of 
industry 

The  following  number*  are  known  historical  data. 

Tbe  price  Is  also  fixed  price.  The  unit  is  RMB  100  million  yuan. 


year 

gross  output 

valued 

industry 

year 

gross  output 

value  of 

industry 

1971 

'29.2 

1981 

56.7 

1972 

30.8 

1982 

60.1 

1973 

33.2 

1983 

65.9 

1974 

33.3 

1984 

75.7 

1975 

37.4 

1985 

88.6 

1976 

37.5 

1986 

96.1 

1977 

42.0 

1987 

108.2 

1978 

47.5 

1988 

125.6 

1979 

52.2 

1989 

127.7 

1980 

55.7 

1990 

125.5 

According  to  the  formulas  which  were  given  above,  the  recent  year* 
prediction  values  are  as  follows 


year 

true  value 

prediction 

value 

error  rate(^) 

1987 

108.2 

107.1 

-1.0 

1988 

125.6 

116.0 

-7.6 

1989 

127.7 

125.9 

-1.4 

1990 

125.5 

134.5 

+7.2 

The  prediction  results  above-mentioned  express  that  either  in  Hei¬ 
longjiang  province  or  in  Harbin  city, the  error  rotes  in  1987  and  1989 
were  within  1  percent  to  2  percent.  But  the  error  rates  in  1988  and 
1990  were  between  A  percent  to  7  percent.  The  Industry  Increase  in 
1988  and  1990  were  in  unusual  situation  .  In  1988,  economy  was  in¬ 
flation.  Total  amount  was  uncontroled.  Industry  Increased  over  usual 
years.  The  output  value  of  industry  than  that  of  last  year  were  11.5 
percent  and  1$  percent  respectively.  So  A.  3  percent  and  7.  6  percent 
errors  occur ed.  In  1990,  demand  total  amount  was  control ed.  The  un¬ 
reasonable  economy  structure  was  adjasted.  The  Increase  speed  of  indus¬ 
try  decreased.  So  around  7  percent  error*  occured.  These  fact*  show  that 
using  Double-level  PredK-.cn  Model  and  Multi-Level  Recursive  Predic¬ 
tion  Method  to  predict  the  dynamic  <A  regional  Industry  economy  system 
Is  suitable  in  normal  years.  But  the  thorough-going  and  concrete  analysis 
should  be  done  In  unusual  years. 


Reference 

C  1  }Han  Zhigang,  Multi-Level  Recursive  Identification  Method.  Chi¬ 
nese  Journal  of  Automation.  Voll, No.  1  (1989) 

(  2  ) Han  Zhigang.The  Multi-Level  Recursive  Method  and  Applica¬ 
tion  ,  Science  press.  Beijtng.  1989. 

C  3  3 Han  ZhIgang,On  New  Method  of  Dynamic  System  Prediction, 
ACTA  Automatic*  Slnka.  Vol  9,No.  3(1983). 
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Abstract 

Recently  i  new  class  of  queueing  networks  with  "negative 
and  positive*  customers  was  introduced  and  shown  to  have 
product  form  solution  Positive  customers  are  identical  to  the 
usual  customers  of  a  queueing  network,  while  a  negative  cus¬ 
tomer  deletes  a  customer  when  it  arrives  to  a  non  empty  queue. 

As  the  customer  flow  equation  is  a  non  linear  fixed  point  equa¬ 
tion,  the  stability  problem  remains  open  Iu  this  paper  we 
present  an  algorithm  to  compute  the  steady  state  distribution 
of  these  networks  and  check  the  stationnanty  of  the  queues.  We 
also  establish  stability  of  the  system  as  the  proof  of  the  algo* 
rithm  implies  the  existence  of  a  solution. 

I  Introduction 

In  a  recent  paper  {2),  a  new  class  of  queueing  networks  in  which 
customers  are  either  ’’negative”  or  *positivc*  was  introduced  and  was 
shown  to  have  a  product  form  solution.  Positive  customers  enter  a 
queue  and  receive  service  as  ordinary  queueing  network  customers, 
while  a  negative  customer  will  delete  a  positive  customer. 

Models  with  positive  and  negative  customers  can  be  used  to  rep¬ 
resent  different  systems.  The  first  study  on  network  with  negative 
customers  Was  motivated  by  the  analogy  with  neural  networks  (3],  (4). 
Queues  with  positive  and  negative  customers  may  model  systems  ol 
objects  exchanging  positive  and  negative  signals.  Another  possible  ap¬ 
plication  is  to  represent  multiple  resource  systems  Positive  customer* 
can  be  considered  to  be  resource  requests,  while  negative  customers 
can  correspond  to  decisions  to  caacel'riiesc  requests 

Unlike  BCMP  or  Jackson  networks  (l),  the  customer  flew  equa¬ 
tions  of  this  model  are  non-linear.  Therefore  it  is  not  easy  to  de¬ 
termine  the  necessary  and  sufficient  conditions  for  the  existence  of  a 
solution  inside  the  domain  of  stability  of  the  network  (i  e,  the  con 
ditions  under  which  the  stationary  solution  exists  (3J).  Fkuthcrraore 
even  i*  we  know  the  form  of  the  solution,  we  do  not  know  how  to 
compute  it  except  in  the  case  of  feed-forward  networks. 

In  this  paper  we  propose  an  algorithm  to  compute  the  steady  state 
distribution  of  this  new  type  of  networks.  In  section  II  we  introduce 
the  product  form  theorem  shown  in  (2J.  In  section  III  we  present 
an  algorithm  to  iteratively  compute  q,  the  load  of  service  center  1 
and- check  the  stationnanty  of  the  Markov  process.  We  prove  the 
algorithm,  evaluate  the  complexity  of  one  iteration.  The,  proof  ol 
the  algorithm  implies  that  there  always  exists  a  solution  to  the  fixed 
point  system  we  consider.  Our  results  may  be  useful  either  in  Neural 
Networks  theory  or  in  Queueing  Network  theory. 

II  The  Model 

We  consider  networks  with  an  arbitrary  number  N  o*  queues.  All 
service  centers  have  exponential  service  time  distributives.  In  this 
model,  when  a  negative  customer  arrive*  to  a  non-empty  queue  it 
deletes  a  positive  customer.  A  negative  customer  arriving  to  an  empty 
queue  just  vanishes.  Negative  customers  do  not  receive  service. 

External  arrival  streams  to  the  network  are  independent  Poisson 
processes.  We  denote  by  A i  the  external  arrival  rate  of  po^uve  cus 
tomers  to  queue  1  and  by  Aj  be  the  external  arrival  rate  of  negative 
customers  to  queue  1,  The  services  are  exponential  with  mean  /»,. 

Each  customer  may  change  service  centers  and  nature  at  the  com¬ 
pletion  of  its  service.  These  movements  of  customers  between  queues 
are  represented  by  a  Maikov  chain.  Negative  customers  leaving  a 
queue  (either  after  having  destroyed  a  positive  customer,  or  after  find¬ 
ing  an  empty  queue)  disappear  from  the  network. 


A  positive  customer  which  leaves  queue  1  (after  finishing  service) 
goes  to  queue  j  as  a  positive  customer  with  probability  F+[i,j],or  as  a 
negative  customer  with  probability  P“(t,  j]  It  may  also  depart  from 
the  network  with  probability  d|i)  We  shall  denote  the  state  of  the 
queucang  network  by  the  vector  rs(ii,.  ,zn)  where  x,  represents 
the  state  of  service  center  t  and  1$  as  usual  the  number  of  customers 
in  queue  1-  We  denote  by  J*,|  the  total  number  of  customers  m  queue 
».  Let  Ilf*)  denote  the  stationary  probability  distribution  of  the  state 
of  the  network,  if  it  exists.  The  following  result  shows  the  product 
form  solution  of  the  networks  being  considered  (see  (2]  for  a  proof) 

Theorem  1  >For  an  open  neftcorJL  0/  stations  the  stationary  Steady 
state  probabilities  are  given  by  II(i)  —  nfii  &(*i)»  vhere  each  gi  is  a 
function  depending  on  the  service  center,  if  the  system  of  non  linear 
equations  (1),  (2),  ($)  : 


//<  4*  A  j  +  A, 

0) 

tf 

A*  =  £f+(;,«J/W 

(2) 

r»i 

-V 

(3) 

have  a  solution  Such  that  V  *  0  <  qt  <  1  Ar  and  A*  ore  the  arrival 
rates  of  negaitce  and  positive  customers  arriving  to  station  i  from 
other  stations  m  the  network.  Furthermore  the  p,(*,)  in  (2)  hove  the 
following  form  :  j,(x,)  as  (1  - 

Obviously,  the  customer  flow  equations  (Eq  (1),  (2),  (3))  for  this 
type  of  networks  are  non  linear.  So  the  existence  of  their  solutions  is 
a  difficult  problem  except  foi  feed-forward  networks  (1  e  the  networks 
are  directed  acyclic  graphs)  .  In  this  case  it  is  possible  to  compute 
the  load  of  the  queues  in  the  topological  order  of  the  graph  The 
network  stability  conditions  and  the  existence  of  the  solutions  to  cus 
tomer  flow  equations  introduce  some  mathematical  problems  whicn 
had  been  partially  solved  in  {3]  using  an  homotopy  function  |6j  la 
the  next  section,  we  state  an  algorithm  to  compute  the  load  (7,),«j„jv 
Therefore  we  prove  that  there  always  exists  a  solution  to  this  fixed 
point  system  of  equations. 

III  The  Algorithm 

Instead  of  solving  the  initial  problem  introduced  in  theorem  1,  vve 
design  an  algorithm  to  solve  the  following  fixed  point  system  : 

K  =  L,.t  (4) 


Note  that  the  stationnanty  condition  is  now  included  into  the  fixed 
point  system  since  qt  is  positive  and  smaller  than  or  equal  to  1 
If  for  any  1  q,  is  equal  to  I,  then  we  show  that  queue  1  is  not 
stable.  Otherwise,  the  solution  of  the  second  problem  is  obviously  the 
solution  of  the  first  problem  Furthermore  uniqueness  of  the  solution 
follows  from  the  Markovian  framework  of  the  model 
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Description 

We  consider,  for  any  queue  index  s,  six  sequences  of  real  numbers 
(5l».  (Jj»,  IVU.  I£>  Aefintd  by  induction  (on  k)  ns 

follows  _  . 

WU=A.  +  S>,P+li,Wal» 
f\-l»  =  Ai+E^.^(j.*W55U 
[A£!t=A!+Ef.i£-(j,t>,yt 
[57}t-f!  =  tntn(l,(AtJi/(;i,  -f  (A£lt)) 

(jJt»  =  mMUA+U/G-i+lVM) 


(5) 


.with  the  following  initial  values  -  0  and  (57jo  =  I,  and  we  iterate 
the  computations  of  the  sequences.  The  fr ’’owing  theorem  states  that 
under  certain  con<lit.ons  these  sequences  lead  to  the  solution  of  the 
problem. 


Theorem  2  If  for  any  Service  center  i,  one  of  the  following  assump¬ 
tions  is  satisfied 

•  the  probability  that  a  positive  customer  feaves  the  station  to  go 
outside  is  strictly  positive  (it.  d(i]  >  0). 


•  there  is  a  Strictly  positive  probability  that  a  customer,  either  positive 
or  negative,  joins  a  queue  }  where  the  rate  of  negative  customers 
coming  from  the  outside  is  strictly  positive,  (1  e,  P*(«\  j)  + 

>  0  an&  >  0J 

then  the  algorithm  converges  to  N  values  of  the  load  (q,),,i.jv 
which  is  a  solution  of  this  new  fired  point  system 


Proof  of  the  algorithm 

Lemma  1  For  all  t  the  sequences  satisfy  the  following  relations 

(jJ.  £  tffl.  «»«<  (*+)»  £  l»f)*  o"«i  liLH  £  (?lt  («) 

The  sequences  (ft)*,  (A+j*  and  [Af]*  are  non  increasing  and  the  se¬ 
quences  (jjJ*.  (A£]t  end  {A£j*  are  non  decreasing. 

Proof :  by  induction  on  k. 

Lemma  2  The  sequence  defined  by  the  differences  between  the  two 
sequences  (ft]*  a«d{2sJ*  **  positive  non  increasing.  Furthermore  under 
the  same  assumptions  as  theorem  $,  the  summation  over  the  queues 
of  these  differences  multiplied  by  the  rate  of  service  of  the  queue  /i,  is 
decreasing  to  zero. 

Proof :  We  have  to  consider  two  cases  according  to  the  valuo  of 
the  lower  sequence  (cj. 

•  If  for  some  l  [jj|t 1$  equal  to  l  then  obviously  (q7]i  is  also  equal  to 
1.  And  for  all  k  >  l  the  two  sequences  stay  equal  to  1.  The  solution 
of  the  fixed  point  system  is  on  the  boundary  of  the  set  of  definition 
for  the  load  The  station  is  not  stable. 

•  On  the  other  hand,  if  (jJk  is  strictly  smaller  than  l  we  easily  give 
a  bound  on  the  difference  between  (fr)t  and  IgJ*.  Let  (AA+Jt  (AA~]t 
and  (A?,)t  be  the  new  sequences  defined  by  the  difference  between  the 
upper  and  lower  sequences  for  iteration  h, 


(AVU=iSHi£l‘ 

(A, -!.-!£)»  (7) 

Using  that  (jJ*  is  smaller  than  i  for  all  kt  we  get . 

(At-lwi  £(?!./(*+(£!*>  -  (£W0<.+(aF1*)  (8) 

And  after  some  algebraic  manipulations  we  obtain  i 


wl  ;al  »st  HiT  1 

Let  tj  ss  and  let  t  be  the  maximum 

over  the  queue  index  j  of  the  values  e}.  Then, 

N  N  N 

£  f*.(Aq,U*i  <  ][>j(A?,ltff,  <  1 Aq;)*  (10) 

iS>r  sal  jal 


Therefore  the  sequence  of  real  values  p«[Aq,]t  is  bounded  by 
a  geometric  sequence  of  rate  c.  Note  now  that  the  assumptions  of 
theorem  2  may  be  restated  as  e  <  1,  this  implies  that  the  sequence 
/*«(A?.|t  »*  decreasing  to  zero. 

Clearly  lemma  2  proves  that  the  algorithm  leads  to  the  solution  of 
the  fixed  point  system.  If  the  solution  is  such  that  all  the  components 
q,  are  smaller  than  1  0,  then  it  is  also  a  valid  solution  for  the  initial 
problem  :  q,  is  the  load  of  station  i  in  the  network. 

Otherwise  if  a  component  q,  is  on  the  boundary  of  the  domain 
then  this  implies  that  the  lower  bound  (A?jt/(pi  +  (AT)*)  is  greater 
than  l.  So  there  is  no  solution  of  the  fixed  point  system  inside  the 
domain  and  queue  1  Is  unstable. 

Thus  the  proof  of  theorem  2  is  complete  and  the  solutions  of  the 
fixed  point  system  (S)  give  the  loads  <?,  of  the  queues  in  the  network 
or  show  that  some  queues  are  unstable. 

Each  iteration  of  the  algorithm  consists  in  the  computation  of  the 
new  values  of  the  sequences  and  the  new  value  of  the  difference  to 
check  the  accuracy.  There  arc  6Ar  new  values  to  compute  and  fou-  of 
them  (the  most  expensive  in  term  of  computations)  require  a  linear 
number  of  operations.  Therefore  the  complexity  of  each  iteration  is 
quadratic. 


IV  Conclusion 

la  this  paper  we  prove  the  stability  (the  existence  of  a  solution)  of 
the  fixed  point  system  of  equations  (A)  (2)  and  (3)  using  an  algouthm 
to  obtain  the  solution.  The  uniqness  of  this  solution  in  the  domain 
of  ergodjdty  is  obtained  because  of  the  Markovian  framework.  The 
solution  is  accurate  as  we  obtain  lower  and  upper  bound  for  the  load 
of  the  queues.  Furthermore  the  algouthm  1$  effluent  as  the  differ- 
cnce  between  upper  and  lower  bounds  is  decreasing  quicker  than  a 
geometric  sequence, 

Smco  we  derived  this  algorithm,  the  model  of  positive  and  negative 
customers  was  improved  to  take  into  account  customer  class.  It  is 
shown  In  (5)  that  networks  with  multiple  classes  of  positive  customers 
and  negative  customers  also  have  product  form  solution  and  the  fixed 
point  system  is  slightly  different.  Application  of  the  algorithm  to  this 
new  fixed  point  system  is  straightforward. 


References 

|l)  F.  Haskett,  K.M.  G’handy,  R.R.  Muntz,  F.G.  Palacios  Open, 
closed  and  mixed  networks  of  queues  with  different  classes  of  cus¬ 
tomers,  ACM  Journal,  Yol,  22,  No  2,  p248-260,  April  1075 

(2}  E,  Gelenbe  Product  form  networks  with  negative  and  positive  cus¬ 
tomers,  To  appear  in  Journal  of  Applied  Probability,  1089 

(3)  E.  Gelenbe  Stability  of  Random  Neural  networks.  Neural  Com 
putation,  1000. 

(4)  E.  Gelenbe  Random  Neural  Networks  tn(h  Negative  and  Positive 
Signals  and  Product  Form  Solution,  Neural  Computation,  1900 

(5)  J.M.  Fourncau,  E,  Gelenbe,  R,  Suros  Multiclass  G-Networks, 
EH  El  Report,  Submitted  for  pubh  cation,  1000. 

(6)  C.D,  Garcia,  W.I.  Zangwill,  Pathways  to  solutions,  fixed  points, 
and  equilibria,  Prentice  Hall,  Englewood  Cliffs,  N  J-,  1981. 


1810 


COMPUTATIONAL  METHODS  IN  TEE  OPTIMIZATION 
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ABSTRACT  -  We  consider  a  periodic  review,  single 
product,  single  installation  Inventory, control  system 
with  probabilistic  demand  operating  In  the  steady 
state  under  the  (s,S)  policy,  A  unified  methodology 
is  provided  to  generate  five  computational  techniques 
for  the  determination  of  the  optimum  decision 
variables  s  and  Q-S-s  so*as  to  minimize  the  steady- 
state  total  expected  cost  of  operation  per  period. 

The  computational  methods  discussed  are:  an  analytic 
method,  a  numerical  method,  an  analogue  method,  an 
asymptotic  method  and  an  approximate  method. 

1.  THE  MATHEMATICAL  FORMULATION 

Consider  a  periodic  review  Inventory  control  system 
with  immediate  delivery  of  orders  where  unfilled 
demands  are  backlogged.  The  demand  during  successive 
periods  is  described  by  a  sequence  of  independently 
and  Identically  distributed  continuous  random 
variables  with  probability  density  function  4(0  and 
distribution  function. 0(0  <0<£<«>),  both  assumed  to  be 
Laplace  transformable. 

We  shall  assume  that  tho  procurement  cost  has  two 
components:  a  fixed  component  K  (K  >  0)  and  a  variable 
component  c.z,  where  z  Is  the  total  quantity  ordered. 
In  addition,  we  shall  assume  that  a  cost  L(y)  is 
Incurred  during  each  period  where  y  Is  the  Initial 
stock  level  at  the  beginning  of  a  period  following  a 
decision.  The  ordering  or  stock  replenishment  policy 
will  be  of  the  (s,S)  type,  that  is.  If  at  the  end  of  a 
period  tho  stock  level  is  x<$,  a  quantity  S-x  is 
ordered,  otherwise  no  order  is  placed. 

It  can  be  verified  (1}  that  if  Q  -  S-s  the  expression 
for  the  total  expected  cost  per  period  takes  the  form 


Uy>  “ 


h  F <y-0*(0d£  +  p  f  «-y)*<Odf  y  *  0 
Jo  Jy 

(V 

Jo 


p  I  {OyMO&Z 
o 


y  <  0 


(3) 

<4) 


Theorem 

The. optimal  values  of  Q  and  s,  Q*  and  s*,  satisfy  the 
following  set  of  equations 

3K(s*,x>  I 

M(s*,Q*>  -  K  and  -  -  0  (5) 

3*  Ix-Q* 

£«-<»*>  •  -<«*«>> 

where  M(e*,z)  -j£(M(s*,x)> - (6) 

1  -tfWx)) 

2.  COMPUTATIONAL  METHODS 
Z A _ An -Analytic  Method 

In  this  method,  the  function  M(s*,x)  is  first 
determined  explicitly  by  inverting  analytically 
expression  (6);  the  system  of  equations  (5)  is  then 
solved  for  s*  and  Q*.  Although  this  method  is  quite 
general,  nevertheless,  it  Is  convenient  only  when  the 
expressions  for  L(y)  and  ^<x)  are  rather  simple  In 
addition  and  quite  often,  the  inversion  of  M<s*,z)  can 
be  extremely  tedious.  Using  for  L(y)  the  expression 
given  in  (3)  with  4(x)  taken  as  the  gamma  density 
function  with  parameters  n  and  r: 


K  +  L(S)  *  j  HS-OHOV 

g(S,Q) - -  ct> 


Jo 


l  + 


4(x)  -  - pe'^x,  r  >  0,  n  >  0 

T(r> 


<D 


we  obtain  for  r  -  1 

h_ 

2/i 


M^.A.X)  -  ■  <J12X2  +  2 (,x)  +  ■ 


.•"‘"(hip) 


2/jx 


(7) 


<8) 


£■ 


vHu>  *  *(u)  4  4(u-v)tf(v)dv 


(2) 


and  that  the  minimum  value  of  this  cost  at  optimality 
is: 

g(S*.Q*)  -  L(**)  +  cD 


»  -  I  €««<*« 

Jo 


Let  h  denote  the  unit  holding  cost  and  p  the  unit 
penalty  cost.  Since  y  can  take  on  negative  values, 
the  expression  for  L(y)  becomes; 


2.2  A  Numerical  Method 

ThJs  procedure  consists  in  inverting^ (M(s*,x))  in  <6) 
numerically  for  a  range  of  values  of  s*.  The  value  of 
s*  and  x  are  then  selected  to  satisfy  the  system  of 
equations  (5),  To  illustrate  the  approach,  we 
consider  the  case  of  linear  holding  and  shortage  cost 
when  the  distribution  of  demand  Is  gamma  as  given  by 

(7>« 

To  reduce  the  total  number  of  Input  variables,  we 
perform  a  dimensional  analysis.  Ve  first  note  that 
the  problem  involves  five  input  variables  <p,  h,  K,  n 
and  r)  and  two  output  variables  which  are  the  optimum 
decision  variables  s*  and  Q*.  Ue  can  show  that  it  is 
possible  to  express  the  two  dimensional  output  factors 
fiS*  and  pQ*  as  a  function  of  the  three  dimensional 
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input  factors  - ,  — ■  and  r.  In  the  sequel  we  shall 
h  h 

express  the  diaensional  factors  by 


Equation  (5)  can  be  written  a3l 


Assuae  L(y)  Is  given  by  (3) .  Denote  H(x)  -  l-O(x) , 
then 


!£m(s*.x>>  -  -  ■ 


#{H(s*+xM 


iz(1-#<2» 


dc  -  5 


«o<x  v'  -  o 


where  »Q(X,t>  is' the  derivative  of  the  function 
H(s*,x)  defined  in  <6)»- appropriately  scaled.  Figure 
1  Illustrates  a  typical  output. 


— = - ) 


Jo  i-7(Z) 


For  large  x,  we  can  write: 

?(t)  -  1-zD  *  ^  z2  (D2  +  c2)  +  o(z2) 

where  D  <:«  and  a2  <  *  denote  the  finite  nean  and 
variance  of  the  R.V.f.  So  that  for  large  x,  the  first 
tera  of  (12)  becones: 

#'l(— — r - >  -  *2  +  i  <1  ♦  -^t)x  +  o(l) 


*2{1-?<*>1  2D  2 


A  siailar  expression  can  be  obtained  for  the  second 
tera.  We  obtain 

q*  i  y™  ,nd  |  H(v)dv « + <j*i 


Figure  1.  Optiaal  values  of-X,~  p$  and  Y  -  pQ  when 
demand  is  ganaa  distributed  of  order  r  -  5  (A  -  p/h, 
B  -  2Kp/h) 


This  aethod  is  based  on  the  analogy  that  equation  (6) 
bears  with  a  feedback  control  systea.  It  is  possible 
to  design  an  electrical  analoguo  circuit  to  siaulate 
the  above  systea.  By  setting  s  as  a  paraaeter, 
conditions  (5)  can  be  aade  to  be  satisfied. 

Consider  the  deaand  per  period  to  be  of  the  ganaa  type 
given  by  (7)  with  h  -  l,  p  -  10,  p  -  ,4,  and  r  -  4. 

The  plot  of  the  output  function  a(s.x)  -  M’(s,x)  for 
various  values  of  s  is  presented  in  Figure  2, 


If, we  assuae  that  M(s*,x)  Is  expandable  as  a  power 
series  of  x,  then  for  snail  x  we  have: 

M(s*,x)  ^  *o  +  al  x  +  a2 

«0>  aj  and  aj  can  be  evaluated  by  Baking  use  of  the 
known  Inherent  characteristics  of  the  function 
H(s*,x).  The  following  Identity  Is  obtained,  where 
R(*)  is  sone  renainder  tera: 

2K 

(—  +  L'(s*>)  x 
Q* 

-  -  -[I.-(s*)<L'(s*)  *(0)llxJ+tR1(x)-R2(x)l-0 

This  identity  yields  the  following  approximate  system 
of  equations  in  s*  and  Q*  for  snail  Q* 

L'(s*)  -  -  —  and  L“(s*)  -  ~  +  —  ^(0) 


This  nethod  in  conjunction  with  the  asymptotic  aethod 
could  be  used  to  provide  bounds  on  s*  and  Q*. 
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Figure  2,  Hybrid  coaputer  results  for  n(s,x)  when 
r  -  A  and  p  -  ,4. 
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Abstract -7h c  paper  applies  the  maximum  princi¬ 
ple  to  a  cod cl  of  the  Italian  economy  specified 
as  a  stock- flew  continuous  tine  maerodynamic 
node!  and  estimated  by  a  full  information  maxi¬ 
mum  likelihood  (FZKL)  estimator. 

X.  INTRODUCTION 

The  purpose  of  this  study  is  to  outline  at  an 
aggregate  level  some  optimizing  strategies  and 
policy  design  aimed  at  coping  with  the  sponta¬ 
neous  tendencies  in  Italy  of  income  distribu¬ 
tion  and  public  finance  seriously  troubled  by 
a  large  and  incessantly  growing  public  deficit. 
The  research  hinges  upon  a  model  principally 
devoted  to  make  evident  the  relations  conne¬ 
cting  the  public  and  private  sectors  of  the  I- 
talian  economy  Cl). 


II.  SPECIFICATION  OF  THE  OBJECTIVE 
FUNCTION 

Three  kinds  of  objective  function  (a,  b  and  c) 
will  be  employed: 

The  first  (a)  maximizes  the  level  of  private 
sector  value  added  and  of  gross  investment  in 
manufacturing  industry  and  services  and  mini¬ 
mizes  the  fluctuations  of  the  above  investments 
from  a  trend,  using  as  instruments  the  interest 
rate  and  currency.  In  symbols,  this  objective 
function  takes  the  following  form: 

«*  A  qy  [logP*ODsp(t)+loqOm;i  ♦qjloglj  (t) 

■qitii09ll(t)-l03il(t)^2',:a&<t>-°(t)]2 


PROD  indicates  the  private  sector  labour  pro- 
ctivity;  is  employment  in  industry  and  serv¬ 


ices;  1^  stays  for  gross  investment  in  manufac¬ 
turing  industry  and  services  in  real  terms; 
a  indicates  the  variation  rate  of  currency; 
i  is  domestic  nominal  interest  rate  for  long 
term  government  bonds.  The  dash  on  the  variable 
indicates  a  desired  (or  ideal)  value.  All  de¬ 
sired  values  are  represented  by  trends.  To  be 
precise: 

log*l(t)*logIl<t  )+0,01t;  a<t)=0.03*0.01t; 

1( t)=0. 0191-0. OOlt;  (in  quarterly  growth  rates) 
Ke  have  employed  the  following  weights: 


1.0:  V  0.5:  qi(t)=  0.5 


1The  computation  programs  and  the  continuous 
methodology  used  in  this  application  are  due 
to  Dr.  C.R.  Wyner. 


A  second  fora  of  objective  function  (b)  has 
been  obtained  by  dropping  in  the  objective  fun¬ 
ction  specified  above  the  term  which  minimizes 
the  fluctuations  of  gross  investment  about  its 
trend  a-~*  adding,  as  control  variable,  a  term 
which  minimizes  the  deviations  of  money  wage 
rate  in  the  private  sector  from  a  desired  path 
of  this  variable;  therefore,  this  new  objective 
function  also  includes  the  expression: 

-V  log*psp  1 1  rlog^Psp  ( t 


Where  logw  =logv 

psp(t)  ^  psp(t») 


+  0.03t 


Finally,  the  third  kind  of  objective  function 
(c)  adds  in  the  previous  one  a  new  target  which 
minimizes  the  deviations  of  the  public  debt 
from  a  desired  path  of  this  variable.  Besides, 
it  substitutes  the  wage  policy  by  two  new  con¬ 
trol  variables  which  minimize  the  deviation  of 


public  sector  expenditures  and  of  public  sec¬ 
tor  entrances  from  some  hypothetical  ratios 
between  these  variables  and  private  sector  nom¬ 
inal  value  added.  So,  the  new  objective  func¬ 
tion  replaces  the  wage  policy,  in  the  previous 
one,  by  the  following  expressions: 


-,:ullO9U7(t)-1O9I,R0Dsp(t^losOl(t)-lo9P(tT{,2 
where 6  =0.6  is  a  desired  ratio  (expressed  m 
logarithms)  between  public  expenditure  (U„) 
and  private  sector  nominal  value  added,  and 
PR0Ds  ,  O^,  P  are  the  variables  determining 
private  nominal  value  added. 


Where t  =0.75  is  a  desired  ratio  (expressed  in 
logarithms)  between  public  entrances  and  pri¬ 
vate  nominal  value  added,  and  E  indicates  the 
public  sector  revenue. 

Besides,  we  add  the  new  target: 

-qB(l°gB(t)-  l°95(t)>2 

Where:  logB^-logB^  j0.05t  and  q0=  0.5.  B  13 

the  total  amount  of  the  public  debt. 

We  refer  the  explorations  to  a  planning  hori¬ 
zon  going  from  1981  to  1986. 


III.  BRIEP  DISCUSSION  OF  SOHE  RESULTS. 

The  optinu  ation  results  ,  being  expressed  in 
logarithms,  allow  us  to  take  directly  the  vari¬ 
ation  rates  of  each  variable.  It  nay  be  useful 
to  expound  a  table  containing  the  average  ra¬ 
tes  of  growth  of  some  more  relevant  variables 
in  the  planning  horizon,  obtained  using  each 
one  of  the  three  kinds  of  objective  function 
specified  above. 
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of  the  objectives. 

Objective 

Object. 

Object. 

Object.  Force 

variables 

func. (a) 

func- (b) 

func. (c)value 

2.41 

2.71 

2.07  1.36 

P?0Dsp 

4.96 

4.57 

4.40  3.84 

rl 

B 

9.55 

8.30 

8.99  5.85 

8.60  15.55 

Table  1  shows  that  the  adoption  of  each  one  of 
the  objective  functions  leads  to  a  remarkable 
improvement  of  the  targets#  with  respect  to  the 
neutral  solution-  Khen  the  objective  function 
(b)  is  adopted,  employment  growth  appears  near¬ 
ly  doubled  in  confront  of  its  forecast  value 
probably  owing  to  the  use,  in  this  case,  of 
wage  rate  as  control  variable-  But  even  in  the 
other  cases  employment  growth  is  remarkable. By 
adding  the  growth  rates  of  employment  and  pro¬ 
ductivity,  we  find  that  the  yearly  average  rate 
of  growth  of  the  value  added  in  the  private  sec¬ 
tor  is  almost  2  percent  core  than  its  forecast 
value.  Also  the  dynamics  of  private  investment 
shows  a  marked  push.  Finally,  the  objective 
function  (c)  including  public  deficit  among  its 
targets,  shows:  a)a  considerable  reduction  in 
the  dynamics  of  this  variable  and,  b)  that  the 
use  of  public  entrance  and  expense  as  policies 
does  not  yield  a  great  benefit  to  employment 
and  productivity  growth.  It  nay  be  useful  to 
expuond  the  trajectories  of  some  variables. 

Employment  In  Industry  ond  services  (Oj) 


Cross  r«al  Invest. In  nanuf.end  services (i^) 


1*981  82  83  84  85  86 


Total  eaount  of  public  debt(B) 


Public  sector  expenditure  (U^) 


It. is  interesting  to  observe  the  trajectories 
of  public  debt  and  of  the  policy  instruments. 

We  can  see  that  the  fulfilment  of  an  incisive 
wage  policy,  as  the  objective  function  (b)  docs, 
deternines  a  sharp  reduction  of  public  debt 
growth.  Por  their  part,  public  finance  policies 
postulated  by  the  objective  function  (c)  cause 
a  drastic  fall  of  public  debt  at  the  beginning 
of  the  period,  which  »  followed  later  by  a  con¬ 
siderable  growth  of  th  3  variable,  the  trend 
of  which  remains  m.  b  lower  than  the  neutral 
solution.  This  resul^  is  due  to  a  strong  reduc¬ 
tion  of  public  expei  *  t-ce,  while  fiscal  policy 
shows  a  moderate  treno  Por  space  limitations, 
we  avoid  further  comae  &  on  the  above  graphs 
and  conclusions. 

REFERENCES 

(1)  Pusari  A.  (1990), "A  t»iorodynanic  model  cen¬ 
tered  on  the  interrelationships  between  the 
public  and  private  sector.*  of  the  Italian  econ¬ 
omy",  New  York  Economic  Review. 

(2)  Wymcr,C.R.  "Trans,  Rcsxuul,  Continent, opr e- 
dic  computer  programs  and  jelatiyc  manuals. 


1814 


A  STRATEGY  FOR  SOCIOECONOMIC  SIMULATION  OF  EUROPE  1992 


Jona  Walter 

California  State- University",  Doainguez  Bills 
Computer  Information  Systems,.  SBS  D-325 
Carson,  CA  90747  U.S.A. 


I 


1 


i 

f 


t 


Abstract  -  socio-economic  Simulation  (SESj  is 
a  multiple  decision-maker  process  allowing 
competitive  or  collaborative  behavior  among 
persons  sharing  a. common  simulated  economic 
environment.  Discussed  here~are  the  problems, 
of  handling  judgmental  information  in  the 
modeling  of  consumption  and  investment.  This 
is  implemented  with  the  application  of 
Computer  Assisted  Software  Engineering.  (CASE] 
diagrams  and  the  use  of  linked  spreadsheets. 


I  THE  PROBLEM  OF  JUDGMENTAL  INFORMATION 

A  problem  often  encountered  in  the  modeling 
of  economic  events  is  the  inability  to 
include  judgmental  information  together  with 
the  quantitative  data.  Human  reaction  to 
changes  in  the  simulated  environment  can  be 
of  greater  importance  than  the  numerical 
precision  of  the  estimates  afforded  by  the 
model’s  equations. 

One  way  to  sursounb  the  problem  of 
judgmental  information  is  to  actually  embed 
human  decision-makers  as  role  players  within 
a  simulation  exercise  or  "game."  Thus  a 
simulation  model  contains  not  only  data,  a 
set  of  equations  and  procedures,  but 
furthermore  a  network  of  participants,  in 
this  network,  participants  may' be  subject  to 
partial  ordering  to  reflect  hierarchies  of 
organizations.  Although  their  essential' role 
may  be  competitive,  the  decision-makers  are 
empowered  to  help  each  other  understand  and 
solve  common  problems  for  their  mutual 
benefit  through- their  negotiated  sharing  of 
information  normally  kept  confidential. 
Consequently,  the  simulation  is  defined  by 
who  the  participants  are,  in  addition  to  what 
they  do. 

Participants'  actions  are  of  special 
significance  with  this  approach.  Their 
behavioral  .patterns  are  recorded  by  the 
computer  system  and  become  an  integral  part 
of  the  modeling  process,  in  subsequent 
exercises  the  systea's  adaptation  to 
decision-makers'  behavioral  patterns  is 
incorporated  within  the  simulated  economic 
environment,  for  example,  by  using  the 
empirical  data  of  participants*  responses  to 
change  the^bias  of  one  or  more  random  number 
generators  in  the  model. 

The  result  is  a  truly  Interactive  process, 
where  the^model's  properties  influence 
participants'  behavior.  This  behavior,  in 
turn,  causes  changes  in  the  policies  and 
procedures  comprising  the  model.  Various 
examples  of  the  application  of  this  strategy 
toward  modeling  the  1992  economic  unification 
of  Europe  have  been  tested. 


II  MODELING.  CONSUMPTION  AND  INVESTMENT 

In  transnational  organizations,  decision 
makers  need  to  decide  not -only  whether  to 
build  or  buy  technology^' but  also  where  that 
is  to  be.  done.  At  the  national  level, 
consumption  and  investment  can  be  described: 

C  -  C(0)-fmpc*(Y-T(0)+t*YrTP(0)+p*Y),  and 

I  ■  lC0)-c*r,  with  the  variables: 

1(0)  Autonomous  investment 
c  Marginal  efficiency  of  capital 
T(0)  Autonomous  tax  revenues 
t  Marginal  tax  rate 

T?(0)  Autonomous  transfer  payments 
p  Relationship  between  transfer 

payments  and  income 
e  Exchange  rate 

C(0 )  Autonomous  consumption 
mpc  Marginal  propensity  to  consume. 

Y  Country  A's  national  income 

Y(B)  Country  B's  national  income 

The  constructs  presented  below  are  new 
approaches  to  business  simulation.  The 
simulation  serves  as  a  "Gedankenexperiment", 
to  detect  incipient  difficulties  before  any 
actual  decisions  are  taken. 

Ill  APPLICATION  OF  CASE  DIAGRAHS 

Pig.  1  below  is  a  diagram  of  flow  of  funds, 
patterned  after  a  Data  Plow  Diagram  ( DFD 1 . 
This  diagram  shows  part  of  the  relationship 
among  a  transnational  firm,  a  transnational 
bank  and  a  national  treasury. 


Pig.  1.  Plow  of  Funds 


I 
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Fig.  2  below  shows  a  decoaposition  diagram, 
protraying  the  hierarchical  relationship 
among  the  fira,  the  bank  and  the  treasury, 
below  the  director  of  the  simulation.' 


Director  of  1 
Simulation  | 


r — ■ — 

— 

i 

Transnational!  ,] 

National 

['  [Transnational! 

Bank  -  1  I 

Treasury 

!  [«»■  _ l 

Fig.  2.  Hierarchy -of  simulation  participants. 


IV  USE  Of  LINKED  SPREADSHEETS 

This  approach  uses  hidden  spreadsheets  linked 
with  a  caster  spreadsheet.  The  caster 
spreaesheet  is  accessible  to  all 
participants.  Its  primary  contents  are 
numbers  which  are  the- results  of- 
caiculations.  It  may  also  contain  formulas, 
but  only  those  formulas  intended- to  be  public 
knowledge  (e.g. ,  VAT  *  .15  indicating  a' 
value-added  tax  rate).  To  obtain  the 
results,  the  caster  spreadsheet  draws  upon 
hidden  spreadsheets  in  the  custody  of 
participating  decision-makers.  Formulas  in 
the  hidden  spreadsheets  are  privy  only  to  the 
simulation  director  and- the  specific 
decision-maker  to  whoa  that  spreadsheet  has 
been  allocated  by  the  director. 

Fig.  3  below  corresponds  to  the 
decoaposition  diagrae  of  Pig.  2. 
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Pig.  3.  Linking  of  3  Hidden  Spreadsheets 


1.  [Anonymous!,. Super calc5  User*s  Guide. 
.Computer  Associates,  San  Jose,  CA,  1988. 

2.  C.S.  Greehglat  and  R.D.  Duke,  Gaming  - 
Simulation:  Rationale.  Design  and 
Applications,  John  Wiley  and  Sons,  New 
York,  1975. 

3.  C.  Lopilato,  Europe  1992:  A  Computer 
Spreadsheet -Simulation  Game.  California 
State  University,  Dominguez  3ills, 
Carson,  CA,  1989. 

4.  C.  Lopilato  and.j.p.  Walter,  "An 
Artif  icialrlntelligence  Approach  to’  the'' 
Simulation  of  Europe  1992",  in  ai  and- 
Simulation;  Theory  and  Applications, 
edited  by  wade  Webster  and  Ran^eet 
Uttamsingh,  Society  for  Computer 
Simulation,  San  Diego,  CA,  1990. 

5.  -W.  'A.  McEachern , -  Economics:  A 
Contemporary  Introduction.  South- 
Western,  Cincinnati,  OH,  1988. 

6.  S.L.  Nelson,  "A  Spreadsheet  Business 
Planner",  Lotus,  April- 1990,  pp.  37-38. 

7.  F.  Rosenkranz,  (1979).  An  Introduction 
to  corporate  Modeling.  Duke  University 
Press,  Durham,  nc,  1979, 

8.  A.N.  Schreiber,  (1970).  corporate 
Simulation  Models.  University' of 
Washington,  Seattle,  1970. 

9.  J.P.  Walter,  PRECOS  -  Un  Nouveau 
Systeme^Ordlnateur  pour  la  Simulation 
Spcio-economigue.  Faculty  of  sciences. 
University  of  Paris,  1971. 

10.  D.  Harsh,  The  Idea  of  Economic 
Complexity.  Viking. Press.  New  York , 

198-J;  '  •  , 


1816 


DEALING  WITH  UNCERTAINTY' IN  MULTIPLE. OBJECTIVE  DECISION  SUPPORT 
CHenggeler  Antunes  <I*3)  and  jolo  N.  CUmaco 

(1)  Department  of  Electrical  Engineering  ;  (2)  Faculty  of  Economics  -  Umycraity  of.Coimbra  -  3000  Coimbra  -  Portugal 
(3)  INESC.  Rua  Antero  be  Qcental,  23 1  cave  -  3000,Coimbra  -  Portugal 


Abstract  -  This  paper  presents  an  interactive  approach  to 
deal  Villi 'the  inherent  uncer  tainty  and  imprecision 
a  firing  in  problems'  where  multiple  conflicting  objectives 
exist.  The  uncertainty  and  imprecision  are -associated  not 
just  vith  the  initial  data  and  the  imperfection  of  the  model 
as  a  representation  of  reality,  but  also  with  the  decision, 
maker's  preference  structure  which 'evolves  throughout 
the  interactive  decision  process  as  more  knowledge  about 
the  problem  is  gathered.  An  interactive  approach  aimed  at 
evaluating- the  stability -of  compromise  -solutions'  is 
proposed,  which  is  based  on  Ihe' representation  of  the 
solutions  in  the'- weight  space. 

I.  INTRODUCTION 

Most  complex  real-world. problems  are  characterized  by 
multiple,  conflicting  and  Incommensurate  objectives. 
Mathematical-  models  as  welf'as  the  perception  of- the 
problems  by  decision-  makers  become  more  realistic  if 
several  objectives  are  considered  explicitly.  In  a-  multiple 
objective  context,  the  conce'pl  of  optimal -solution  gives 
place  to  the  concept  of  nondominated  solution*  (feasible 
solution  for  "which  there  are  no  other  feasible  solution  that 
improves  in  objective  . 'unction  value  without  worsening  at 
least  other’ objective  fuaett  n  value).  These  decision 
problems  entail  tradeoffs  among  the  objectives,  in-order  to 
get  a  satisfactory  compromise  solution  from  *.  set  of 
nondominated  solutions. 

Different  methods  < for  tackling  multiple  objective 
problems  exist,  using  different  solution  techniques  and 
requiring  distinct  degrees  of  involvement  of  the  decision 
maker  (DM).  Tn' interactive  methods  phases  of  decision 
involving  the  DM  are  alternated  with  phases  of 
computation.  These  methods  reflect  a  diversity  of  strategies 
fcr;carrymg  cut  the  searen  for  nondominated  solutions 
and  differ  in  the  type  of  mformalioavrequired  from- and 
presented  to  the  DM.vThe  DM  intervenes  in  the  solution 
search  process  by  inputmg  information  into  the  procedure 
which  in  turn  is  used  to. guide  the  search  process  (thus 
minimizing  the  computational  effort)  in  order  to  compute  a 
new  solution  which  more  closely  correspond  to  his 
preferences.  For  a  review  sec  ML 

The,  interactive'  process  plays  a  significant  role  to  the 
enhancement  of'  the  knowledge  acquisition  process,  b  y 
improving  skills  to  gain  new  insights  into  the  problems. 
These  may  then  be  used  to  express  new  preferences  and 
progressively  narrowing  down  the  scope  of  the  search, 
thus  minimizing  the  computational  burden.  The  support  of 
a  learning  process  is  an  essential  step  to  construct  more 
realistic  representations  of  the  DM’s  preferences,  aimed  at 
capturing  the  lack- of  surety  and/or  preference  changes 
that  may  arise  as  the  interactive  decision  process  proceeds 
The  possibility  of  performing  some  kind  of  stability  or 
sensitivity  analysis  is  thus  a  very  important  component  of 
interactive  decision  aid  computational  tools  On  the  one 
hand,  it  enables  the  DM  to  consider  modifications  of  the 
original  data,  -concerning  uncertainty,  inaccuracy  and 
changes  associated  with  the  input  data,  as  well  as  the 
imprecisions  stemming  from  the  modelling  phase.  On  the 
other  hand,  it  enables  the  DM  to  exploit  changes  of  his 
preference  structure,  thus  accommodating  his  lack  of 
surety  in  specifying  new  search  directions  required  in  a 
next  computation  phase  of  the  interactive  process. 


The  approach  proposed  to  evaluate  these  effects  on 
hoedominated  solutions. is  integrated. in' the  operational 
framework  of  the  TRI MAP- package.  The  TRIMAP  method 
III  is  at  The  core  of  an  interactive  package  aimed  at 
supporting  the  DM-m  the  progressive- and  selective 
learning  of  the  set  of  nondominated  solutions  in  multiple 
objective  linear  .-programming  (MOLP)  problems.  The 
capabilities  of  the  computer  environment  (namely 
graphical  displays) 'are  exploited  in  order  to  provide  the 
DM  an  user-friendly  man- machine  interface. 

II.  AN  INTERACTIVE  DECISION  SUPPORTTOOL  FOR  MOLP 

Ar  MultiPl;  Objective  Linear  Program  mine 

Multiple  objective  Linear  programming  is  concerned  with 
problems  of  the  type: 

"Max"  f(i)-Cx  (1) 

s.  t.  x  €  F 

P-(I  CR^Ax-fe.  x*fl) 

where  C-  Igj  .£2  .  ~ .  CplT  (objective  function  matrix),  £k , 
L-:.2,...p.  are  n-dimensional- vectors,  A  (technological 
matrix)  is  a  mxn  matrix  and  fc,  (RHS)  is  a  m-dimensional 
vector.  “Max'  denotes  the  operation  of  finding  the  set  of 
nondominated  solutions  (in  a  maximizing  sense). 

The  set  of  efficient  (Pareto  optimal,  noninfcrior)  solutions 
is  defined  by  Fg-IxC  FrKx1)^  £(x)  ••  x^it'F ) 
where  £(x* )  at  £(x)  iff  £(x* )  a:  £(x)  and  £(x* )  *  Kx).  and 
1(X* )  *  Kx)  iff  fkfcl )  *  fk(x).  k-l,2,*,p. 

In  this  case  we  would  say  that  £  (x)  is  nondominated  when 
X  C  Fe-  In  general,  whereas  the  concept  of  nondominance 
refers  to  the  objective  function  space,  the  concept  of 
efficiency  refers  to  the  decision  variable  space 
A  process  of  computing- efficient  solutions  consists  in 
solving  scalar  problems  consisting  of  a  weighted' sum  of 
the* objective  functions  as  in' (2)  The  admissible  set  of 
weights  is  defined  by  A“(ik:A€RP,Z^k»l1Xk>0, 

k-1,2 . p  }.  By  fixing  a  set  of  weights  A  €  A  a  weighted 

linear  function  to  be  maximized  over  Y  is  obtained 

max  £  *k  4(X)  (2) 

k-1 

s.t.  X  €  F,  A  €  A 

It  has  been  proved  that  x  €  F  is  an  efficient  basic  solution 
to  (1)  if  and. only  if  it  is  an  optimal  solution  to  (2)  The 
graphical  display  of  the  set  j^-which  leads  to  each  efficient 
solution  can  be  achieved  by  means  of  the  decomposition  of 
the  weight  space  The  weight  space  may  be  perceived  as 
the  space  of  the  relative  importance  of  the  objective 
functions. 

From  the  simplex  tableau  corresponding  to  an  efficient 
basic  solution  to  (2)  the  corresponding  A  set  is  given  by 
tfyjzQ.  W-CgB  'A  t  is  the  reduced  cost  matrix  (one  line  for 
each  objective  function,  where  the  element  wk,  is  the 
marginal  rate  of  change  of  objective  function  fk(x)  caused 
by  the  introduction  of  one  unit  of  variable  xj  into  the 
basis),  B  is  the  basis  and  Cq  is  the  submatnx  cf  C 
corresponding  to  the  basic  variables. 
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The  region  comprising  the  set  of  weights  corresponding 
to  a  nondommated  extreme  solution  (region  where  QjWiO, 
^  €  A)  is  consistent)  is  called  indifference  region  I3J.  The 
DM  can  thus  be  indifferent  to  all  the  combinations  of 
veighis  within  it,  because  they  lead  to  the  same 
nondominaled  solution.  The  analysis  of  the  indifference 
regions  in  the  weight  space  is  a  valuable  tool  in  "learning 
the  shape"  of  the  nondominaled  solution  space.  The 
boundaries  between  contiguous  indifference  regions 
represent  the  -nonbasic  efficient  variables  (those  which- 
when  introduced  into  the  basis  lead  to  an  adjacent  efficient 
extreme  point  through 'an- efficient  edge).  A  common 
boundary  between'two  indifference  regions  means  that  the 
corresponding  efficient  solutions*  are  adjacent  and 
connected  by  an  efficient  edge;  If  a  point  A  €  A  belongs  to 
several  indifference  regions  this  means  that-  they 
correspond  to  efficient  solutions  lying  on  the  same  face. 

TRIMAP  (ll  is  an  interactive  MOLP  package  which  is  aimed 
at  aiding  the  DM  in. the  progressive  and  selective  learning 
of  the  set  of  nondominaled.  solutions.  TRIMAP  combines 
three  main  basic  procedures-  weight  space  decomposition, 
introduction  of  constraints  on  the  objective  function  space, 
and  introduction  of  constraints  on  the  weight  space. 
Moreover.  TRIMAP  enables  the  introduction  of  additional  - 
limitations  on  the  objective  function  values  to  be  translated 
into  the  weight  space.  The  dialogue  with Jhe  DM  is -made 
mainly  in  terms  of  the  objective  function  values  (which  is 
the  type  of  questions  that  do  not  require  an  excessive  effort 
from  the  DM.  unlike  assessing  marginal  rates  of 
substitution  of  other  forms  of  preferem^ elicitation).  Jn 
general,  the  weight  space  is  used  in  TRIMAP  as  a  valuable 
means  for  collecting  and  presenting  the  information. 

The  main  purpose  of  TRIMAP  is'  to  provide  a  progressive 
and  selective  filling  of  the  weight  space.  In  each  dialogue 
phase  the  DM  must. indicate  whether- the  study  of  solutions 
corresponding  to  not  yet  searched 'regions  is  of- interest: 
This  enables  the  DM  to  perform  a  "strategic  search"  of  the 
feasible  polyhedron  and  prevents  the  exhaustive  search  in 
regions  with  close  objective  function  values  (which,  often 
arise  in  real-world  problems)., The  interactive  process  only 
ends  when  the  DM  considers  to  have  gathered  "sufficient 
knowledge"  about  the  set  of  nondommated  solutions  which 
enables  him  to  make  a  decision.  The  stopping  rule  is  thus 
the  "satisfaction"  of  the  DM  rather  than  the  checkingfor 
convergence  of.  any  utility  function. 

TRIMAP  is  dedicated  to  problems  with  three  objective 
functions  (p-3)  Although  this  is  a  limitation,  it  allows  for 
the  use  of  graphical  means  which  are  particularly,  suited 
for  the  "dialogue"  with  the  DM.  In  order  to  enhance  his 
capabilities  in  processing  the  information,  the  DM  is 
offered  a  flexible  and  user-friendly  environment,  always 
keeping  the  control  over  the  solution  search  process.  The 
computer  interface  is  mainly  based  on:  a  menu  bar  at  the 
top  of  the  screen  lists  the  titles  of  the  available  pulldown 
menus,  grouping'  the  actions  available  to  the  user,  thus  hot 
occupying  screen  space  and  not  requiring  command' 
memorization:  overlapping  windows  arc  used  by  the 
program-  for  displaying  tabular  and  graphical  information 
to  the  user,  pictorial  controls  provide  the  user  an  intuitive 
way  for  specifying  his  preferences:  dialogue  boxes  are  used 
whenever  some  further  Information  is  needed  before  a 
given  command  can  be  carried  out.  being  also  useful  for 
conveying  status  information  to  the  user.  The  introduction 
or  constraints  on  the  objective  values  and  their  translation 
into  the  weight  space  enables  the  dialogue  with  the  DM  to 
be  done  in  terms  of  the  objective  values  accumulating  the 
resulting  information  in  the  weight  space. 


In  each  interaction  of  TRIMAP  two  graphs  are  presented  to 
the  DM  The  first  brie  is  the  weight  space  displaying  the 
regions  corresponding  to  each  nondominaled  extreme 
-solution  .already,  known.  Eventual  constraints  on  the- 
variatiqn  of  the  weights  are  also  presented,  whether  they 
are. directly  introduced  into  the  weight 'space  or  result  from 
additional  constraints  imposed  on  the  objective  function 
vaiues  The  •  second  graph  displays  the  nondOMioated 
solutions  already,  computed,  projected  on  a  plane  of  the 
objective  function  space.  Further  details  about  the  working 
of  the,TRIMAP  method  as  well  as  the  main  features  of  its 
computer  implementation. can  be  found  in  III.' 

III.  EVALUATING  THE  IMPACT  OF  CHANGES 
IN  DATA  AND  DM5  PREFERENCES 

In  single  objective  linear  programming,  sensitivity 
(stability,  post-optimal)  analysis  - deals  .with  computing 
ranges. on  the  variation  of. some  initial'data  such  that" the 
optimal  basis  remains  optima]  for  the. perturbed  problem. 
The.  concept  of  optimal  solution  (in  general  unique)  gives 
place  m  multiple  objective  programming  to  the  concept  of 
efficient  solution  (m  general  many,  even  if  only  extreme 
points  are  .  considered).  Moreover,  changes  in  the 
underlying  DM's  preference  structure  as  a  result  of  the 
information  gathered -throughout  the  interactive  process 
must  be  taken  into  account.  'The  definition  stability 
analysis  in  a  multiple  objective  context  is  not- uniformly 
adressed  in  the  literature  (see  (2)). 

In -this  , paper  linear  parametrizations  of  the  objective 
function  matrix  and  the  resource  availability  (right-hand 
side  of  the  constraints)  will  , be  considered.  An  analysis 
based  on  the  weight  space  is  proposed  in  the  framework  of 
the  TRIMAP  package,  which  enables. to  present  graphical 
information  to  the  DM  in  a  way  which  promotes  rapid 
comprehension. 

A.  Changes  in  the  objective  function  malm 

A  perturbation  of  the  objective  function  matrix  depending 
on  a  scalar  parameter  is  considered.  C(y)  -  C  *  y  D  ,  where  D- 
IfiU«Sl2*d3)T  15  a  constant  matrix  and  y  is  a  scalar  parameter 
(y  €  R). 

Let  us  use  the  partitions  A-lB.Nl.  C-ICb.ChI.  D-lDB.DNli 
X-IXBJtNl.  "where  B.  Cb  and  Db  correspond  to  the  basic 
variables  x$,  and  N.  Ctf  and  Dn  correspond  to  the  nonbasic 
variables  is  Let  be  the  set  of  nonbasic  variables. 

for  a  given  efficient  solution,  the  parameter  y  does  not 
have  influence  on  the  primal  feasibility  (B"*  For  the 

solution  to  remain  efficient  the  dual  feasibility  must  be 
satisfied  (with  A  €  A): 

We  then  have 

1T(CB  B-‘  N,  -  CN)> .  y  B-»  Nr  UN|)  *  a  ,  |  €  Jn  , 
and  l  c  A 

(where  N^ ,  %  j  and  Djj  j  denote  the  jth  column  of  matrix  N, 
Cfl  and  Djj,  respectively). 

Let  us  consider  flj-  Cb  B'1  N^  -  Cfjj  and  fij-  Djj  B‘*  N.j  -  Djj  j 
and  write  the  dual  feasibility  condition  as  a,  +  y  £j  *  & 
j  c  & ,  and  A  €  A. 

The  basis  B  will  correspond  to  an  efficient  extreme  point  if 
and  only  if  the  following  system  is  consistent. 

3 

I  >k  (Okj  +  y  Pkj)  i  0  I  €  jn 
k“l 

^  £  A 
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li 

}  }  Tbe  problem  ofV stability  .analysis  of-.a  given  efficient 

«  {  solution  >„will  consist  in  determining  the  range  on. 

|  parameter  y  for  which  the  solution  is  “still  preferred" 

.  ‘  according  to  a  “pattern  of  preferences'  represented  by  the 

indifference  region  of  this. solution  for  the  unperturbed 
'  (y-0),  problem.  The  problem  ‘insists  then  in  finding  the 

minimum  (ymin)  smd  maximum  (ymax)  values  of  the  scalar 
'  parameter  y,  for  each  efficient  solution  selected  by  the  DM. 

«  so’ that  the  intersection  between  .the  unperturbed  and  the 

£  perturbed  indifference  region  is  nonempty.  This  means 

finding  ymin  and  Ymax  subject  tq> 

S  £  «kj*k*°  .  i  £Jn  0») 

5  krl 

?  3  3 

I  <*kj*k  +  y  1  flkj*k*0,  \  €Jh  (3b) 

krl  '  •  krl 

\  For  three-objective  problems,  it  is  possible  to  use  the 

graphical  displays  of  the  indifference  regions  an  an  aid  , to 
the  DM  for  evaluating  the  stability  of  selected  solutions  to 
the  uncertainty,  imprecision  and  changes  associated  with 

the  data  and  his  preference  structt*  t.  Let  •  {j  €  Jn  ’ 

3  3 

£  Pk,  XpO,  in  (3a))  and  S'c  -  (j  £  Jn  :  £  Pkj  Xk<  0,  in  (3a)) . 

,  \  krl  krl 

f  3.3  * 

From  (3b)  it  follows; Ak/  £  Pkj  Ak,  for  j  CSC, 

i  krl  -k-1 

tndys-l  ayXk/  £  PkJ  >k .  for  j£  S'g 
krl  krl 

So,  considering  each  j  €  i*  separately  the  problem  would 
reduce  to  a  linear' fractional  programming  problem  to  be 
optimized  subject  to  (3a).  As  it  is  well  known., this  problem 
has  its  optimum  at  an  extreme  point  of  the  feasible  region. 
The  simultaneous  consideration  of  constraints  (3b)  (which 
are  nonlinear  in  y  and  Ak)  makes  the  problem  more 
complex. 

The  extreme  points  of  each  (unperturbed)  indifference 
region  are  known  explicitly  (given  by  (3a).  as  a  by-product 
of  the  decomposition  of  the  weight  space  in  TRIMAP),  Let  Ej 
»  be  the  set  of  extreme  points  of  the  indifference  region  (a 

>  convex  polygon)  corresponding  to  the  (basic)  efficient 

•  solution  s:  E.-KX^.X'j.X'jXr-I . R,.  |  Xj*l .  XfctO  ,  k-1, 2.3} 

The  dual  feasibility  condition  (3b)  can  then  be  tested 
exhaustively  at  A  €  Es  for  computing  the  range  of  variation 

•  ,or  the  parameter  y.  The  range  bn  the  parameter  y  which 

>  satisfies  (3)  for  a  given  nondommated  solution  s  can  then 
i  be  computed  in  the. following  manner 

j  3  3 

(lei  yj-X  Pki >'t Y- 

I  k-1  K"1 

i  .33 

!  ymta-min(»:j£Sc,  £  akj-Xk+yj  £  Pk|'*k*0, 

}  k-1  k-1 

|  Vj'£JM,j>j,VX£E.) 

j  ymin  "  ■  ®  IT  Sq  •  0 

i  3  3 

;  ym«"tMut(y):jeSc,  £  ay-Xt  +  yj  ,£  Pki’Xk^O, 

t  krl  k-1 

j  vreJN.mv^eE,} 

I  ymax  "  +  ■»  if  S'c  “  0 

,  The  computations  involved  arc  very  simple  and  do  not 

S  require  solving  any  auxiliary  programming  problem. 


The  indifference  regions  of  the  perturbed  problem  change 
continuously  (in  size,  and  form) /with  changes  of  the 
parameter  y.Note  that  due  to  the  nonlinearity  in  y  and  Ak, 
ymin  and  ymax  which'  satisfy  (3)  may  not  correspond  io  an 
extreme  point  of  the  unperturbed'  indifference  region. 
These  cases  are  easily  detected  by  the  algorithm  to  construct 
the  indifference  regions  from  the  system  of  inequalities 
resulting  from 'duaL  feasibility.  In  .these  .circumstances  a 
bissection.  technique  is  used  to  compute  the  actual  values 
Ymin  or’yaax  (or- both)  for  which  the  intersection  reduces 
to  one  point.  This. implies  reconstructing,  the  .perturbed' 
indifference  reglon  for  each  value  of  y.  generated,  by  the 
bissection.  technique  .and  calculating  the  region  where,  it 
intersects  the  -unperturbed >  indifference  region 
(intersection  of  two  convei  polygons^  From  a  computational 
-.point- of -view,  these  are  easy  tasks  .in  the  framework- of 
TRIMAP  (for  instance,  of  a 'much  lower  level  of  complexity 
than  solving  linear,  programming  problems  with 
parameters  in  the  technological  matrix), 

The  area  occupied  by  each  indifference -region  in  the 
weight:  space  may  be  understood  as -a  measure  of  the 
robustness  .of  the  corresponding  nondorainated , solution  to 
the  variation  of  the  weights/ 

The  DM  can  also  visualize  the  changing  of  the  indifference 
regions  in  an  ihtera'ctive  manner  by  varying  y 
dynamically.  This  type  of  visual  interactive  stability 
analysis  can  similarly  be  performed  for  a  group  of 
nondominated  solutions,  namely  those  defining 
nondommated  edges  or.  faces  (which  are  easily  identified  in 
the  weight  space),  by  previously  calculating  their  ranges 
on  the  parameter  y  and  considering  their  intersection.  The 
possibility  of  changing  dynamically  the  value  of  y  also 
enables  to  have  a  visual  interactive  information  about 
regions  of  the  weight  space  wheie  'holes’  appear,  meaning 
that  new  efficient  extreme,  solutions  to  the  perturbed 
problem  can  be  searched  in  those  regions. 

B..Chin8«.in.tbe.feaQui:cc.ayailthiliiy 

Let  us  consider  a  perturbation  of  the  resource  availability 
depending  on  a  scalar  parameter; 

t(t)-  fe*t  h ,  where  h  is  a  constant  m-dimensional 
vector  and  t  is  a  scalar  parameter  (t  €  R). 

For  a  given  efficient  basis  (corresponding  to  an  efficient 
extreme  point),  the  parameter  t  does  not  have  influence  on 
the  dual  feasibility.  For  the  solution  to  remain  efficient  the 
primal  feasibility  must  be  satisfied. 

B’> 

-  B’lh’lB’lfa  %Q. 

For  variations  of  .the  RHS  the  feasible  polyhedron  changes 
itself.  So  the  indifference  regions  do  not  change  in  a 
“smooth’  manner  (as  in  the  case  of  objective  function 
matrix  perturbation)  with  the  variation  of  the  parameter  t, 
but  they  change  ’suddenly’  as  efficient  extreme  points 
appear  or  disappear  because  of  the  changes  m  the  RHS 

Let  S^-(i :  Bj 1  n  >  0)  and  S^-tt ;  B]  *  fc  <  0) 

and  t,--B|*  b.  /  Bj*  h .  (Bj*  denotes  the  ilh  row  of  B'1), 
tmax  and  tmin  can  be  determined  in  the  following  way; 

tffiio  -  mu  ( ti .  l  €  sj>) ,  Into  -  j  St>-  0 

tmii  -  min  ( t, .  i  €  S^l .  tou  -  •“>  if  Sb-  0 

For  t  in  the  range  defined  in  this  way,  the  perturbed  and 
unperturbed  indifference  region,  are  the  same. 
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iv.  An  illustrative  example 

the  Hluitr alive  MOLP,  problem: 

-In  =  r’2  I ‘4  3*i  r60-j 

,1-12  ' O-i  '  ' 

x  € R4.  Let  D-  4'-I  0  -1'  be  ^'"constant  matrix.' 

I  0  1  3  2  j 

TRIMAP  first  com  pules  the  solutions  thal  optimise  each 
objective -functions solutions  1  and  3 'optimise  fj  and  t$, 
respectively:  solutions 2  and  A  are  alternative  optfmawith 
respect  to  f2:  In  fig.  Ib.  the  blaclc  -square' represents  the 
ideal  solution  (the  one  tbatvould  optimise  all  the  objective 
functions  simultaneously),  and'  thev number- after  the 
Identification  of- the  solution  is  the-value  of  (3,  Suppose  that 
the -DM'tben  decided  to  compute  some  more  solutions  in 
order  to  gather  some  knowledge  about  the  set  of 
nondominated  solutions.  The  situation  where  all 
nondominaled  extreme  solutions  are  known  is  displayed-in 
fig.  1  (in  general,  this  is  not  the  aim  of  TRIMAP).  The 
analysis  of  the'two  graphs  shows  that  3  nondominated  faces 
exist,  defined  by  solutions  (1.7,8),  (2.4.6)  and  (2, 6.5.7, 8).  The 
points  lying  on  the  facevdefined  by  solutions  (3.516)  are 
dominated  by  the  points  on  the  edges  (3.5)  and  (5.6) 


Let  us  consider 

r  3  1  2 
“Max"  I  I  -1  2 
-  1-151 


and  the  face  (3,5.6)  also  becomes  nondominated.  In  fig.  2b, 
solutions  2.  3  and  A  are  no  longer  nondominated,  and  only 
'the  face  (1,7,8)  and  the  edge  (5,7)  are  still  nondominated^ 


Fig.  2a  -  The  weight  space  corresponding  to  ymfn 


Fig.  2b  -  The  weight  space  corresponding  to  ym*x 


Fig.  la  -  The  weight  space  graph 


V.  CONCLUSIONS- 


Fig.  lb  -  The  projection  of  the  objective  space  on  f  jf2 


Let  us  suppose  that  the  DM  vants  to  investigate  the  range 
of  variation  of  the  parameter  y  with  respect  to  solution  8. 
The  weight  spaces  corresponding  to  ymin-  0.393  and 
ymax-0  750.  for  which  the  intersection  between  the 
unperturbed  (sec  fig.  1)  and  the  perturbed  indifference 
regions  13  nonempty  (reducing  to  one  point)  are  displayed 
in  ftg<  2.  In  fig.  2a  all  the  solutions  are  still  nondominated, 


An  interactive  approach  to  deal  with  the  uncertainty’  and 
imprecision  arising  in  MOLP  problems  has  been  presented. 
The  uncertainty  and  imprecision >arc  associated  both. with 
the  initial  data  and  the  DM’s  preference  structure  which 
evolves  throughout  the  Interactive  decision,  process, 

The  weight  space  is  used  as  a  graphical  means  for 
collecting  and  presenting  the  information  to  the  DM,  The 
stability  of  nondominated  solutions  to  changes  in  the 
objective  function  matrix  and  in  the  resource  availability 
have  been  analyzed  in  terms  of  the  intersection  of  the 
unperturbed  and  perturbed  indifference  regions 
(perceived  as  a  "pattern  of  preferences*  of  the  DM). 
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Abstract-  The  new  approach  to  investigation  of 
Petri  nets  based  models  of  FMS  is  presented 
in  this* paper.  Its  main  idea  is  to  describe  a 
given  PN  as’’ a  discrete  dynamic  system  and  to 
evaluate  properties  of  this  system.  This  leads 
to  determination  of  a  new  relations  between 
structural  properties  of  PN  and  ability  to 
solve  effectively  some  sheduling  problems  for 
FMS.  It  is  also  showed  that  incidence  matrix 
of  PN  gives  important  information  about 
behaviour  of  this  net  and  hence  about 
properties  of  system  being  investigated. 


I. INTRODUCTION 

We  shall  use  such  denotations:  PN  is  a 
quadruple 

N=(m,n;A*,A“) 

where  m  is  a  number  of  places  enumerated  from 
1  to  m  and.  designated  by  pJt  . ...  ,pm,  n  is  a 
number  of  transitions  enumerated  from  1  to  n 
and  designated  by  ....  t0,  matrices  A*  and 

A~  (both  have  size  mxn  and  tneir  elements  are 
nonnegative  integers)  describe  relations 
between  places  and  transitions: 
a  //  (*"/;> 

is  a  weight  of  arc  connecting  tj  and  pf  (p}  and 
tj,  respectively).  Initial  marking  is  denoted 
by  process  of  PN  firing  is  described  by 
the  chain 

m0  *1(0  '•>  m(I)  ->  *1(2)  ->  m(2)  ■ 

Succession  of  firing  transitions 
’id)  *1(2)" 

gives  a  word  s  of  free  prefix  language  Pf( m 
of  PN  N  and  Parikh  images  t=PI(s)  of 
all  elements  of  form  a  set  of  reachable 

states  T((t>q).  It  should  be  pointed  out  that 
every  reachable  marking  m  corresponds  to  s  as 

m=(A*-A-)PI(s)+m0 , 

where  m0  ->  s  ->  m.  Let  t°,  fU/  Do  a  set 
of  integer  nonnegative  n-vectors  connected  by 
the  recurrent  system  of  unequalities 

/  >'  »  s-’ 

l A*/''  (,! 

where  1=1, ..„k  and  t°=0n, 

2. RELATIONS  BETWEEN  WORDS  OF  PETRI  NET 
AND  SOLUTIONS  OF  SYSTEM  ( 1 ) 

JL&Eca.  1.  For  every  solution  of  system  (1) 
there  exists  a  word  s=sfs^,.SL.  of  Pr(mfi J  such 
that  for  every  f=1,...k 


t/zPKSjSf..^).  (2) 


2.  For  every  finite  $  belonging  to  P*(mrd  there 
exists  a  number  k  and  a  set 

(t°,  tki  - 

solution  of  system  (1)  such  that  for  every 
s  can  be  broken  into  such  subwords  s1t 
$2-',  Sj  that  (2)  holds. 

3.  For  every  positive  integer  k' there  exists 
an  isomorphism  between  subset 

F^(mp)  including  all  words  of  length  k  and  set 
of  those  solutions  of  system  (l)  which  are 
related  by  the  equation 

for  every  (=1,...,k.  D 

This  lemma  gives  ground  for  investigation  of 
properties  of  PN  N  by  means  of  analysis  of 
system  >(1)  which  can  be  treated  as  a. 
simulation  tool  for  real  system. 

Let  us  discuss  two  combinatorial  problems 
concerning  Petri  net  theory.  The  first  one  is 
formulated  as  follows. 

Suppose  that  PN  //describes  some  FMS  and  such 
a  task  is  formulated:  find 

max  (c,tk)  if  (tP,  t1,...,  tk) 

belongs  to  a  set  of  solutions  (S' 

of  system  ( 1 ) . 


Solution  of  this  task  will  give  optimal  in 
definite  sense  shedules  of  functioning  of  FMS 
given. 

Theorem  KHI  ).  1.  Task  (1),  (3)  is  strongly 
NP-complete. 

2.  If  PN  N  is  a  finite  automaton  or  a  marked 
graph  then  task  (l),(3)  can  be  solved  by 
time  which  is  evaluated  as  a  polynomial 
function  of  m,  n  and  k.  O 


Proof  of  part  t  of  this  theorem  is  based  on 
the  reduction  of  a  task  of  finding  a 
hamtltoman  path  in  arbitrary  oriented  graph 
to  task  ( 1 ) , (3)  having  special  structure. 
Proof  of  part  2  includes  nontrivial 
transformations  of  system  (t)  to  show  that 
matrix  of  limitations  of  this  system  is 
totally  ummodular. 

This  theorem  implies  that  for  sheduling  FMS 
we  have  to  use  heuristics  in  the  case  of 
general  PN  and  strict  algorithms  developed 
in  theory  of  combinatorial  optimisation  if  PN 
is  an  ordinary  one. 

The  second  problem  being  investigated 
concerns  relations  between  structural 
properties  of  its  incidence  matrix  A~A*mA  . 

It  should  be  pointed  out  that  there  exists 
theoretically  and  practically  effective 
algorithm  for  search  of  PN  invariants  (f2j): 
this  means  that  such  a  problem  is  of  real 
interest  when  investigating  and  designing  FMS. 

We  shall  analyse  system  Ax^  where  x  is 
nonnegative  real  n-vector.  We  shall  say  that 
condition  (4)  holds  if 
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set  of  nonnegative  solutions 
.  of-  system  of  unequations  Ax)Om 
coincides  with  set  of  nonnegative 
solutions  of  system'of  equations  (4) 
A’xxOm  where  A’  is  received  from 
A  by  deleting  those  rows  of  A 
which  correspond  to  excessive 
limitations  of  system  Ax#)m  where 
all  x  are  nonnegative 

Theorem  21  C3l ) .  If  condition  (4)  holds  then 
set  of  reachable  states  King)  of  .PN  is 
semilinear  and  free  prefix  language  P  (mg)  of 
this  net  is  regular  for  every  initial  marking 

Corollary.  Fair  (in..the  sence  of  definition  of 
paper  C*3  )  PNs'  have  semilinear  sets  of 
reachable  states  T(mg)  for  every  mg  d 
Note'  that  the  validity  of  condition  .(4)  for 
orbitrary  PN  , 

can  be  recognized  with  the  help  of  algorithm 
proposed  in  (F2 J).  .  - 

In  conclusion  it  should  be  pointed  out  that 
search  of  PN  invariants  is  realised  as  a 
program  for  IBM/PC  which  showed  good  results 
for  PNs  with  50  or  lessrivt^ansitions  (if 
T-invariants  were  being  sought).  Now  it  is 
being  modified  to  fit  tasks  with  1000 
transitions. 
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FINITE  ELEMENT  SOLUTION  OF  THE  REVERSE  SPAGHETTI  PROBLEM 


MadanaM.  G.  GopaL 
EASI 
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Abstract:  In  this  study,  a  mathematical  model  of  a  flexible  paper 
strip  moving  axially  in  a  fluid  medium  under  the  inertial  ana 


large  displacements.  The  model  equations  are  <3 
basis  of  Newton’s  laws  of  motion  and  moment  equilibrium. 

These  equations  are  solved  using  a  semi-discrete  finite  dement 
analysis  to  produce' the  strip’s  trajectory  and  configuration  in 
spatio-temporal  domain.  In  the  absence  of  the  aerodynamic 
forces  these  equations  degenerate  to  the  case  of  the  so-called 
"reverse  spaghetti"problem  of  an  elastica  emerging  from  a 
horizontal  guide.'  Further,  the  motion  of  the  strip  is  experimen¬ 
tally  investigated,  and  its  profile  and  the  leading  edge  trajectory, 
are  compared/  The  results  are  in  good  agreement  The  method 
of  analysis  and  the  results  are  applicable  to  the  study  of  motion 
of  thin  paper  strips  such  as  documents  in  a  document-processing 
machine. 

I.  INTRODUCTION 

Industrial  applications  of  axially  moving  materials  such  as 
traveling  strings,  band  saws  and  magnetic  tapes  are  numerous 
(1).  In  most  cases,  these  materials  are  continuous  and  move  on 
multiple  supports.  However,  there  are  situations  such  as  paper 
transport  in  copying  and  check  processing  machines  in  which  the 
material  has  finite  length  and  is  not  supported  on  both  its  leading 
and  trailing  edges.  Most  often  it  is  supported  on  one  edge  and 
the  leading  edge  is  free. 

Carrier  [2]  determined  the  motion  of  a  dangling,  inexten* 
sible  string  that  was  drawn  upward  through  a  hole  in  a  rigid  wall. 
He  called  this  type  of  problem  “the  spaghetti  problem,'  The 
'reverse  spaghetti  problem"  which  deals  with  the  drooping 
motion  of  an  axially-moving  elastica  issuing  from  a  horizontal 
guide  was  solved  by  Mansfield  and  Simmonds  (3),  where  only  the 
inertial  effect  on  the  motion  of  the  material  was  considered. 

This  analysis  is  applicable  only  when  the  elastica  has  a  high 
weight-to-stiffness  ratio  and  low  velocities. 

The  present  study  differs  from  the  above  problems  in  that 
the  moving  material  has  very  low  bending  rigidity  and  is  pro¬ 
pelled  at  high  velocities,  as  in  the  case  of  a  high  trajectory  of  the 
leading  edge,  which  is  one  of  the  main  concerns  in  a  high-speed 
environment.  Thepresent  effort  addresses  this  issue  with  a  view 
of  assisting  the  designers  of  high-speed  equipment  that  handles 
axially  moving  materials. 


•Xavier  J.  R.  Avula 
University” of  Missouri-Kolla 
Rolla,  Missouri  65401  U.S.A. 

where  6  (•  )  is  a  dirac-delta  function. 

The  equation  of  motion  in  the  y-direction  is 

♦  *{X,T)  ♦  f.u<  (X-L)  *il*  " 


.ill 

’  IT* 


(2) 


where  fd(X,T),  £u,  and  fjj^  are  the  drag,  lift,  leading 
edge  suction,  and  initial  impulse,  respectively. 

Using  the  notation 


x  x 

d<.- 

x  i 


•(■>-<•>«  Btc. 


t{ C,M  dr  ♦  =W«,t).i.*<L,T>d{ 


♦  fIM  »ll*(t,T)dC  ♦  Bb  I  ViT.  «**><*{ 


(3) 


where 


X({iT)dT  -  -mt b 


fX 

yTT«r*>dc  ♦  *v 

it 


X 

►  V*  y(((f,T)d< 


capable  of  undergoing  large  deformations  are  derived  using 
Newton’s  laws  of  motion  and  moment  equilibrium.  The  effect  of 
aerodynamic  fortes  is  brought  in  through  an  'added  mass a 
concept  developed  from  the  theory  of  hydrodynamics.  Applica¬ 
tion  of  these  equations  leads  to  a  system  of  coupled  nonlinear 
partial  differential  equations.  A  semi-discrete  finite  element 
approximation  is  used  to  solve  these  equations  subjected  to 
appropriate  initial  and  boundary  conditions. 

II.  MATHEMATICAL  MODF.L 

The  geometrical  representation  of  an  emerging  elastica  is 
shown  in  Figure  1.  An  element  of  a  thin  flexible  strip  with 
various  forces  acting  on  it  are  shown  in  Figure  2. 

Equation  of  motion  in  the  x*direction  is 


-  -B^btOxT  ♦  ZVCQxxtX.T)  -  Oxrt^*)) 

♦  V»(»IM(X,T)  -  •irv»(L,T>n 

fX  Bb It 

]  fsu  <<e,X,)»iM<L,T)d{  -  (QJt(X'.T) 

♦  2VQxt(L.T>  ♦  V*  »inV(L,T)  )»in#(L,T) 

j  f1H  «litf({,T)d<  -  J  ,laJ((,T)d{ 

-  »1VV*>C9T-L'2(QX(X,T)  -  Qx(I«,T)l 
ftb  I  yrr«.T>4t  -  *toOrr<x,T) 


(4) 


<5) 


(6) 


(7) 


Introducing  the  above  integrals,  the  equations  of  motion 
become 

H(X,T)  •  ab  |dx  -  |  (Oxr<I..T)  ♦  iVOx^L.T)  »iltf  <L,T) 

♦  VUinV<L,T)l  cos£ (L. T)  ♦  V»T-1/2{Px(X,T) 

-  ♦  r*ij  (8) 

where 


~  -  fd(X,T>  ♦  trjt  (X-L)  co*i  (L,T)  -  fIK  cotfi  -  (1) 
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-■i; 


<*«  ♦ 


(9) 


V(X,T)  -  *b|g(X-L)  +  (1  +  fOORt*  2*v  «bcT<x»T>  "  Oxt^’V) 

♦  lV*(tll*(X,T)  -  slx*(L,T)>  r  -  (Qjxft.T) 

♦  ZVQxxdvT)  sln/}(L<T)  ♦  v*  «lnV(L,T)>  *iitf(X.,T) 

♦  X*  pV^V^QjcCX,*)  -  Qx<L,T))|  (10) 


y< 


MSC€TMCOU©e.O<X<  C-VT  0UTS®€ THE OUOE .  t-VT  <  X  <  t 
x(X,T)  ■  X*  VT  X(X,T)  -  t*X,(X,T) 

V(X.T)  -0  y(x.T)  *  v,<x.T> 

Figure  1.  Geometrical  representation 
of  an  emerging  elastica. 
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and 


(15) 


qh(o).- 


N 

qj(f)*3(j(0) 


(16) 


where  ;  are  piecewise  linear  or  quadratic  polynomials  which 
satisfy  the  essential  boundary  conditions. 


r*2,i<0);-  +3,i<«>  “  0  (17) 

Using  backward  difference  for  time,  the  weak  formulation  of  the 
problem  is  performed.  ■  Omitting  the  details  due  to  length  limi¬ 
tation,  the  stiffness  matrix  expressed  in  local  coordinates  after 
assembly  leads  to  a  banded,  unsymmetric  global  stiffness  matrix. 
For  a  linear  element,  the  element  stiffness  matrix  is  6x6  and  for 
quadratic  element  it  is  9x9. -The  method  of  solution  discussed 
herejpertains  to  modified  Newton’s  method/  For  convenience, 
the  time  domain  is  approximated  by  backward  difference  and  it 
is  conditionally  stable.  <Thus  for  accuracy,  the  time  step  hr 
should  be  smaller  than  "h,"  the  grid  spacing  used  in  finite  ele¬ 
ment  discretization.  Choosing  a  smaller  time  step  also  ensures 
that  no  oscillations  in  the  solution  are  introduced.  In  most  of  the 
cases  considered,'  i.e.  for  problems  without  aerodynamic  forces, 
the  time  step  nr  *  0.02  or  nr  »  0.01  was  found  sufficient  to 
produce  good  results. 


Figure  2.  Internal  and  external  forces 
and  aonents  acting  on  an 
emerged  portion  of  a  flexible 
•trip. 


The  boundary  conditions  are 

•tX-L-VT,  J-Qx-0,  Px  -  L 
It  X  -  l  i  P-Q-0,  and  *x  *  jji* 


(U) 


With  the  added  mass  ratio  p  »  0  the  above  equations  degener¬ 
ate  to  the  "non*aerodynanuc"  case  and  yields  the  so-called  "rev¬ 
erse  spaghetti  problem." 

Ill,  THE  FINITE  ELEMENT  SOLUTION 

Using  the  finite  difference  for  the  time  domain  and  finite 
elements  for  space  discretization,  a  finiteelement  grid  is  con- 
stmeted  on  (0,11  with  0,  p,  and  q  as  the  nodal  degrees  of  free¬ 
dom. 


tot  P  •  L2P  arid  Q  -  L2q  X  -  L(1  -  s)  (12) 

and  T  *  (L/V) t  (13) 

where  s  -  1  -  (X/L)  or  (ds/dx)  ■  -(1/L) 

An  approximate  solution  is  constructed  on  an  element  of 
length  "h"  as 

*•»(«>  -  0j(T>n,j(«>>  (14) 


IV.  REVERSE  SPAGHETTI  PROBLEM:  DISCUSSION 

The  present  day  document  processing  equipment  depend¬ 
ing  upon  their  printing  or  processing  speed,  maybe  generally 
categorized  as  slow,  medium,  and  high  speed  processors.  In  the 
case  of  slow  processors  like  a  fax  machine,  or  simple  graphic 
plotter,  the  document  is  propelled  at  less  than  1  m/sec.  The 
motion  of  this  type  of  document  may  be  easily  modeled  under 
reverse  spaghetti  problem' category.  In  medium  speed  proces¬ 
sors  like  commonly  used  copy  machines,  the  typical  document 
velocity  may  range  from  1  to  3  or  4  m/s,  I^botn  these  cases, 
commonly  used  bond  papers  are  8x"x  11"  with  mass  density  of 
about  60  to  100  gm/m2,  tnickness  of  each  sheet  in  the  range  of 
0.01  to  0.02  mm  and  weight  stiffness  ratio  of  about  50  to  100. 


Figure  3  shows  the  effect  of  various  nondimensional 
velocities  for  a  specific  value  of  weight  stiffness  ratio.  As  no 
account  of  aerodynamic  forces  are  considered,  results  of  this 
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for  low  to  lower  mednim  velocities.  As  in  static  «sc,  the  qua¬ 
dratic  element  solution  better predicts  the  comment  kinematics. 
The  series  solution  given  by  Mansfield  and  Simmoods  [2]  is  also 
produced  as  a  part  of  the  overall  solution  scheme  for  comparison 
purposes-  In  fact,  the  series  solution  serves  as  a  good  startm* 
solution  to  tbe  FEM  approximation.  In  the  program,  when  the 
user  chooses  to  run  this  case,  then  the  program  automatically 
calculates  tbe  series  solution  for  each  time  step  before  proceed- 
ing-with  finite  element  solution.  Tbe  user  has  an  option  to 
indicate  for  bow  many  time  steps  this  series  solution  h  to  be  used 
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IKTERFAOAL  HEAT  TRANSFER  BY  EVERSION 


Wesson  Kkxa 
Ibsvrnltyof  Pcer^Rko 
Afryzgcez.  Pccrto  Rico 


Abstract:  Heat  transfer  tedaaqcc*  are  weB  established  with  an 
abendanceofrelevaaiEtriainre.  However,  this  paper  aaec^ts 
to  add  a  different  perspedivu  through  the  process  of  ccsvtctrc 
eversion;  which  im*>5csske  'mixing  of  fluid  layers  hot  or  cold, 
2cd,in  tbs  pztiiGL'iccisc  the  nrriag  of  cold  turbulent  bulk 
fluid  with  hoi  is&rfacul  Arid  ihrcwgn  the  process  cf  ojrface 
•renewal,  winds  was  first  propounded  by  Dane k»nrts  injmass- 
tniusfer  processes- 


Daacfcwens  theory  ct  surface  renews!  wvs  propounded  to 
explain  mass  transfer  across  interfaces  where  tarbclebce  from 
the  bulk  phase  extended  to  such  ialefa<ns,acd  where  stagnant 
layer  theory  led  to  erroneous  results. 

The  theory  of  Danckwerts  cot  only  led  to  more  satisfact¬ 
ory  results  for  mass  transfer,  but  was  more  physically  realistic  in 
explaining  the  mechanism  bywb'ch  transfer  took  place,  and,  in 
essence  described  the  more  modem  view  of  convective 
eversion. 


The  mean  rate  of  production  of  fresh  surface  will  be  constant 
and  equal  to  S  and  the  chance  of  an  element  of  surface  being 
replaced  within  a  pven  tune  is  assumed  to  be  independent  of 
its  age;  then  the  fractional  rate  of  replacement  of  elements  in 
any  age  group  is  equal  to  S. 

Let  the  area  of  surface  comprising  those  elements  having 
ages  between  0  and  (0  +  d9)by«(0)d9.  It  is  not  difficult  to 
showth2i 

J  4*  d9  -  1  and  «■  -  S  e_sQ.  (2) 

0 


The  rate  of  absorption  of  heat  into  those  elements  of 
surface  having  age  0  and  combined  area  Se"50  is  obtained  from 
above  as. 


R 


(Tx  -  T)  Se_s3 


(3) 


Convective  eversion  is  really  the  mixing  of  adjacent  layers 
or  regions  of  fluid,  with  an  interchange  of  all  physical  proper¬ 
ties,  such  as  mass,  heat,  momentum,  and  energy.  This  notion 
can  easily  be  extended  to  embrace  the  mixing  of  turbulent  bulk 
fluid  with  interface:  fluid  by  the  propagation  of  eddies  or 
migration  of  eddies  into  the  interface,  which  Danckwerts 
described  as  turbulence  extending  right  into  the  interface  due 
to  its  very  nature  of  a  mass  of  eddies,  which  incessantly  change 
shapes  and  sizes  and  fluctuate  randomly  beneath  the  surface. 


After  some  mathematical  manipulation  we  find  that 

R  -  (Tx  -  T)  JkS  .  (4) 

I1L  EXTENSION  OF  CONVECTIVE  EVERSION  TQ 
CONTAMINATED  SURFACES 


It  is  in  this  sense  that  the  principle  of  convective  eversion 
is  applied,  that  is,  the  mixing  of  propelled  eddies  from  the  bulk 
into  the  interface,  which  bring  their  own  physical  properties 
and  displace  surface  elements  with  corresponding  physical 
properties,  but  different  values. 


In  thephysical  model  contemplated  we  envisage  eddies  as 
pieces  of  fluid  impelled  into  the  interface  from  the  bulk 
turbulent  phase.  We  also  interpret  S  in  the  model,  which  is  the 
fractional  rate  of  replacement  of  fresh  liquid  in  the  interface,  as 
the  eddy  frequency. 


II.  HIGHLIGHTS  OF  DANCKWERTS  THEORY 
(Adapted  to  Heat  Transfer) 

When  a  liquid  is  in  turbulent  motion  it  is  a  mass  of  eddies 
which  incessantly  change  their  conformation  and  position. 
These  eddies  are  pictured  as  continually  exposing  fresh 
surfaces  to  the  gas  or  other  phase,  while  sweeping  away  and 
mixing  into  the  bulk  parts  of  the  surface  which  have  been  in 
contact  with  the  other  phase  for  vaiying  lengths  of  time.  The 
assumption  is  made  that  during  the  time  of  residence  of  any 
portion  of  the  liquid  it  absorbs  heat  at  a  rate  given  by 


*(*) 


(TX  _  T) 


(1) 


which  assumes  that  the  scale  of  turbulence  is  much  greater  than 
the  depth  of  penetration  of  heat  conduction  from  the  surface. 
Here,  T*  is  the  interface  temperature,  T  is  the  bulk  fluid 
temperature,  k  is  the  thermal  diffusivity  and  $  is  the  exposure 
time  of  eddy  to  upper  phase. 

In  particular,  consider  a  liquid  which  is  maintained  in 
turbulent  motion  by  stirring  at  a  steady  rate.  The  total  area  of 
the  surface  exposed  to  the  other  phase  being  unity  and  the 
average  rate  of  heat  absorption  is  uniform  over  the  area.  The 
motion  of  the  liquid  will  continually  replace  with  fresh  surface 
elements  that  are  older  and  exposed  for  a  finite  time  interval. 


Interfacial  heat  transfer  by  convective  eversion  should  be 
damped  and  sensitive  to  small  quantities  of  surface  impurities, 
and,  this  damping  must  be  hydrodynamic  in  nature,  hence  S  the 
eddy  frequency  must  be  damped. 

Hence  S  must  be  a  function  of  C$~\  the  surface  compres- 
sional  modulus  of  elasticity  of  the  film  molecules. 

Let  Vq  be  a  characteristic  velocity  with  which  an  eddy  is 
impelled  into  the  interface.  The  eddy  possesses  kinetic  energy 
proportional  to  pX3Vg2,  where  p  is  the  density  of  the  liquid 
and  A  the  average  size  of  an  eddy. 

This  energy  is  then  expended  at  the  interface  in  overcom¬ 
ing  the  forces  ofsurface  tension  and  the  surface  compressions! 
modulus  of  elasticity,  which  behaves  like  surface  tension  but 
acts  tangentially  to  the  surface  only. 

From  the  consideration  of  energy  balance,  neglecting 
gravity,  we  get 


(c'1  +  r)  *2 


or. 


x« 


+  \ 


fV02 


(5) 
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(3) 


Susce  tnrbdence  is  characterized  by  many  sspermposed 
p-riwEdties,  then  by  analogy  with  nee  motion  we  writs 
S  =  Vnft,  where  the  size  of  tbe.eddy  corresponds  to  tie 
wavelength.  Therefore,  s  =  p  VC3A  +  Cf1,  revealing  tie 
d2~!~g  effect  of  the  scrfactant  in  general  2S 


f  pkVp  I1/2 

E»  (7*  -  7)  - -  (6) 

U  +  ts"lj 

For  stored  nstems  Vo  =  NL,  where  N  is  the  stirring  speed  end 
Ltie  sire  of  tie  stirrer  blades.  Therefore, 


R*  (T*  -  7) 


(Re) 3/2 
h  +  cs'l)V2 


(7) 


For  tnrimkncs  in  genera 


(V03/2 

fr  +  cs-l)V2 


(i*  - 


7) 
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2-D  (DIFFERENTlAlr-DELAY)  IMPLICIT* SYSTEMS 

Stephen  L.  Caxiipbell*- 
Departorat  of  MaiheotiScs 
L  Center  for  Research  xa  Scientific  Compnti&g 
North  Cuclia  Sutc  Ushevty 
RaJes'gk  XC  27695-S205  USA* 


Attract.  Taa  paper  diseases  Eaev  time  rnrsnMS  differ  cctlzj-dcliy 
ifjoiptor  lyilrcs.  The  literature  b  surveys?  **rera3  rfjtcrvat  ions 
arecude. 

I.  INTRODUCTION 

The  gtaml  EnejC  time  invariant  2D  system  take*  tbe  form 
AD<D,x&s) + &pa(t,M)+CP,z(t,s)+Dx(t,s)  =  £w(t,s)  (l) 

whete  x(f,s)  &  *  function  of  two  variable*  acd J>,  a  cither  *  differ¬ 
ential  operator  J fir  or  a  shift  (delay)  operator  Vtz(r)  —  x{r  ^-  a). 
There  a  a a  extensive  bteralure  oa  Use  partial  differential  equations, 
Azt t  +  Bit + Cx,  -tDz-  £t»  where  both  operators  are  differentiation 
although  the  singular  case  is  lessstudied.  There  is  a  growing  literature 
oa  the  classical  2-D  system  (3J ^9]  when  both  operators  are  discrete 

Ax;/ -f  Bx,jti  +  Cr.+ij  +  =  £u,+i0+i  (2) 

Cess  work  has  t-een  <bc?  oa  implicit  singular  differential  delay  system* 
where  out  operator  is  discrete  and  the  other  is  continuous 

Az\t  -  o)  -f  Bxf(t)  +  Cz[t  —  o)  -t-  Dz{t)  =  £u(t)  (5) 

This  is  surprising  given  the  importance  of  delay  system?  and  the  grow¬ 
ing  literature  oa  singular  system*  (2)4fj,(5j.  The  ccly  paper  we  are 
aware  of  that  deals  directly  with  (3)  is  [3J  which  concerned  the  system 

Bz{t)-~-Ct{t~-a)  +  Dz{t)  aTii(f)  (4) 

la  this  paper  we  make  several  observation*  about  (3).  The  need 
fet  orr.ity  presents  a  careful  discussion  of  the  applications  of  delay 
equeiions.  However,  one  example  is  instructive, 

Example  1  Suppose  we  have  a  descriptor  control  system 

Bz\t)  +  Dz(t)  -  Eii[t)  (5a) 

y(0  =  Cz{l)  (5b) 

a©d  impltnwit  a  feedback  control  law  u(l)  —  Ay(<)  +  v(f).  However, 
a  twee  unit*  elapse  between  the  observation  j ,<(<)  and  application  of 
the  control.  Then  (5b)  is  Br'(l)+D*(l)  =  £*.(<  -a).  The  closed  loop 
system  is  then  in  the  form  (4) 

Bz\t)  -  EKCz[t  -  a)  -f  Dz(t)  -  Ev{i)  (5) 

If  we  feed  hick  velocity  information  using  u(<)  =  Ky(i)  +  Lp'(f)  +  i(f) 
the  system  U  in  the  form  (3) 

-  BLCz’(t  -  *)  +  Bz\l)  -  EKCx{t  -  a)  f  Dx(<)  =  £t(l)  (7) 

Derivative  feedback  can  sometimes  regularize  a  1-D  singular  sys¬ 
tem.  Equation  (7)  shows  that  derivative  feedback  with  delay  can  never 
regulame  the  system  by  producing  a  nonsingular  A  coefficient. 

With  (2)  any  result  based  ©d  the  pencil  A D  +  B  implied  a  similar 
■  result  based  on  U»®  pencil  AD  +  C.  The  differences  between  the  dif¬ 
ferential  and  delay  operators  problem  makes  such  a  duality  much  less 
obvious  iij  the  differential-delay  case. 

*R«<*xck  MpporUd  U  put  by  A#  U.  S  Army  Research  Ofice  a»da  DAALOJ- 
3S-D-OCOJ,  ud  tie  Haitosal  S<x»«  Fetiduiom  udu  ECS- 90 1 2 >00, 


2-  NOTATION 

Let  a  bs  a  cocaegalive  real  number  and  x,(t)  a  function  defined 
on  the  t- interval  (0,a).  The  solution  x(f)  of  (3)  is  x(l)  =  x.(<«)  foe 
-i  €  ((«  — J)a,toJ  for  i  >  0-  where 

rtia  ~  max  {integer*  m  :  t  —  ma  <  0}  (8a) 

<»  =  f-(m.-l)o  ($b) 

We  axe  interested  in  (3)  for  f  >  0.  The  function  xo(t)  gives  the  initial 
data  for  (3).  With  this  notation  (3)  becomes 

Aift) + +Ci.r<) + =  £«.«(<),  0  <  !  <  o,  i>0 

(9) 

3.  NONSINGULAR  D5LAY  EQUATIONS 
The  scalar  delay  equation 

az\t  -o)  +  bz(t)  +  ex(t  -  a)  +  dx(t)  =  /(f)  (10) 

is:  of  retarded  »ype  if  as  0,5  ^  0,  of  neutral  type  if  a  0,5  ^  0,  and 
of  advanced  type  if  a  ^  0,5  =  0  and  d  £  0  (1J,(8J 

Suppose  we  have  the  delay  equation  of  retarded  type 

*:„(<)  +  «fc.*i(f)  +  «,(<)  =  E«.f ,(f),  0  <  t  <  a,  i  >  0  (11) 

Given  xo(f),  we  may  recursively  so’ve  (II)  by  taking  the  initial  con¬ 
dition*  as  x,ti(0)  =  x,(o),  »  >  0  If  u(i)  is  infinitely  differcntiab’ ?, 
the  solution  will  be  continuous  and  not  differentiable  at  t  =  0  if 
Xo(a")  5^  xJ(O^)  Succeeding  x,(l)  become  smoother,  and  x(t)  will 
be  ut  least  i  —  1  times  continuously  differentiable  at  t  =  ia.  This  is 
important  for  numerical  methods  since  a  point  of  reduced  smoothness 
for  x(f)  can  create  difficulties  with  convergence  and  error  control.  If 
u(f)  ha*  a  jump  discontinuity  at  i  =  e,  then  xft)  will  still  be  i- 1  times 
differentiable  at  t  =  c+  fa. 

For  an  equation  of  advanced  type,  (10)  give*  x(f)  =  d“x[/(<)  - 
ax?{t  -  a)  -  ex(f  -  a)]  so  that  there  is  no  smoothing  effect 

4.  XB  +  D  REGULAR 

By  letting  y(t)  =  x(<  —  o),  s  =  (*r,yT)r,  (3)  can  be  written 

[;  :]■»*(;,  ;]*--[?)« 

(1!) 

which  is  (3)  with  A  =  0.  Thus  a  singular  differential-delay  system 
V7huh  appears  retarded  may  actually  include  subsystems  of  all  types 
In  this  note  we  shall  consider  systems  (3)  for  which  AD  -f  D  is  a 
regular  pencil .  7£-e  corresponding  pencil  for  (12)  is 


Proposition  1  TJle  pencil  (13)  is  re$#far  if  and  only  if  AD  -f  D  is 
rejator.  Also,  the  index  of  (13)  is  the  same  as  the  index  of  the  pencil 
AB  +  D. 
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v4«-> 


Scp^otc  that  XB  -r  D  m  x  regular  peed L  By  Propodtfoo  1  aid 
(12)  «>e  may  amiae  that  /{  =  0  aad  lie  system  m  ia  the  fora  (4). 
Tba  by  the  cscal  sagalur  system  theecy  [4)tf5]  we  ary  solve  (S)  foe 
Z4i(0  P'ta  x,(t).  Since  we  are  interested  ha  behavior  more  than 
sofa  lion  fonnula  here,  we  wj2  perform  constant  coordinate  change*  oo 
(4)  based  oa  the  penal  AB+Dtoget 

/(0+i>l*(0  =  BuM  +  CiKt-al  +  Cjvit-a)  (1 4a) 
Ad'OHstf)  =  £j<<)  +  q,r(f  -  o) +C,tr((  -  a)  (Kb) 

where  A'1  =  0,  A'*'1  £  0  for  Kxne  initjer  t  >  1  (4]J5).  Tbeo 

A— I* 

<r(<)  =  ^(-AT!£i^’(<i  +  G.-W((  _  „)  +  cJ»(t  -  a))  (14c) 

•sO 

Equations (14b),  (14c)  have  several  consequences.  Solutions  of 
(3>  e*a  be  continuous  oo  only  finite  intervals  even  IT  u(t)  is  infinitely 
differentiable.  An  example  is  prn  in  (3)  where  if  C,  is  the  set  of 
initial  conditions  whose  solutions  are  continuous  on  (0,n),  then  €*+ » 
is  a  proper  subset  of  C*  for  every  n  >  0.  Secondly;  discontinuities  do 
not  necessarily  get  smoothed  out  as  with  the  nonsingular  problem. 

Example  2  Consider  the  2-D  system  with  a  —  1 ,  r  =  [zT,  y^J”. 

J'IJpw+[“o  ?]*»-['!  (,5) 

The  x  component  of  z  satisfies  the  equation  of  advanced  type  x(t)  =r 
z'(<-l)  +  •/(()  to  tbit  i(lj  =  r<*)(l  -  m)  +  £2*  wt*t’>(<  -  0  Note 
that  x(f)  becomes  progressively  less  smooth.  Discontinuities  of  r(t) 
not  only  persist,  but  become  more  impulsive. 

Index  One  Systems^ 

For  the  remainder  of  this  subsection  assume  the  system  is  (4)  and 
that  AS  +  D  is  an  index  one  pencil  If  B  -  0,  then  D  is  nonsingular,* 
and  (14)  becomes  the  functional  equation 


v(t)  =  Giw(f  -  o)  +  £at»(0 


with  solution 


If  we  set  a  =  0  in  the  *>stem  (3),  we  get 

(A  +  B)z'(t)  +  (C  +  DM!)  =  £u(t) 


The  pencil  A(A  ■*■£)+ (C+D)  need  not  be  regular  if  XB+Du.  Even  if 
A  is  3ccsmgclar,  (19)  cax  bcsicgclar  and  the  pen  ril  X{AJ.B)+  (C+  D) 
may  be  regular  oc  sisf’vlar  (19).  Do  the  sclutbcs  converge  rs  a  —  Q*? 

Suppose  that  we  have  (4).  One  likely  necessary  condition  is  that 
tie  roots  of  the  complex  f aactico 


*{*)=  det  (sB+D+Cc"") 


(20) 


are  bounded  above  in  real  part  as  a —  0*.  See  (l)#)  for  the  malbe- 
matical  techniques.  Here  we  look  at  a  simple  index  one  example. 

Example  3  Consider  the  index  one  system 

uif)=  e<«-«)  +  *(9  (21) 

whose  solution  is 

•u-1 

“••(<)  =  r”u(l  -m„a)  +  £  r'rfl  -  ia)  (22) 

ltd 

The  limiting  equation  for  (21)  as  a—  0**  h 

a(l)=nc(l)  +  c(()  (23) 

which  is  degenerate  if  r  =  1  and  forri  1  has  the  unique  solution 


“<<)  =  ^‘(0 


(24) 


If  [r|  >  1,  aj)d  either  c(t)  f  0  or  u(0 ~ )  jf  0,  tbcn,tr0(t)  docs  not 
converge  to  (24)  On  the  other  hand,"  if  [r|  <  1,  ud  v(t)  is  continuous 
on  the  interval  (—1,1),  then  v0(t)  does  converge  to  (24)  as  a  —  O'*-. 
Note  that  for  (21)  we  have  that 


*(*)  =  det  («B  +  D  +  Ce"~)=(l-re— ) 


(25) 


(16) 


(17) 


ic(t)  =  C?*tc(l  -  m*a)+  £  C^£r2u(<  —  ia) 

««o 

where  m«  is  given  by  (8).  If  B  0  and  B  is  angular,  (14)  is 


r'(t)  + Diz(t)  =  £iu(0  +  Cjz(<  — a)  +  Cau(f  —  a)  (18a) 
u>(t)  =  Eju(t)  + Cjr(t  -a)  +  C«u{<  —  a)  (18b) 

Given  r(f),ir(f)  on  («a,(i+  l)a),  the  system  (18)  gives  z(f),ic(t) 
on  ((t  +  l)a,(i+  2)aJ.  Discontinuities  interior  to  the  interval  may  be 
carried  over  in  v(t)  but  not  in  r(t).  If  u(t)  is  piecewise  continuous,  wc 
will  have  that  w(l)  «  piecewise  continuous  but  r{<)  is  continuous.  If 
u(t)  is  continuous,  and  the  initial  conditions  are  continuous,  then  u-(t) 
is  continuous  except  possibly  at  integer  multiple*  of  a  and  z(t)  will 
be  differentiable  except  possibly  at  integer  multiple*  of  a  where  z(f) 
will  be  only  continuous.  However,  z{<)  will  have  left  and  right  hand 
derivatives  at  these  point*. 

Thus  if  XB  +  D  is  index  one,  (3)  act*  like  either  a  neutral  or  a 
retarded  system.  If  A 13  +  D  is  higher  index,  the  behavior  will  possibly 
be  of  advanced  type. 

5.  CONTINUITY  IN  a 


which  has  the  roots  *  =  -  £  (-  In  |r|  +  »2nr) ,  i  =  0,1,2,..  The  roots  of 

(25)  have  real  part  bounded  above  as  a  —  0*  if  and  only  if  |r|  <  1  The 

case  r  =  1  show*  that  additional  conditions  beyond  the  boundedness 

of  real  parts  are  needed  to  get  convergence  as  a  -*  0*. 
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Abstract  The  paper  investigates  the  behaviour  S  cf  a  singular  2D 
jjstem  on  a  half  plane.'  Some  connection  between  the  matric es  ap- 
peuisj  in  thevnpdaticj  equations  and  the  restriction  of  5  to  the 
separation  sets  axe  presented. 

1  Introduction 

Consider  a  2D  system  given  by  the  fallowing  equation 

Ez{h + 1,4+1) = Ai(h,  £+ 1)  +  Rx(A  +  l.*j  (1) 

where  SO  R"  and  are  q  X.  n  matrices  with  entries  ia'R- 

Clearly,  if  rank  £  =  n,  (1)  can  be  reduced  to  the  equation  of  an 
unforced  noasinjulv  2D  system  (!}, «  follows 

x(h+i,k+ 1)  =  (Er’S)-'BrAHh,krl)  ,,, 

+(ST£)-'AJ'K(/i+l,t)  1  1 

U  rankfi  =  r  <  n,  we  are  allowed  to  introduce  two  nons:Dguhr  matri¬ 
ces  Q  0  R,Xf  and  N  G  R"xn,  such  that 

(3) 


«^=[o  o] 
=^]=a"'5,q7.v=[^;  £].«»-(*;  £]  w 


So,  letting 


equation  (1)  can  be  rewritten  as  follows 


So,  &e  power  series  X’  =  1^2%>x(i+j,s  -  j)CJ, 

f  “5j^'"  we  assodated  with  the  restrictions  of  the  signal  x  to  the 
taxation  jets  S',  i  =  0,1,. _ 

Let  denote  by  F~  and  C*  respectively  the  space*  of  polynomials 
in  £,£“*, A  and  of  formal  power  series  in  £  A"1,  with  coefficient* 

in  R*.  Introduce  in  F"  x  C*  a  condegenerate  bilinear  function  (v)« 
that  associate*  with  a  polynomial  p  =  p,,  &X*  in  f**  and 

a  series  X  =  £/ “  «>  2(* + i»>  - j)C  ^  in  C*  the  coefficient 

of  the  constant  term  in  the  Cauchy  product  pTX 


0>,-r)»  =  £  £  ffcO'.i)-' 

i* 0/*-^ 


P) 


Every  series  X  ia  C*  induces  a  linear  function  <py  on  A”,  defined  by 
<PX, P  ►— •  (p*X)„.  Moreover,  the  linear  mapping  that  associates  X 
with  the  linear  function  <p%  is  an  isomorphism  of  C  onto  the  space 
£[f"j  of  linear  functions  on  F*  and,  consequently,  each  series  in  Gn 
(or  ,  equivalently,  each  signal  z  .  X  —*  Rn)  can  be  identified  with 
an  element  of  the  algebraic  dual  space  £[P*j.  This  accounts  for  the 
possibility  of  expressing  many  features  of  signal  spaces  with  support 
in  X  in  terms  of  properties  of  suitable  subspaces  of 

Let  Af(£,£_1,A)  be  a  q  X  n  matrix  with  er tries  in  Rl^f*1,  A)  and 
consider  the  linear  mappings 


. Gn - 


'F1  :  P*- 
•C4  :  X- 


•  AP’K.r’.A)? 
-After1.®)* 


P) 


Here  o  is  the  shift  operator  in  Gl 


zx(h  +  ltk  + 1)  =  Anzi(ht  *+ 1)  +  l,i) 

+  riuxj(A,A*H)-f  Rij*j(A  +  I,h)  .. 

0  =  riji*i(h,h+ 1)  + BnZi{h-r.ljk) 

+  Ai\x *(A,  fc  + 1)  +  I,  A) 

In  the  particular  case  when  An,  Bn,  An,  are  rimuHajicduslyiero, 
xj  can  be  viewed  as  an  n  - »  dimensional  input  and  (5)  provides  the 
state  updating  equation  of  a  nonsingular  2D  system.  Mors'geserally, 
however,  xj  is"  the  direct  sum  of  exhogenoua  variables  (i.e«-  inputs), 
and  auxiliary  variables  that  induce  some  dynamical  constraints  ca  the 
system  trajectories,  and  (5)  can  be  considered  a  singular  2D  system, 
as  studied  in  (2). 

This  paper  constitutes  a  preliminary  report  on  a  research  still  in 
progress,  concerning  the  analytical  structure  of  the  trajectories  of  sys¬ 
tem  (5)  in  the  half  plane  X  =  {(h;Jb)  :  h  +  k  >  0).  No  “a  priori* 
assumption  b  made  on  which  components  of  xj  can  be  given  the  role 
of  exhogenous  variables.  Following  the  philosophy  that  underlies  the 
behavioural  approach  by  J.  Willems  and  P.  Rocha  (3-5j,  the  nature 
of  the  input  functions  is  determined  “a  posteriori",  after  establishing 
what  variables  are  constrained  by  equations  (5). 

2  An  algebraic  approach  via  duality 

All  signals  x  that  will  be  considered  in  this  paper  are  sequences  in¬ 
dexed  on  the  half  plane  A  and  taking  values  in  some  finite  dimens’onrJ 
R*vector  space  x  '.X  —  R" :  (h,k)  *—*  x(h,k).  The  single  step  up¬ 
dating  structure  (5)  mak«  it  convenient  to  introduce  a  partition  of  X 
into  a  countable  family  of  separation  sets  Sl  =  {(/i,k) :  h+Jt  = »),  c  = 
0,1,,..  and  to  associate  with  x  a  formal  power  series 

4  oo  4  oo 

*  =  £  £  rf+s, i-fle1*'1  (s) 

IxO/c-co 


E“(i+/,''-)lr'r  a  £«<«■+ !+r,i+ (9) 

and  p  and  p*  are  dual  mappings  (Sj,  as  {pp,X)n  =  (p,p*X)f  holds  for 
all  X  inC"  and  p  in  F*.  We  therefore  have 

lierji'  =  (im(i)A,  (10) 

where  imp  denotes  the  Aj-module  generated  by  the  columns 

of  the  matrix 

In  order  to  analyze  the  trajectories  of  system  (5),  we  introduce  the 
following  series 

+00  fco 

ii  =  EE  *<(f+i.*-j)r,A'‘.  i= 1.2  (h) 

uO/s-oo 

and  the  matrix 

mM-\  -An-Bni  -An-Bnd  (12> 

The  constraints  induced  on  x  by  equation  (5)  are  expressed  as 

«te®)[£]=o 

Therefore  the  behaviour  of  (5)  can  be  viewed  as  the  kernel  of  the 
linear  operator  p*  or,  alternatively,  as  the  orthogonal  subspace  to 
the  R[£,£~*,A]-module  At  geaerated  by  the  columns  of  the  matrix 
Afrte>): 

8  =  {(£]ec"  Afte®)[*]=o}  =  ke,M-  =  «i  (13) 

In  our  context  an  important  consequence  stems  directly  from  the  fact 
that  Gn  is  the  algebraic  dual  namely 

B1  =  (At1)1  =  At  (14) 
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Actually  (M)  abort  that  the  module:  M  b  uniquely  detrained  by  5, 
so  that  3  can be described as the kerne! of  someraatrix hi{£to) if and 
odj  if  the  colo^cs  of  both  A)  aad^fc  A)  generate  the  sari* 

Rlf.rS^J-r.iodale- 

The  duality  theory  provides  also  aa  useful  tool  for  analyzing  the 
restrictions  S^AJ  of  the  behaviour  3  to  the  set*  5°  U  Sl  U  U  Sl. 
This  b  easily  ate a  by  considering  the  linear  mappings 

*  -  p  f  ^  (  «) 

c"/»*c*  —  fr  —  w 


PrcimlljpIyiEj  both  Cj  Ci  by  .  i-jiUble  nsirocdulir  zutrix  U, 
ope  get s 


f*l 

r»*i 

UCo=  D0 

,  oj 

II 

o  o 

where  both  Do  and  £>,  hire  fell  row  rank.' Jest  rewriting  (18)  as 


DoX°-0 

DiX'sI&X0 


(20) 


where  F?  b  the  R[£,  ^“‘{-submodule  of  the  polynomial  columns  ip  F* 
baring  degree  less  than  or  equal  to  k  in  the  indeterminate  A,  GnfrG * 
is  (isomorphic  to)  the  Rfo^j-sabmodule  obtained  by  truncating  in 
each  series  of  Gn  all  terms  with  degree  greater  than  k  w.r.  to  A-1,  the 
maps  i  and  i  are  canonical  injections,  x  and  x  are  canonical  projections. 

Obviously  G*jokG*  b  isomorphic  with  the  space  C[F£)  of  lin¬ 
ear  functions  on  F£.  Moreover  F^/imp  is  isomorphic  with  a  direct 
complement  of  imp  in  F",  and  using  the  duality  theory  on  direct  de¬ 
compositions  [6]  gives  kerp*  =  (imp)1  —  £[F"/imp].  The  first  and 
the  last  space  on  the  second  row  of  (15)  can  be  viewed  aa  the  algebraic 
duals  of  the  corresponding  spaces  on  the  first  row  and  the  maps  7oi, 
x  oi  in  (15)  are  dual  linear  maps  w  r.  to  the  bilinear  function  induced 
on  the  pairs  (FT,  Gr%/okG")  and  ( ,  kerp*).  Consequently  the 
restriction  b  given  by 

gjOAj  ~  _  ker{7o  i)A  (16) 


we  easily  see  that  all  solutions  of  equation  (20.1)  can  be  viewed  as 
restrict ioosof  admissible  trajectories  to  the  separation  set  S°.  In  fact 
D\  has  full  row  rank,  and,  therefore,  given  any  Z°,  eq.  (20.2)  can  be 
fulfilled  by  suitably  chosen  values  of  X1. 

We  are  now  in  a  position  for  establishing  the  following 

Theorem  A  signal  X  =  X'A’*  belongs  to  3  if  and  only  if  X* 

satisfy  the  following  equations 


DoX°-0 


PROOP  Suppose  that'X  satisfies  (2l).  Then  we  have 

Cf,J!(0[irti]=0.  *=0.1....  (22) 


The  above  relation  characterizes  oi°-43  u  the  subspace  of  all  signals 
with  support  in  S°uSxU---USk  and  values  in  Rn  that  correspond  to 
formal  power  series  XiA~i  satbfying  the  orthogonahty  condition 


which  implies 

r  +  A-^eSM  *  =  0,1,...  (23) 


<£c(«)V .  £r'.v-(>.=(*»(f)  ?,«) ...  .,(£)] 

»*0  <»0 


X° 

X1 

xk 


=  0  (17) 


for  all  polynomial  vectois  £,k,0  e,(()A'  in  the  R[£,  f  x,  A  j- module  imp. 

The  }-*ubmoduIe  of  Rx*'^k+x)(£,£“xj  whose  elements  are 

ibe  rows  |co(0  ci(£)  •  -  •  ct{()}  that  satisfy  the  condition  £,k,,  c,($)A< 
imp  is  finitely  generated.  Therefore  there  cxbts  a  polynomial  matrix 
with  n(*  + 1)  columns  such  that  fiP-k)  -  kerC^tf)- 
Jn  the  next  section  we  shall  take  advantage  of  the  particular  struc¬ 
ture  of  given  by  (12),  when  detcrmmg  the  R{£,  £_1  j-submod  u  1  e 


3  Computation  of  trajectories 

The  following  lemma  directly  provides  a  matrix  C&XJ(£)  whose  rows 
are  given  in  terms  of  submatrices  A,j  and  Ii{j  that  appear  in  the 
partition  (12)  The  proof  is  based  on  Cayley-Ifamilton  theorem  and 
can  be  found  in  (7), 


TLo  degree  of  all  columns  in  M7^,  A)  w.r.to  A  b  less  than  or  equal 
to  one.  So,  any  such  column  can  be  written  as  e<>(£)  +  Aej({)  and  we 
have  ,  . 

(Mf)+^i(«)P¥,r). 

=<(«(<)  +  Ac, (£))«,  X‘  +  X-'X' ’«  +  ...)„  (24) 

=(M£)  +  Ac, ({))?,  X'  +  A-'r“)„  =  0 

as  a  consequence  of  (16)  and  (23)  This  shows  that  X  is  orthogonal  to 
imp  and  therefore  X  €  3.  The  converse  b  obvious. 

Equations  (21)  provide  a  recursive  procedure  for  generating  the 
system  trajectories.  Moreover,  the  difference  n  -  rank  |  j  gives  the 
number  of  free  variables  that  appear  in  system  (5),  i.e.  the  variables 
that  can  be  arbitrarily  chosen  on  all  separation  sets  Sf. 
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Abstract 

The  t«<HKmensi°aal  implicit  Roesser  model  is  considered  from 
a  geometric  point  of  vSw.  Its  (A,E,B)  and  (E,A,B>.invumut 
subspaces  are  related  to  the  existence  of  solutions  within  a 
Certain  subspace.  The  (A.E.BJ-icrariant  subspace  oi  the  dual 
S)-Stem  is  utilized  in  designing  on  asymptotic  unknown-input 
observer.  An  illustrative  example  is  included. 

1  Introduction 

The  importance  of  the  two-dimensional  (2-D)  implicit  mod¬ 
els  is  due  to  their  multidirectional  dynamic  structure  as  well 
as  their  ability  to  express  algebraic  relationships  among  the 
system  semis  lutes.  They  can  model  systems  with  boundary 
conditrons  specified  all  around  the  domain  of  interest,  such 
as  systems  modelled  by  the  elliptic  cquatiou.  This  cannot  be 
accomplished  by  the  2-D  state-space  models. 

We  consider  the  implicit  Roesser  model  (IRM)  [1]  from 
a  geometric  point  of  view.  This  point  of  view  allows  one  to 
represent  infinitely  many  system  behaviors  in  a  very  efficient 
way.  It  also  allows  one  to  characterize  the  boundary  condi 
tions  which  arc  projections  of  a  solytiv*.  onto  the  boundaries 
of  the  domain  of  interest. 

In  the  next  section  wc  briefly  present  the  semlstatc  prop¬ 
agation  mechanism  of  the  IRM.  In  Section  3.wc  design  an 
asymptotic  observer  which  uses  only  the  outputs,  and  not  the 
inputs,  of  the  actual  system  to  reconstruct  the  semistate.  We 
include  an  illustrative  example. 

2  Dynamics  of  the  Implicit  Roesser 
Model 

Consider  the  implicit  Roesser  model 


where  xh  €  R"1,  x9  €  R"2,  and  u  €  Rm  are  the  horizontal 
semistate,  vertical  semistate,  and  input  vectors  respectively. 
The  constant  coefficient  matrices  E,  A,  and  D  arc  of  appropri¬ 
ate  dimensions.  Wc  assume  neither  invcrtibihty  nor  square¬ 
ness  of  any  of  them.  Therefore,  solution  to  (1)  may  or  may 
not  exist. 

Denote  {0,1,...,#  -  1}  by  H.  Wc  say  a  semistate  se¬ 
quence  (x/j)  is  a  solution  if  it  satisfies  (1)  in  the  domain 
{(*»  j)I»  €  M  and  j  €  M  }. 

Define  Ei  .=  E  diag{ 7*1,0}  and  £?  •=  E  diagfO,/**}, 
and  Ai  likewise.  To  characterize  the  semistate  propagation  of 
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IRM  wc  need  to  define  certain  invariant  subspaces  and  some 
subspaccs  derived  from  them.  We  first  consider  the  forward 
propagation  case. 

Define  the  subspace  V  c  R*  as  a  (2-D)  (A, B,B)- (controlled) - 
invariant  subspace  for  (J)  if  it  satisfies 

The  horizontal  and  vertical  (A,E,  D) -invariant  sub  spaces  of  (1) 
V*  :=  {x*  £  r.”'|  |  ^  ]  e  V  for  some  w  6  R“2} 

and 

V'  :=  {x'  e  R”2|  [  j.  J  e  V  for  some  y  €  R"1} 

are  useful  in  describing  the  forward  propagation  mechanism 
of  the  IRM: 

Theorem  1,  (5)  Let  V*  and  V'  be  a  pair  of  horizontal  and 
vertical  (A,E,B)-invariant  subspaces  for  (1).  If  xjfi  €  V'  Vi  <E 
M  and  €  VA  V;  €  M  then  there  exist  a  solution  for  (1) 
such  that  the  horizontal  and  vertical  semistates  of  the  solution 
are  contained  in  Vh  and  V*  respectively.  D 

Using  this  theorem  one  can  test  each  semistate  on  the  left 
and  bottom  boundaries  of  the  domain  of.  interest  to  decide 
whether  it  gives  rise  to  a  solution. 

For  the  backward  propagation  case  define  a  (2-D)  (E,A,D)- 
(controllcd)-invariant  subspace S  for  the  IRM  by  replacing  Au 

Ai,  and  E  in  (2)  by  Ei,  &i,  and  A  respectively.  Then  the  hor¬ 
izontal  and  vertical  (J5,A,D)-invariant  subspace s 

S‘:=  {x'€R"'||^j  SS  for  some  u>eR“2} 

and 

S'  :=  {r'  6  Rn2|  |  j,(=  S  for  some  y  6  R"1) 

describe  the  backward  propagation  of  the  semistates,  as  for¬ 
malized  by  the  next  theorem. 

Theorem  2.  [5],  Let  S*  and  S*  be  a  pair  of  horizontal  and 
vertical  (E,A,B)-mvariant  subspaces  for  (1).  If  the  boundary 
conditions  x'iS  e  S*  Vi  €  M  and  €  S*  Vj  €  N  then  there 
exists  a  solution  for  (1)  with  these  boundary  conditions  such 
that  each  semistate  in  the  solution  is  contained  in  S.  O 
Note  that  the  conclusion  of  Theorem  2  is  stronger  than 
that  of  Theorem  1. 

Theorem  3.  [5}  Let  V\  V*,  S\  and  S*  be  defined  as  in  the 
preceeding  theorems.  If  the  internal  semistates  x*^  6  S*  f) 
V"  V»  6  1  and  xfj  €  S*  fl  V*  Vj  €  k  then  there  exists  a 
solution  with  these  internal  semistates  such  that  horizontal 
and  vertical  semistates  are  contained  in  SAfi  V*  and  S*n  V* 
respectively.  O 
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3  Unknown-Input  Observer  Design 

la  this  section  we  design  an  unknown-input  observer  for  the 
IBM  Uiicg  geometric  terms.  The  terminology  unknovn-input 
refers  to  the  fact  that  the  observer  does  not  use  the  input 
but  only. -he  outputs  m  reconstructing, the  semistate  x,’j. 
Thus,  Uij  could  be  an  unknown  disturbance 'acting  on  (1). 

The  existence  of  the  observer  - ill  formulated  using  the 
parameters  of  the  dual  system.  Bythe  dual  system  to  (1)  wc 
mean  the  system  with  the  coefficient  matrices  (£,.A|,./12,  B,  C) 
replaced  by  (E7*,  Ajt  Aj,  C7,  BT). 

*\Ve  call  an  (A,E,B)-invariant  subspace  for  (1)  contained  in 
Ker  C  an  output  nulling  (A, E,B) -invariant  subspace  for  (1). 
Let  V  be  an  output-nulling  (A,E,B)-invariant  subspace  of  the 
dual  system.  Then  using  the  Wonham  techniques  [6],  one  can 
show 

T o[Ai  A l,  ,){[S5]nK.[*  °c  o]j  (3) 

holds  with  T  an  orthogonal  complement  of  V.  We  call  T  a 
conditioned-invariant  subspace  of  (1)  (2). 

Select  any  Q  of  full  row  rank  such  that  KcrQ  .=  T.  Then 

(3)  is  equivalent  to  the  existence  of  matrices  Tj,  r2l  A|,  and 
A2  such  that  the  2-D  Lyapunov  equation 

q\ai  a2  b\  =  \ SiQB  r tQB  o]+[a xC  AjC  o] 

W 

holds. 

The  Lyapunov  equation  (-1)  yields  a  2-D  state-space  ob¬ 
server  for  (1)  if  the  2-D  state-space  triple  (/,  rlf  P2)  satisfying 

(4)  is  asymptotically  stable  in  the  standard  2-D  state- variable 
sense  (3): 

Theorem  4.  (6]  Let  Q  satisfy  (4),  so  that  T  =  Ker(Q),  with 
Q  a  full-row-rank  solution  to  (4)  for  some  Tj,  P2,  Aj,  and 
A2.  If  the  triple  (I,  Pj,  T2)  is  asymptotically  stable,  then  the 
state-space  system 


-f  +Aiy,'j4.i, -fA2y.il  j-  (5) 

estimates  QEx jj  with  decreasing  error  as  (1  j)  moves  away 
from  the  initial  boundary.  □ 

Note  that  if  T  =  0  and  the  corresponding  Lyapunov  equa¬ 
tion  is  stable,  then  the  entire  external  variable  Extj  can  be 
reconstructed.  Therefore,  we  may  alternatively  call  T  the 
tmfaiotwwnput  unobservable  subspace.  Note  also  that  the  ob¬ 
server  is  a  2-D  state-space  system,  not  an  implicit  system. 
Example 

Consider  the  system  modelled  by 


5  2  0 
0  1  ] 
0  0  4 


|+ij 

*I»i 


2  4  0 

r  j  1 

o' 

= 

0  0  .3 

y  + 

1 

0  .5  1 

i*i< 

0 

(0) 


with  nt  =  2  and  n2  s=  2.  Wc  wish  to  design  an  asymptotic 
observer  for  the  states  that  are  not  seen  through  the  output 
equation  directly. 

Wc  use  the  shifted  representation  of  Equation  (6)  (sec  [6}) 


Ex.+ijii  =  Aix.j+t  +  A2f,ijj-f  Bu,Jt 

which  is  obtained  by  the  transformation  x*y  =  |  J  -  Ob¬ 

serving  that  span[.0.  1  0  j*  is  a  conditioned-invariant  sub¬ 
space  for  (6),  one  may  select* 

to  obseive  QEx.j  =  j  j .  A  solution  to  the  Lya- 

punov  equation  yields  the  unknown-input  state-space  observer 

-.Ho.l  =  [  0  o]I,Jtl+[o  .25  ]  ',tlJ  +  [  3.5  ] 

The  error  equation 


Cifij+i  --  j+t  —  lj+l 

=  [  0  S]«W+[o  .25  ]*■*'•> 

is  stable,  so  that  the  error  goes  to  zero  as  (1  j)  moves  away  from 
the  initial  boundary  The  solution  is  completed  by  shifting  x 
back  to  x. 

For  some  experimental  inputs  and  boundary  conditions 
the  true  and  observed  semistates,  and  the  errors  are  given  in 
Fig.  1  -  Fig.  C.  0 
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REGULARITY  AND  REGULAR  I Z  ABILITY  OF  SINGULAR  LINEAR  SYSTEMS 
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Politechnika  Warszawska 
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Abstract.  New  conditions  for  regularity  of 
singular  linear  system  are  presented..  Then, 
regularizability  problem  for  the  system  is 
considered.  New  conditions  for  system 
regularizability  are  given* 


Proof . 

Since  system  (1)  is  regular  iff  is 
the  condition  for  system  regularity 
matrix  F(N)  has  full  rank  for  all 
where 


pvi 

-A  E  0  0  ...  0  0* 

I.  INTRODUCTION 

h 

0  -A  E  0 

F(N)“ 

» 

:  ■  .  : 

Linear  discrete-time  time-invariant  system 
can  be  described  by  the  following  equations: 

6  ...  0  -A  E 

solvable 
is  that 
N,  (7J, 


*((N+l)*n) 


Ex(k+1)-Ax(k)+Bu(k)  ... 

y(k)-Cx(k)+Du(k) 

where  x«Rn.  u«Rm,  y«R^  and  E,  A.  B,  C  and  D 
are  real  matrices  of  appropriate  dimensions 
If  £*I  the  system  is  named  implicit  or 
descriptor.  161 ,  and  in  the  consequence  x  is 
defined  as  a  descriptor  vector,  u  is  an  input 
vector  and  y  is  an  output  vector.  If  E  is 
nonsquare  or  det  A-0  the  system  is  called 
singular. 

Implicit  and  singular  systems  were  considered 
in  many  papers,  e.g.  (1.3,41.  It  follows  from 
these  papers  that  one  of  (  the  most  important 
and  fundamental  system  properties  is 
regularity. 


The  matrix  F(N)  has  les3  rows  than  columns. 
Therefore,  it  doesn't  have  a  full  rank  iff  its 
rows  are  linearly,  dependent.  From  the 
structure  of  matrix  F(N)  it  follows  that  its 
rows  are  linearly  independent  iff.  there  are 
linearly  independent: 

(i)  rows  of  Fj 

(ii)  rows  of  Fj  and  FJ+1.  and 
(ill)  rows  of  Fj  and  F^. 

This  is,  however,  equivalent  to  the  given 
condition  (ii). 

Condition  (i)  is  simply  rewritten  from  the 
definition  and  has  not  to  be  proven. 

~  o 


In  this  note  system  regularity  and  regulariz- 
ability  are  considered.  New  conditions  for  the 
existence  of  these  properties  are  presented. 


II.  THE  MAIN  RESULT 


A  matrix  pencil  (A-sI)  which  is  square  and 
does  not  vanish  identically  is  termed  regular 
or  nonsingular  (2,9).  Consequently.  the 
following  definition  was  introduced. 


Rg.HiU.Upn  U  ,  J  , 
System  (1)  is  regular  iff  (i)  matrices  E  and  A 
are  square,  and  (ii)  there  exists  sQ  such  that 

det  (Es  -A)*0, 


The  main  feature  of  a  regular  singular  system 
(17  is  that  Hie  system  is  solvable  and 
conditionable  (71.  Moreover,  almost  all  the 
known  results  related  to  singular  system 
dynamics  depend  explicitly  on  the  assumption 
of  system  regularity  (5) . 


Unfortunately,  assumption  of  regularity  is  not 
preserved  under  linear  feedback  Indeed, 
assuming  system  (1)  is  regular,  in  most  cases 
it  is  quite  easy  to  construct  feedback  u-Kx+v 
such  that  the  closed  loop  singular  system  is 
nonregular.  It  was  the  motivation  for 
introducing  the  regularizability  notion  in 
(81 .  Here  this  notion  is  adapted  for  discrete¬ 
time  system: 


DefiniHpn  , 

System  (1)  is  requlanzable  iff  (i)  matrices  E 
and  A  are  square,  and  (ii)  there  exist  KeRm*n 
and  sQ  such  that  det(E30-(A+BK) )*0. 


The  sense  of  the  above  is  that  there  exists 
proportional-feedback  such  that  the  closed 
loop  singular  system  is  regular.  It  is  clear 
that  thi3  property  is  invariant  with  respect 
to  feedback. 


Then,  the  following  theorem  can  be  proven. 


Now,  the  following  theorem  can  be  formulated. 
Theorem 

System  (1)  is  regular  ifX  < i )  dim  E-n*n,  and 


Thsprsn  lx. 

System  (I) 
(ii) 


(2) 
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i3  regularizable  (i)  E«Rn*n 
-A  E  0  b]"2" 


and 


Proof 

Condition  (1)  is  evident,  it  follows  from  the 
definition. 


It  is  easy  to  see  that  always: 


rank 


["•o’ 


L-BK 


E  °W„nvr-A-BK  E  0  B  Ol 
-A-BK  Ej-rank[  0  -A-BK  E  0  Bj. 


rank[  0  -A  E  0  bJ 


Hence,  the  theorem  1  implies  necessity  of 
condition  (ii).  On  the  other  hand,  it  is  also 

clear  that  always  exists  K^cR®  n  such  that 


rank[-*’BK<> 


E 

-A-BK 

o 


E  0B01 
-A-BKQ  E  0  BJ 


-rank['J 


E  O'B  ,01 
-A  E  0  BJ 


This  completes  the  proof  of  sufficiency  of 
condition  (ii). 


a 


Remark 

From  the  proof  of  theorem  it  is  rather  clear 
that  condition  (ii)  is  necessary  and 
sufficient  for  .the  existence  of  matrices 

F,K«Rm*n  such  that  dot  ( (E-BF).-i-(A*BK)  1*0. 

a 


III.  CONCLUDING  REMARKS 

New  conditions  for  regularity  and  regulariz- 
ability  of  singular  system  has  been  derived. 
The  conditions  are  very  easy  and  simple  for 
U3e,  based  on  theorem  1  the  regularity  of 
singular  system  can  be  verified  simpler  than 
using  shuffle  algorithm  proposed  by  Luenberger 
in  (71. 

Finally,  it  should  be  emphasized  that  the 
conditions  apply' for  discrete-  as  well  for 
continuous-time  system.  It  follows  from  remark 
to  theorem  2  and  fact  that  conditions  for 
system  regularity  are  the  same  for  conti nuous- 
and  discrete-timo  systems.  (5J. 
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INVERTXBILITY  OF  SINGULAR  2-D  LINEAR  SYSTEMS 


Tadeusz  KACZOREK 

Accadeaia  Pblacca  delle  Scienze 

2,Vicolo  Do  na,COl87’Ro=a,  Italy 

Abstract-Two  approaches* for  finding  a  whole  class  of 
inverse  systeas  for  a  given  singular  2-D  linear  systea 
are  presented .The  first  approach  can  be  applied  to  regu¬ 
lar-  2-D  linear  systecs  with'  transfer  satrices  of  :full 
row  ranks .The  second  approach  can  be  also  applied  to 
nonregular  singular  2-D  linear  systecs. 

I. INTRODUCTION 

The  problea  of  inverting. linear  and  nonlinear  cultiva- 
riable  systeas  has  been  under  investigation  for  a  long 
tioe  10,123. The  first  coaplete  solution  of  the 

problea  for  regular  linear  systeas  was  given  by  Silver- 
can  Cl ll-Moylanjj?} has  considered  the  question  of  stabili¬ 
ty  of  the  inverse  systea  and  he  has  codified  the  struc¬ 
ture  algorithn  by  introducing  output  and  state-space 
transformations. The  inversion  of  singular  1-D  linear 
systecs  has  been  considered  intl,2f5f12l.Lewis![12]and 
Beauchasp  Cljhave  used  the  singular  systea  structure 
algoritha  to  construct  an  inverse  systea  for  1-D  singu¬ 
lar  systecs. 

The  cain  purpose  of  this  paper  is  to  present  two  appro¬ 
ach  for  finding  a  whole  class  of  inverse  systecs  for  a 
given  singular  2-D  linear  systea. 

XI. DEFINITION  OF  INVERSE  SYSTEM 


Consider  the  general  singular  aodel  of  2-D  linear  sys- 

tessEJ 

where  i,J  are  integer-valued  horizontal  and  vertical  co¬ 
ordinates,  respectively ,x,  .G.Ir  is  tho  local  secistate 
vector  at  the  point(i, jUju.  e.JT  is  the  input, y.,^!^ 
is  the  output  and  E.A^fll  & 

Rp  ,k»o,1,2  ore  realKcatrlces.It  is  assumed  that  KE 
is  singular, i.e.  det  E  *0  when  q*n. 

Let  us  consider  an  onther  singular  2-D  linear  system  of 
the  fora 

“i  j°Hoxi  ,3*1  ® 

where  x<  .SR11  is  the  local  set  tat*  vector, u.  .and  y. . 
are  the1^  as  inO)and  E.FjSk*  ** 

Rsx^>,k*o,%2  ore  real  catrices. 

The  sy8tea(2)is  called  an  inverse  systea  forO)if  the 
output  y . ,  of  (file  the  input  fort2)and  the  input  u..  of 
<ttis  the  Output  tor(£).  ** 

III. REGULAR  SINGULAR  2-D  LINEAR  SYSTEMS 
The  syeteaCOis  called  regular  if  and  only  if 


detOlz^-A^-A^z^-A^Zg*]/  0  for  sose  z^z^G. C  O) 

where  C  is  the  field  of  coaplex  nuabers, 

IfC)holds,then  the  transfer  catrlx  of  Cj)is  given  by 

♦b2*2)+  VVt’Va 

nak  Ttf^z^r,  P  for  sons  ZyZ^C  (5) 

then  there  exists  a  right  inverse  TD  « 
given  by  00  R  R  1  2 

TR.xtCretJ1»(ia-TttTPt]'',,r)2  (6j 

where -the  upper  index  t  denotes  the  transposition,!  is 
the  exa  identity  catrix  and  Z  is  an  arbitrary  exp  matrix 
Solving  the  equation  Y(z..,z.)=TU(p.,z_)with  respect  to 
°tz1,z2^w*  obtain  t^h^re  D(z.,z^, 

Y(Zi,Z-)are  the  2-D  Z  transforms  of  ^u.-and  y, ..respec- 
tivelyTUsing  one  of  the  realization  theory  »etHods|J0 


for(6)we  say  find;catrices  Jj_,,k=o,1,2  of(2)  . 

Note  that  the  catrices  depend  on  Z  which  is  arbitrary. 
Therefore, we  have  shown  the  following 
Theorea  1. 

If  the  systeaO)is  regular  and  its  transfer  matrix CW 
has  full  row  rank, the  there  exists  a  whole  class  of 
inverse  systeas  which  can  be  found' by  the  use  of  the 
realization  theory  methods. 

Exaaple  1. 

Find  an  inverse  systea  to O) with 

p  1  o]  to  o  o]  Pi  0  o]  (5  o  i"T  15  o] 

£=  0  0  0  ,A .  0  0-1  ,A  =  0  1  0  ,A,=  0  0  0  ,B  =  0  0  , 

bo  oj  °  lo  0  oj  1  Lo  0  o]  2  Lo  0  ij  0  b-ij 
fo  0 

B2=M  0  ,B1=  0  ,C0*fb  1  0],  Ct=C2=0,Dk=0  for  k*o,1,2 
It  is  easy  to  check- that  the  systea  is  regular  and 

t-iVVVrVsL  ’■  ~r~  ® 2  -»J*f  J 

Z1X2  L® 


*■12  o  "1*1  2  "  12  3 

Z1x2  L°  0  -4J 

The  transfer  aatrix(*»)of  the  system  is 

t'-T-Wl 

Z1Z2 

and  its  right  inverse  is  given  by 

1  a 

uz2 

where  p^and  p.  are  arbitrary. 

Using  the  oetKod  given  in [3] we  obtain  a  realization  of 
(?)ln  the  fora  _  _  « 

0010“  00001  1-1  000  [o  0  0  0* 

P  0000  p  OOIOL  0000  p  00  0-1 

o  o  o  o  •  o  ooo  ori*  poo-i  *  a**  0100  • 

3  0  0  oj  to  1  0  oj  Lo  0  0  oj  0  0  0  oj 
<40  0  0  o]  _  pi  0  _poi°l  p°o  o’) 

0  ,0  rt,  ,  o' 12  1  0-2>,J'  1  boo  oj,Ha  b  0  Pj  oj- 

0  2  ’  Jk«0  for  k*o,1,2. 

IV, GENERAL  APPROACH 
Using  the  catrices 

Ai.CA0,A,.A23,B:=tBo,B,,B2l,C:L<:o.C,.C^,D;.tDo,t1,Dp 

and  the  vectors 

4j!’,LxiJ’ItHl,4’xMiiT,4j!”^li,“i+i,4’ui,3.i1 

we  nay  write  (Jinn  the  form 

*1.1, 4*1  *  A^4  *  BHd3  ^ 

yi4"  c-iJ  *  ^4 

Let  rank  E  »r<.q.Then  there  exists  a  nonsingular  catrix 
M&Rq  q  such  that  Ig*] 

where  E.G Rrxn  has  full  row  rank, 

Presultxplyxng(£a)by  M  and  using(5>)we  obtain 
Eixi.l,4»i  =Ii*i4  *  ^4 

0  =**4  4  S*4  QOb) 

■  ;;?>■»  '¥”*>■» 
The  equations  (£b)  and  C)  Ob)  cay  be  written  as 


piji  pi  p] 

0  ^  [e?J" 
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Then  there  exists  a  nonsingular  matrix 
.  Pii  si?l .  si^R  rixp 


SR0>«I-ri?$«l-r) 


where  C^R*!  ^  has  full  row  rank. 

Presulti plying  Ql)by  S  and  using  (l2)vc-  obtain 

S21yi 

where , _ , 

n.',n  t^x3B  , 

CzJ  U  52^stp^-r-ri'«” 

A  right  inverse  0.  o£  B  is  given  byOQ 

DseBt05B‘]-1eCVB’SCt^1ti)P 

where  P  is  an  arbitrary  3=xn..  natrix. 
ProsQjaVwe  have  * 

Kete  that  (JOa^aniCyb) can  be  written  as 

N*..,  ,.,>P4..,*rsi]u.,  +  r°  I-.. 


Ef1—  ”  H  *  &}* + PJy‘3  **> 

SibstituMon  of  (vjltato  (36  )yieMs 

fron  coeparison  of  (t7ynd(2)v«  have  J 

.q'.ptq-r,.  L  2  J  L  21  J  3 

DS  “tDRi>DE2,DRjl’  ■!)Kt^R'!Xri  for  k*<>'1'2 
Ihen  free  OS') we  got 

uij  '-\Ms  'Vn^j  Q9) 

and  froa  cospari3on  of  09)and(J)we  have 

»i*U<#.81,S^«-!>r151  .VVlI’W0  Q») 

Therefore  ve  have  proved  the  following 
Theoroa  2. 

If  rJ>0,then  there  exists  fortl)an  inverse  systca(2)with 
mtrices  given  by  Q8)ond(20j. 

If  r.>0,thcn  aatrices  of(£)can  be  found  by  the  use  of 

the  following 

Procedure: 

Step  1. Find  M  satisfying(?Wd 
_S_tep_2.Flnd  S  satisfying  Q2)and  B.CJ.C^.Do  r 
Ste£_J.t!aingQ8)and(pD)find  ofid  KJk  for  k»o,1,2. 

Sxaanle  2, 

Find  an  inverse  eystea  to Oi with 

J  1  ol  p  o  ol  p  o  o’)  fo  t) 

Si  b  0  0  ,A  =0,A  S  0  0  0  ,A  =  p  0  1  ,B  =  1  0  ,B.=8,=0, 

|  booj  2  bool  °  |co|  1  %  ' 

Co|  1  oj’W0,  Bo=j?  i].  »t*V  0 

It  i©  easy  to  check  that  the  condition (3 Hs  not  satisfied 
for(2l>Iherefore,the  first  approach  can  not  be  applied, 
using  the  Procedure  ve  obtain: 

St®2JL*^«  eatrix  M  satisfying (?)has  the  fora 

M  (0  0  0  1  0  00  0  oj  6 1  [010000I 

=  ka  * \oo_ 0  00  0  o_o_o  J1  «  00  0  0  0  0 

LM  ® 0  0  0  0  0  0  0  ’Jfel  b  0  0-0-5  ol 

SteEjvrha  eatrir  s  satisfying  (ja)has  tho  form 

•tiill-'-Bsssss]. 


fsl  rci  fo  0  0  °  0  0  0  0  ft  , . 

L1=s  -  3  IP- 1  £>Jl  2  JLQ.  P.o)  (23) 

«y  iy  u  0  0  0  0  c  0  0  0) 

Taking  into  account  that  EB*.^  froa  (j4)we  obtain 
Dg  -  I  *2|  ,P2  ls  arbitrary  <tx2  aatrix  <Jlf} 

Step  j.nsing  (l9,@4^J)ond(2h)wo  obtain 

■  jo  0  0 Jo  0  0;0  0  ol,G  »PlDES1llj  0  0  ,0  e(3 .0  , 

(1  0  010  0  olo  0  oj  L  -s21  j  ii  ol  1  2 

i?  SS  0  oio  it' Wi,|  °] 

J^cJ.cO 

Hence  the  desired  inverse  systea  has  the  fora 

foion  fo-1 0]  pool  foil 

1 0  Ip*1’1*1  I?  0  oJXiJt  0  of13 

uU'[o-i  o]*iJ  *  [0  0  oJxij.i*  [0  ?]yi3 

With  slight  codifications  both  approaches  can  be  exten¬ 
ded  for  n-D/n>2/  singular  linear  systems. 
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Abstract  -  The  general  singular  2-D  linear 
systems  with  variable  coefficients  are  con- 
sidered.The  concept  of  coaplete  controllabi¬ 
lity  is  extended  for  the  singular  model  and 
variable  coefficients. Using  the;  general  res¬ 
ponse  formula  necessary  and  sufficient  condi¬ 
tion  for  complete  controllability  in  a  given 
rectangular  is  established. 

I.  SYSTEM  DESCRIPTIOH 


?0-+1*3+l)G±-pll*l-q+1  ”  + 

for  1  /  p  and/or  j  /  q  • 

Using, the  transition  matrix  G^  the  exact 
fomula  for  the  solution  of  the  system  /I/ 
with  admissible  boundary,  conditions  /2/  can 
be  derived  /see  the  paper  [4]  for  details/. 
II.  BASIC  B3PIKXTI0HS 


Let  us  consider  the  general  model  of  sin¬ 
gular  linear  2-D  system  with  variable  coeffi¬ 
cients  given  by  the  following  equation 

B(l+1,J+1)x(i+1tJ+1)  -  A0(l.d)x(l.J) 

+  A1(l+1fj)x(i+1,j)  +  A2(i,3+1)x(i,3+l)  + 

B(i,3)u(i,j)  /I/ 

where  ifj  are  integer- valued  vertical  and  ho¬ 
rizontal  coordinates ,res-pectivelyf  x (i,3)6H* 
is  the  local  semistate  vector  at  (i,j),  u(i,j) 
£Rn  is  the  input  vector.and  Ak(i,j),k*1#2f0f 
B(i,j)  are  real  matrices  of  appropriate  dimen¬ 
sions  with  entries  depending  on  i  and  j.The 
special  feature  of  the  model  /I/  is  that  the 
matrix  B(i,j)  may  be  singularj£2»3j[3])(’fi] , 
Boundary  conditions  for  /I/  are  given  by 
*(.Ui0)  »  *jyo  .  i*i0,i0+1,i0+2,...  ^ 

x (*o * 3)  “  xigj  ’  J-WW2’— 

for  i0,30  -  0,1,2,...  ,  ehere  and  xi  ^ 
are  known  vectors. 

It  is  assumed  that  the  system  /I/  is  solvable 
and  boundary  conditions  are  admissible  C23#C*fls 
The  transition  matrix  for  system  /I/  is 
defined  aa  follows  [<0: 

E(i+i.^i)°^;i!3-q+1  -*<>  m< “oo  ^ 

+  A1(pt1,q)G^*q  +A2(p,q+l)Gg;f1  +I 

for  i  »  p  ,  1  .  q 

B(i+1,J+l)G^;^^+1  .0  M 

for  p^i+n^+1  and  q^j*n2+1  ,  n^n  »  »2^n 


How  we  shall  introduce  the  concept  of  com¬ 
plete  controllability  of  the  system  /I/. 

Definition.  The  system  /I/  is  said  to  be 
completely  controllable  In  the  rectangular 
)»(r»8)]^  for  any  admissible  boundary 
conditions  72/  and  every  vectors  ^ » 

1  »jQ+l i Jq+2 ,  .  «  » ,s—1  R  #k*i0+1  fiQ+2,... 

•  ,  x  g6Bn  , there  exists  a  sequence  of 

input  vectors  for  (l0*  jQ)^  (i, j)  ^ 

(r«i|  ,  a+ng)  auch  that 

xfr.l)  »  xrl  for  l*30+1td0+2,...ts-1 
x(k,s)  -  xkg  for  k*tQ+1  *iQ+2 ,  ♦  .  .  ,r-1  /4/ 


x(r'3)  »  xra 

It  is  easily  to  verify,that  complete  con-» 
trollability  in  a  given  rectangular  implies 
local  controllability  in  the  same  rectangular. 
The  converse  statement  i&  not  always  true  [Vjfc] 
In  order  to  formulate  the  necessary  and 
sufficient  condition  for  complete  controllabi¬ 
lity  let  us  introduce  the  following  notations: 


“  [^.4V^)H^.wos0»+1,1« 


Qrj  ore  a  x  n(r+n1-i<j)-dlnensioml  natrioeo 
for  3  ■>  30,30+1,...,8tn2-1. 

In  a  quite  sinilar  wiy  we  may  define  the  oatrl- 
ceo  Qpq  ,for  p  =  l()+1,i0+2,...,r  and  q»30,30+1» 
30+2 . »-1. 
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— K-i?,34,+i-s3(1o-s^;  °i—i°]  /«/ 

HI.  C0S2E0UABILIK  COIDITIOSi 


Using  aatrlces  and  we  nay  define 

the  so.  called  coaplete  controllability  na- 

trix  wf’3*  fcr  the -system  /I/  and  riven 
0,J0 

rectangular  ftt0,30) ,  (r,a)]  . 


^0  ° 

^O  qr.30+1 

Qr,J0  Qr,j0+1 
_r  ,s  ** 

i0.30  "  Qi0+1,30  ql0+1 »30+1 

qio+2.30  qi0+2,3o+1 

**  Qr,J0+1 

...  0  0 

...  0  0 

5ri3„2_, 

Ql0+1,a«2-2  q1()+i  ,0+a2-1 

IV  IV 

•**  9i0+2,s«2-2  Qi0+1l3tJ2-1 

9r,3+n2-2  ^r.atn^-l 

WV30  is  »f«a-1-l0-J0)r  tt(r«t,l0). 

•(8*“j-30)  "dimensional  constant,  i^atrix. 


Theorem.  She  dyn,'-nical  system  /I/  is  complete¬ 
ly  controllable  in  the  rectangular  £(i  ,3  ), 
,(r,s)]  if  end  <>nly  if 

rani  ,r?s3°  "  “(w’-’-VV  /8/ 

Proof.  We  shall  present  only  the  sketch -of  the 
proof.  Usings  the  general  response  foraolawe 
aay  obtain  the  relations  between 'the 'Vectors 
/4/  and. the  control  sequence, in  the  recta  ngu- 
lar  [(V^o)*  (r+ni  »  s+n2)3  .2his  relation  is 
siren  by  the  coaplete- controllability  mtrlx. 
Hence.the  full  row  rank  of  the  coaplete  con¬ 
trollability  matrix  is  the  necessary  and  suf¬ 
ficient  condition  of  the  existence  of  control 
sequence, which  steers  the  dynamical  system  /I/ 
to  the  desired  final  states  given  by  the  for¬ 
mulas  /4/. 

IV.  COHCLUSICHS 

In  the  paper  using  complete  controllabi¬ 
lity  matrix  necessary  and  sufficient  condition 
for  complete  controllability  in  a  given  rectan¬ 
gular  of  linear  singular  2-D  system  is  formu¬ 
lated. She  above < considerations  can  be  also 
extended  for  the  general  model  of  K-D  linear 
singular  systems. with  variable  coefficients. 
Using  the  coaplete  controllability  matrix  it 
is  possible  to  solve  analytically  the  so  cal¬ 
led  minimum  energy  control  problem,  ['ll,  [SJ. 
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Abstract  The  work  described  in  this  paper  summarises  the 
-esulrs  obtained  from  a'.vsearch  project.  A  model  of  colour  vision. 
Hum-Alvey  Colour  Appearance  Mode!,  was  derived  to  accurately 
predict  colour  appearance  under  a  wide  range  viewing  conditions 
on  both  self-luminance  display  and  reflection  print  media.  The 
mode!  gave  good  prediction  to  a  comprehensive  experimental 
database  which  was  also  obtained  in  this  project  A  mctlwd  for 
characterisation  imaging  devices,  Le.  self-lummance  displays  and 
electronics  printers,  was  established  and  its  associated 
mathematical  models  were  also  devised 

By  integrating  all  the  results,  an  image  process,  named  the 
"four-stage  transform",  was  formed  and  the  processed  image 
presented  on  a  hardcopy  gave  a  good  appearance  match  to  that 
displayed  on  a  monitor.  This  implies  that  the  dream  of 
WYSIWYG  colour  could  become  a  reality. 

I.  INTRODUCTION 

The  range  of  compiler  supported  applications  where  colour 
selection  and  manipulation  is  a  natural  characteristic  of  the  user's 
task  is  large,  and  includes  such  disciplines  as  graphics, 
architectural,  textile,  product,  interior,  medical,  and  engineering 
design/ With  the' proliferation  of  colour  display  capability  on 
computers  and  the  recent  availability  of  cheaper  digital  colour 
primers,  the  market  is  growing  even  faster.  The  typical  problem  of 
these  systems  is  that  what  users  see  on  one  display  cannot  be 
faithfully  reproduced  onto  another,  and  cannot  be  reasonably 
presented  on  a  hardcopy.  This  is  a  serious  problem.  For  eximple, 
^computer  user  may  spend  hours  painstakingly  choosing  and 
•  balancing  colours  on  screen  in  order,  to  create  the  desired  image, 
then  takes  only  a  minute  to 'print  out  a  hardcopy.  Yet  it  is  the 
hardcopy  that  is  often  the  primary  communication  tool  used  for 
judgments  about  the  design.  The  outcome  from  poor  fidelity  of 
hardcopy  could  be  both  practical  and  aesthetic.  Hence  *V/hat  You 
See  Is  Whit  You  Get",  WYSIWYG,  has  been  an  elusive  target  for 
the  computer  colour  industry. 

There  arc  two  causes  of  this  problem:  First,  there  is  a  lack  of 
understanding  of  the  properties  of  human  colour  perception  in  the 
different  viewing  conditions  used  for  displays  and  reflection 
pnnts.  The  second  cause  of  this  problem  is  that  the  colour 
primaries  and  colour  generating  technologies  of  displays  and 
printers  are  veiy  different.  With  til  this  in  mind,  a  consortium  was 
formed  in  1986  to  tackle  above  problems.  This  research  project 
entitled  "Predictive  perceptual  colour  models"  was  earned  out 
under  the  auspices  of  the  UK  government's  Alvey  programme. 
The  member  of  consortium  were  Grosfield  Electronics,  Sigmcx 
Displays  and  LUTCHI  .  (LUTCH1  stands  for  Loughborough 
Umverainr  of  Technology  Computer-Human  Interface  Research 
Centre.)  The  objectives  of  the  project  were  to: 

*  ^?cr'vc  a  computer-based  model  of  colour  vision  to  predict 
the, colour  appearance  under  various  viewing  conditions,  i.e. 
filomfnants,  luminance  level,  background,  and  media 
(displayed  reflection  print). 

*  Develq?  practical  methods  for  characterising  different  colour 
printing  and  display  devices,  so  that  proofing  simulation  is 
possible. 


n.'  DERIVING  HUNT-ALVEY  COLOUR  APPEARANCE 
/  MODEL 

The  work  for  achieving  the  first  goal  was  divided  into  three 
stages  which  were:  1)  to  conduct  a  large  scale  psychological 
experiment  for  assessing  colour  appearance  under  various  viewing 
conditions  by  a  panel  of  normal  colour  vision  subject*  2)  to 
implement  existing  available  colour  models  and  to  compare  its 
performance  using  the  data  obtained  from  stage  one,  and  3)  to 
modify  a  particular  model  which  performed  the  best  from  stage 
two  until  the  best  fit  was  achieved  to  the  data. 

The  first  stage  of  work  was  the  acquisition  of  experimental 
data.  Ten  observers  were  asked  to  nuke  judgements  of  lightness, 
colourfulnfss,  and  hue  of  a  sequence  of  test  patches  at  the  centre 
of  a  complex  viewing  afield,  using  a  magnitude  estimation 
technique.  Painted  colour  chips  stuck  onto  a  grey  card  were 
presented  in  a  viewing  cabinet  and  its  replicate  displayed  on  a 
high  resolution  colour  display.  The  decorating  patches  around  the 
periphery  of  the  field,  were  selected  randomly  to  create  a 
"mondrian"  pattern  to  simulate  a  complex  field  within  which  to 
assess  the  test  colour.  The  overall  experiment  was  divided  into  23 
phases,  with  approximately  100  colours  in  each  phase,  to  allow 
four  different  parameters  to  be  studied: 

•  four  types  of  illurmnants  (D65,  D50,  white  fluorescent, 
tungsten) 

•  two  luminance  levels  (high  (250  cd/m2)  and  low  (40  cd/m2)) 

•  three  backgrounds  (white,  grey,  and  black) 

•  '  two  displayed  modes  (self-luminace  display  and  reflection 
prints) 

Altogether  this  generated  an.  experimental  database  containing 
over  43,000  estimations,  one  of  the  largest  such  body  of  data  to  be 
amassed  anywhere  in  the  world  since  1945.  This  data  set  is  named 
the  LUTCHI  Colour  Appearance  Data  (I]« 

At  the  second  stage  of  work,  this  data  was  used  to  test  the 
predictive  accuracy  of  various  existing  colour  models.  For  five 
models  tested,  the  Hunt's  model  12}  outperformed  the  others  and 
gave  reasonably  accurate  prediction  to  the  data.  This  model  was 
further  refined  to  become  Hum-Alvey  Colour  Appearance  Model 
(Hunt-ACAM)~t3J.  The  errors  of  prediction  from  this  mode!  are 
similar  to  those  between' each  individual  observer's  and  mean 
visual  results.  This  implies  that  the  model's  performance  is  close 
to  that  of  typical  observing  variation.  A  block  diagram  of 
Hunt-ACAM  is  given  in  Figure  i  to  illustrate  its  functions.  The 
input  parameters  arc  CIE  XYZ  tnstimulus  values  (4J,‘  which  are 
physical 'quantities  to  define  a  particular  colour  in  question, 
together  with  the  information  of  illuminant,  luminance  level, 
background,  and  display  mode.  (The  tnstimulus  values  are  a  set  of 
values  for  specifying  a  colour  arid  recommended  to  be  used  by  the 
Commission  Internationale  d'Edairage  (CIE).  The  X,  Y,  and  Z 
are  the  amount  of  red,  green,  and  blue  lights  in  order  to  match  a 
colour  in  question )  The  output  information  from  the  model  are  the 
predictive  perceived  attributes,  ic.  lightness,  brightness, 
colourfulness,  chroma,  saturation,  and  hue  15)  The  reverse  model 
has  also  been  derived  to  transform  the  lightness  (L), 
colourfulne$$(C),  and  hue  (H)  of  a  colour  under  a  particular  set  of 
viewing  conditions  to  the  corresponding  CIE  XYZ  values. 
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Figure  1.  A  Node  diagram  k  iCusratc  the  Hottf-Alvcy  CoJoar 
Appeu-ucx  Model  (Haoi-ACAM) 

I1L  DEVICES  CHARACTERISATION 

For  tackling  the  second  causc  of  poor  fidelity  problem,  we 
developed  a  method  for  the  characterisation  of  colour  display  and 
printing  devices.  We  generate  a  "colour  cube"  with  nine  levels  for 
each  of  the  three  colour  primaries  of  the  device,  i.e.  red,  green  and 
blue  for  the  tficplay.  and  cyan,  magenta,  and  yellow  for  the 
printers,  fc  total,  9x9x9  *  729  samples  were  produced  for  each 
device. 

For  a  colour  printer,  or  proofing  system  such  as  Oomalin,  with 
cyan,  magenta,  and  yellow  inks,  this  method  included  printing 
nine  chans  each  containing  9x9  samples,  as  shown  in  Figure  2. 
These  samples  were  then  measured  with  a  Macbeth  MS2000 
spectrophotometer  to  obtain  their  absorption  spectra,  from  which 
the  GE  XYZ  tri stimulus  values  were  calculated. 

For  a  colour  monitor,  with  red,  green,  and  blue  drive  signals,  a 
corresponding  sequence  of  colour  patches  is  generated  "at  the 
centre  of  the  screen  via  a  telespcctroradiometcr  (TSR)  to  obtain 
their  emission  spectra,  from  which  again  the  XYZ  tristimulus 
values  were  calculated. 
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Figure  2.  Arrangement  of  colour  patches  in  device  calibraocn 
'  cube  (using  cyan,  magenta  and  yellow  ink  primeries) 


The  Foward  and  Reverse  Device  Models  .were  developed 
in  order  to  calculate  the  XYZ  values  for  any  intermediate  values  of 
CMY  (or  RGD)  and  vice  versa.  The  formula  is  given  in  Mitchell 
and  Wait[6)  and  used  by  Stone  et  al  [7J. 

For  the  forward  device  model,  the  transformation  of  the  device 
coordinates,  say  (r,g,b),  into  tri  stimulus  values  (X, Y,Z)  was 
carried  out  by  using  the  transformations 

'■i^,<r'S’b>V  (t-X.Y»  (I) 

i»l 

where  the  functions  9,(r.g,b),  (i«l,2,...,8),  are  given  by 


.  9i=  <&=  Dffl-DgXI'Dfc),  etc., 

where  Dp  Dg  and  D£  are  the  distances  from  the  origin  of  the 
tekeud  cube  to  the  desired  value;  normalised  to  lie  in  the  range 
'  0-1.  Tbsvertices  are  numbered  as  in  Hgure  3. 


An  arbitrary  hexahedron  is  thus  transformed  into  the  unit  cube  in 
(r,g,b)  space.  The  interpolation  formula  used  to  obtain  the  value  of 
X  corresponding  to  a' colour  produced  by  a  set  of  (r,g,b) 
coordinates,  where  (r,g,b)  is  known  to  lie  in  this  cube,  is  then 
defined  by 


X(r,g,b) «  2A<r’*b>  Xt 

»1 

a  0-Dr)(lrDsXM>b)Xi  +  D,(l-DgXl-Db)X2  + 
(l-Df)Dg(l-Db)Xi+ DrDg(l-Db)X4  + 
(l-DfXl-Dg)DbX5  +  Dtf-DJDfa + (l.Df)DgDbX7 + 

DrDgDbXg  (2) 

Similarly  the  Y  and  Z  tristimulus  values  were  calculated. 

For  the  reverse  device  model  for  finding  ,$ay  (c,m,y)  from 
given  tristimulus  values  X.Y.Z,  we  Stan  from  the  middle  cube 
which  has  an  origin  of  (5,5,5)  and  find  the  corresponding  c,m,y 
and  X,Y,Z  values  at  that  point  We  then  calculate  AX,  AY,  AZ, 
and  obtain  the  differences  in  c,  m,  y  values  by  solving 

(Ac  Am  Ay)T  »  J  »  (AX  AY  AZ)T  (3) 

subsequently, 

(c  m  y)A^iyr  b  (c  m  y)W  +  (Ac  Am  Ay)MT  (4) 
where  k  is  the  iteration  parameter  and  J  is  the  Jacobian  matrix  given 
by 


dX 

dX 

dX 

dc 

dm 

dy 

dY 

dY 

dY 

dc 

dm 

dy 

dZ 

dZ 

dZ 

<L 

dm 

dy 

By  applying  the  interpolation  formula,  see  equation(2),  a  new  set 
of  (X,  Y,  Z)  values  are  then  calculated  and  (AX  AY  AZ)<*+1>  is 
found.  If  these  values  are  within  tolerance  wc  exit  with  the 
calculated  c,  .u  and  y  values  of  equation(4).  Otherwise,  we  proceed 
to  the  next  iteration,  cquation(3) ,  using  the  same  matrix  J  if  still  in 
the  same  cube.  If  it  is  not  in  the  same  cube  we  start  the  same 
procedure  with  the  new  calculated  poinf  (new  cube),  until 
convergence  i:  achieved. 
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iv.  image;  processing 

The  real  achievement  of  the  project  has  been  the  ablity  to 
combine  the  device  characterisation  method  with  the  colour. 


s  us  not  only  to  simulate  the  appearance  of  an  image  when 
produced  on  a  different  device  but  also  to  predict  the  change  in 
appearance  of  a  coloured  image  under  various  different  conditions. 
When  incorporated  into  a  computer-based  design  system  this 
technology  would  allow  the  users  to  visualise  on  the  display 
monitor  the  true  colour  appearance  of  the  finil  product  and  to  have 
t  high  degree  of  confidence  that  this  colour  specification  would  be 
preserved into  production. 


This  image  processing  software  is  designated  the  Tour-Sage 
Transform"  (FST)  and  it  is  illustrated  in  Figure  4.  For  this 
particular  example,  the  operator  wishes  to  produce  a  "hardcopy- 
version  of  the  displayed  image.  Initially,  the  image  represented  m 
the  red,  green,  and  blue  primaries  of  a  display  was  converted,  pixel 
by  pixel,  into  an  equivalent  XY2  image  using  the  Forward 
Device  Model-  Secondly,  this  XYZ  image  was  then  transformed 
by  the  Forward  Hunt-ACAM  to  obtain  the  LCH  image  in  which 
the  LCH  are  the  perceived  attributes  under  the  display  .viewing 
conditions.  Thirdly,  the  LCH  image  was  converted  to  an  XYZ 
image  which  preserves  the  same  appearance  but  viewed  under  the 
hardcopy  viewing  conditions  using  Reverse  Hunt-ACAM. 
Finally,  the  XYZ  image  was  processed  to  achieve  cyan,  magenta, 
and  yellow  ink  primaries  of  a  printer  using  the  Reverse,  Device 
Model. 


Several  images; have  been  tested.  The  results  are  quite 
satisfactory  that  good  appearance  match  between  the  primed  and 
displayed  mage  was  obtained. 

V.WHATS  IN  STORE? 

We  commenced  a  further  three-year  research  project  in  Febroary 
1990.  The  new  consortium  consists  of  Grosfkld  Electronics  and 
Loughborough  University,  as  before,  but  also  includes  Coats 
Vfyella,  the  giant  textile  manufacturer.  The  aims  of  the  new  project 
are; 

•  to  extend  the  functionality  of  Hunt-ACAM  into  new  ir.-^dia 
and  viewing  conditions. 

•  to  improve  the  device  characterisation  methods. 

•  to  develop  better  user  interfaces  for  the  management  of 
-colours.. 

•  to  bring  the  image  processing  technology  nearer  to 
commercial  exploitation. 

.We  are  confident  that  this  on-going  research  effort  will  yield 
great  benefits  in  the  medium  term  for  management  of  colour  in 
computer-based  design  systems,  and  will  enable  the  dream  of 
WYSIWYG  colour  to  become  reality. 
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Figure  4.  A  block  diagram  to  illustrate  the  four-stage  transform 


HISTOGRAM  SEGMENTATION  FILTERING : 

A  GRAPHICS  DCT  COMPRESSION  POSTPROCESSOR 


R'.A.KIRK 
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Abstract:-  Many  DCT  compression  systemswork 
well  on  scanned  images  but  cannot  compress 
text  and  other  artificial  features  without 
injecting  visible  errors:  These  features 
generally  have  -sharp  eiges  between  flat 
regions.  The  histogram  of'  the  pixel  values' 
near  such  a  feature  will  typically'  show  a 
few  sharp  peaks.  If  we  segment  the  histogram 
of  a  DCT  block  into  its.  peaks  we  can 
identify  these  features  and  reconstruct  the 
original  values.  Crosfield  DCT  compression 
results  are- presented. 


Experimental  DCT  compression  -of 
artificial 'test 'image  without 
Histogram  Segmentation  Filtering 


The*  discrete  Cosine  Transform  (OCT)  of  a 
small  (8*8)  section  of  an  image,-wiH  often 
have  many  values  close  to  zero.  DCT 
compression-  techniques  [I]-(4J  exploit  this 
feature  -by" quantizing  -these  values  to  give  a 
compactly  codeable-  set'  of  integers,  most  of 
them  zero.  This- introduces' small  errors.  The 
quantization  is  usually  optimised  to  make 
the  errors  invisible  -under  normal 
conditions. 

Sharp  edges  between  flat  regions  present 
particular  problems  for  a  DCT  compression 
system.  A  sharp  feature* will  have  many  large 
values  in  -its  transform,  so : the  quantization 
errors  may  be  unusually  severe.  The  errors 
will  be  unusually  visible  against  the  flat 
background. 


Fig.  2.  Same  image  as  fig.  1  after 

Histogram  Segmentation  Pilfering 
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st  image? 


In.-a'scan  of  a  photograph -Such -features  are 
rarely  a  problem;  '  a  correctly  sampled  edge 
should  not  be  sharp,  and  there  is  usually 
noise  and  future'  present.  He  notice  the 
worst  effects  in  the  artificial  features  of 
an  assembled  page,  such  as  text,  borders,  and 
cut-outs. 

We  can^see  these -errors  easily  in  the  example 
in  figure  1  whenever  an  8*8  block-  overlaps  a 
sharp  edge.  The  original  image  had  flat  text, 
sharp,-  edges;  and  smooth  background.  The 
compression  was  done  using  "  an  experimental 
DOT  algorithm  that  could  compress  scanned 
photographs  with  ho  visible  degradation.- 

We  could  segment  the  blocks  using. -the  graph 
theory  approach  of  'Morris  'and  Constantinides 
(5]  to  determine  whether  the  block  is  likely 
.to  have  been  corrupted'  and  to  clean  it'  up. 
Instead  we  have  segmented  the  histogram.  This 
gave  much  the  same  sort -of  result  and  “was 
simple  to  > implement . 

2., ,  SEGMENTING  -THE 

We  calculate  the  histogram  of  our  8*8  ' pixel 
block.  We  have  a  sec  of  -peak  values  V(n); 
n-l..N  with  frequency  F(n)  >  0.  We  say 
segment  this  histogram-as  follows... 

(1) .  Find  the  two  peaks  with  the  closest 

values.  ^ 

(2) . <Jf  those  peaks  are  separated  by  more 

than  a  threshold  value,  then  stop. 

(3) .  Merge  the  two  peaks... 

Frequency  -  sum  of  old  frequencies 
Value  -  weighted  average  of  old  values. 

(4) .  Loop  back  to'(i). 

This  should  tidy  the  histogram  up,  sweeping 
loose  clusters  of  values  into  sharp  peaks. 


I^THE, .SEGMENTATION  THRESHOLD 

When  the  threshold  was  kept  constant  the 
image  lost  its  low  contrast  features,  while 
high  contrast  noise  from  high  contrast  edges 
remained. 

Making  the  threshold  a  fixed  fraction  of  the 
total  range  in  the  block  gave  better  results, 
but  that  was  too  easily  influenced  by  extreme 
values. 

The  best  results  for  this  compression  system 
were  given  by.°i . 

Threshold.*  Standard  deviation  *  8 

This  gave  the  same  sort  of  filtering  on  a  low 
and  a  high  contrast  block.  This  suited  the 
compression  system  we  were  using.  A  value  of 
8-0.6  gave  a  good  balance  between  filtering 
noise  and  filtering^ out  genuine  image  detail. 

On  low  contrast  blocks  the  calculation  can  be 
dominated  by  the  rounding  errors.  To  avoid 
this  we  turned  off  the  filter  if  the  standard 
deviation  fell  below, 15. 


4, -DETECTING  WHERE  TO  SEftHPffT 

We  can  determine  what  sort  of  image  we  have 
in  our  block  from  the  number  of  peaks  in  our 
segmented  histogram:  Our  test  image  (fig.  1) 
had  only  one  peak  for  the  flat  blocks,  and 
2-4  peaks  -for  the  blocks  with  a  feature. 
Typical  scanned  image  blocks -could  have  up  to 
9  peaks. 

If  the  standard  deviation  was  greater  than 
the  section  3  limit  filter  as  follows: - 

Peaks  Action 

1  No  filtering.  Either  flat  or  subtle 

texture.  Compression  should  not 
have  •cr.used- significant  error. 

'2-4  Substitute  corresponding  segmented 

histogram  values.  Probably  flat 
.areas  with  sharp  edges. 

>5  No  filtering.  Probably  texture. 

^RESULTS 

The  effects-  of  the  histogram  segmentation 
filter  are  shown  .in  Fig.  2.  Most  of  the 
artefacts  from  compression  have  been  removed. 

The  same  filter  was  tried  on  filtered  and 
unfiltered  versions  of  real  images  containing 
text,  colour  charts,  wood  grain,  face,  and 
hair  details.  The  results  are  not  presented 
here.  Only  a  few  blocks  were  altered  and  no 
harmful  effects  were  seen. 


6,. CONCLUSIONS. 

We  have  a/filter  that  can  remove  fine  detail 
from  sharp  edged  features.  This  may  be  used 
to  cleanv  up  the  severe  problems  with  text 
resulting  from  compression  and  decompression;. 
As  the  filter  acts  on  the  image  aftex 
decompression,  it  has  no  effect  on  the  data 
compression  ratio.  It  appears  to  have  little 
effect  on  ordinary  image  data. 

We  can -get  a  better  value  of  threshold  if  we 
get  more  data  from  the  compression  system.  We 
may  estimate  the  quantization  error  to  be 
zero  if  the  quantized  value  is  zero,  and 
(quantization  interval)/4  if  not.  All  of  the 
errors  added  in  quadrature  give  an  estimate 
of  the  error  signal  amplitude.  We  have  used 
this  approach  with  success  (6),  but  it  may 
•not  work  with -all  compression  systems. 
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ISO/I EC  JTC1/SC2/WG8  1990 
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ANALYSIS  IN  PRINT  PROCESS  DEVELOPMENT: 
TVO  CASE  STUDIES 


H.  J.  KERRY 

Cambridge  Consultants  Ltd 
Science  Park 
Milton  Road 
Cambridge  CB4  iVi 

Abstract  -  This  paper  describes  two  case-studies  in 
which  mathematical  analysis  and  computer  simulation 
have  been. used -in  the  development  of  printing  equip- 
nent  and  processes.  The  common  these  of  both  exasp les 
is  ensuring  uniformly  high  ieage  quality,  although  the 
printing  technologies  involved  are  very  different. 

The  first  exarple  describes  a  design  tool  for  offset 
litho  print  towers,, and  the  second .describes  the  anal¬ 
ysis  of  acoustic  effects  in  a  novel  eulti-channel 
drop-on-denand  ink  jet  printer  array.  Both  case  stud¬ 
ies  illustrate  the  way  in  which  simple  mathematics  can 
be  exploited  to  address  complex  and  challenging  design 
problems. 


DESIGN  TOOL  FOR  AN  OFFSET  LITHO  PRESS 

In  the  offset  lithography  process  the  image  is  etched 
onto  the  surface  of  octal  plates.  This  etching  changes 
the  surface  properties  so  that  the  plate  becomes  oleo¬ 
philic.  Ink  Is  applied  froa  one  or  oore  ' force' 
rollers  carrying  a  thin  fila  of  ink,  which  the  ioage 
plate  accepts  only  where -is  has  been  etched.  The  Ink 
is  then  offset  onto  a  blanket  roller  before  final 
transfer  to  the  paper  or  other  cedlua. 

The  ink  film  on  the  force  roller  is  replenished  froo 
a  supply  via  a  series  of  interoediate  rollers  arranged 
in  a  tower.  This  tower  oust  perform  two  functions. 
First,  it  rust  act  as  a  buffer  between  the  internment 
supply  and  the  force  roller  -  it  is  not  possible  to 
provide  adequate  control  of  the  replenishment  with  a 
continuous  supply.  The  second  function  of  the  tower  is 
to  even  out  the  variations  in  filn  thickness  around  the 
force  where  ink  has  been  transferred  to  the  Ioage. 

Poor  performance  by  the  tower  will  result  In  variations 
In  overall  ioage -density,  or  in  ’ghosting'  within  the 
ioage. 

Generally  speaking,  the  core  rollers  there  are  in  the 
tower  the  better  the  performance  -  typically  there 
night  be  12  to  20  rollers,,  in, a  seall  press.  For  a 
fixed-foreat  oachine  in  wnfch  the  ioage  size  is 
standard,  the  tower- configuration  can  be  optialsed  by 
a  process  of  trial  and  error  starting  froa  existing 
designs. 

In  order  to  design  a  variable  format  print  tower,  in 
which  different  ioage  sizes  are  to  be  accommodated,  a 
note  sophisticated  approach  is  needed.  The  requireaent 
for  a  variable  format  aakes  is  desirable  that  there  is 
only  a  single  forme  roller,  which  oakes  is  core  diffi¬ 
cult  to  achieve  an  even  inking  of  the  ioage.  Further¬ 
more,  the  tower  configuration  oust  be  'optinised'  so 
that  none  of  the  Image  sizes  would  suffer  from  poor 
quality. 

In  order  to  enable  designs  to  be  evaluated  on  paper,  a 
cooputer  simulation  was  developed.  This  modelled  the 
variation  in  ink  filn  thickness  around  the  circumfer¬ 
ence  of  each  roller.  A  nunber  of  simplifying  assumpt¬ 
ions  were  Bade: 


*  the  rollers  are  assumed  to  be  in  rolling  contact 
without  slippage; 

*  ink  flow  on  the  surface  of  the  rollers  is 
neglected,  so  that  ink  as  assused  to  be  trans¬ 
ported  between  the  contact  points  around  the 
perieeter  of  the  rollers. 

*  the  sisulation  was  one-dieensional ,  so  that 
longitudinal  motion  of  the  rollers  was  not 
codelled. 

A  sieple  ink  transfer  law  was  assused  at  each  point 
of  contact  -  the  total  amount  of  ink  prior  to  contact 
is  divided  between  the  rollers  after  contact,  according 
to  a  fixed  ratio. 

The  sisulation -in  started  froo  an  initial  fully  xnked 
state,  and  settles  down  into  a  steady  state  froo  which 
statistical  eeasurecents  of  the  variation  in  ioage. 
density  can  be  obtained.  It  was  validated  qualitat¬ 
ively  by  cosparing  the  predictions  for  existing  towers 
known  to  give  respectively  good  and  poorer  quality 
isages,  and  then  used  to  predict  the  performance  of 
alternative  designs  and  for  fine  tuning  of  the  pre¬ 
ferred  candidate. 

The  use  of  this  tool  resulted  in  a  roller  configurat¬ 
ion  which  required  no  further  development  before 
being  put  into  production,  and  produces  print  of 
excellent  quality,  exceeding  expectations. 

CROSS-TALK  EFFECTS  IN  HIGH  DENSITY  INK  JET  ARRAYS 

Print  quality  in  an  ink  Jet  printer  is  determined 
primarily  be  the  accuracy  of  the  landing  positions  of 
the  drops  on  the  paper.  Errors  in  drop  position  can 
arise  froo  a  nuaber  of  sources,  and  in  this  paper  we 
are  concerned  with  cross-talk  -  variations  arising 
froa  Interactions  between  one  nozzle  and  Us  neigh¬ 
bours  . 

In  designing  an  ink  jet  array  for  a  given  print 
quality  requirement,  the  tolerable  error  in  landing 
position  oust  be  shared  carefully  between  these 
various  sources.  Typically,  for  a  300dpi  printer, 
this  overall  tolerance  is  in  the  region  >of  20im,  and 
cross-talk  effects  oust  not  exceed  about  X  of  this. 

The  landing  error  depends  on  the  relative  error  in 
drop  velocity.  It  is  highly  desirable  to  operate  at 
a  drop  velocity  such  that  satellite  drops  do  not  form  - 
about  3o/s  for  typical  inks.  For  a  high  speed  array 
printer,  this  brings  the  permissible  variation  of  drop 
speed  to  around  &a/sec  (about  an  8%  variation)  in 
order  to  keep  the  landing  error  to  less  than  5yn. 

Analysis  of  the  'shared  wall'  actuator  design 

The  figure  below  shows  a  novel  design  of  printer 
array,  fabricated  froa  PZT,  which  can  be  made  at 
densities  up  to  300dpi.  In  this  design  each  wall  is 
actuated  in  shear  node  so  that  it  deflects  sideways 
and  can  operate  the  two  channels  either  side  of  it. 


1846 


CHEVRON  PRINTHEAD 


Fig.  1.  Shared  wail  actuator,  fabricated  from  a  single 
block  of  PZT  ceramic. 


Although  this  design  has  many  advantages,  the  walls 
are  relatively  compliant.  This  results  in  coupling 
of  pressure  between  any  channel  and  its  near- 
neighbours,  and  is  the  primary  source  of  cross-talk. 


The  volume  of  the  delivered  drop  depends  on  the  flow- 
rate  through  the  nozzle  and  on  the  drop  delivery  time. 
The  flowrate  depends  on  the  channel  pressure  generated, 
and  the  delivery  tine  is  Just  the  tine  taken  for  an 
acoustic  wave  to  propagate  along  the  length  of  channel. 
The  analysis  described  in  this  paper  was  initially 
undertaken  to  explain  the  apparent  variation  of 
acoustic  velocity  for  different  firing  patter, rs  in  a 
prototype  actuator.  It  has  been  developed  specifically 
for  this  type  of  array,  but  Is  generally  applicable. 

The  actuator  is  modelled  as  a  number  of  identical  two- 
dimensional  channels  containing  ink.  As  described 
above,  the  walls  separating  the  channels  are  compliant, 
and  a  pressure  difference  across  the  walls  will  cause 
a  proportionate  lateral  deflection.  We  nay  neglect 
wall  inertia  as  the  resonant  frequency  of  Wall  vibrat¬ 
ion  is  ouch  higher  than  the  frequencies  associated  with 
drop  ejection. 


The  equations  which  describe  flow  in  the  channel  are 
the  momentum  equation,  the  nass  conservation  equation, 
and  the  constitutive  relationship  between  pressure  and 
density  for  the  ink.  The  flow  is  essentially  one- 
dimensional  and  of  small  amplitude,  and  the  linearised 
acoustic  equations  for  each  channel  are: 


l2  2  >2P, 


Where  k  is  a  constant,  the  ratio  between  the  compliance 
of  the  wall  and  of  the  ink.  The  set  of  equations  U) 
can  be  written  as  a  matrix  wave  equation* 

(I  ♦xA)  — -  a  cft  — -  (2) 

*tr  ix2 

where  A  is  the  second-difference  outrix.  For  an  N- 
channeT  array  there  are  K  wavelike  solutions 
g(x,t)  a  gfagct)  corresponding  to  the  eigenvectors  of 
the  matrix  A'.  These  the  acoustic  modes  of  the  array, 
and  for  each  eigenvalue  \  the  corresponding  mode' 
propagates  non-dispersively  with' speed  c  «  c0//(l*X<). 


The. eigenvalues  of  A  lie  between  0  and  d,  so  that  for 
a  typical  compliance  ratio  k  a  0,3  the  theoretical 
variation  in  propagation  speed  is  around  30%  of  cQ, 
consistent  with  the  observed  variability. 

The  theoretical  analysis  has  been  confirmed  by 
comparison  with  experiment,  and  has  shown  excellent 
agreement.  The  existence  of  acoustic  modes  is 
confirmed, by  measurements  of  the  resonant  frequencies 
of  an  array. with  open. ends.  The  expected  linear  " 
relationship  between  eigenvalue  and  (period)^  has 
been  demonstrated,  and  agrees  with  independent 
measurements  of  the  compliance  ratio  k. 

Cross-talk  and  cancellation 


The  analysis  can  be  readily  extended  to  give  the 
channel  pressure  distribution  j>  generated  by  a  given 
excitation  pattern  V: 

(3) 

where  a  is  the  excitation  parameter.  This- shows  that 
when  a  firing  voltage  is  applied  to  only  a  single 
channel,  the  effect  of  wall  compliance  is  to  reduce 
the  pressure  in  the  actuated  channel  and  to  induce 
cross-talk  pressures  throughout  the  array. 

A  video  illustrating  cross-talk  in  an  actuator  will 
be  sho/*n. 

Equation  (3)  enables  the  pressure  field  generated  by 
a  given  applied  voltage  pattern  to  be  computed,  but 
also  ofiers  a  method  of  compensating  for  the  effect  of 
compliance.  It  is  straightforward  to  determine  the 
required- initial  pressure  distribution  to  generate  any 
given  pattern  of  drops,  and  the  above  equation  can 
then  be  solved,  giving  the  voltage  pattern  needed  to 
generate  the  required  pressures. 

In  this  way,  with  signal  processing  incorporated  into 
the  drive  electronics,  the  array  can  be  made  to 
operate  entirely  free  of  cross-talk. 
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ABSTRACT  a  general  model  is  introduced  for 
the  transfer  of  ink  to  paper  from  solid 
letterpress  pl3tes.  By  isolating  the  non¬ 
linear  from  the  linear  region  m  the  graph  of 
ink  transferred  against  ink  available,  atten¬ 
tion  is  focussed  on  the  need  for  a  raodel 
giving  the  area  of  ink  m  contact  with  the 
paper  at  low  ink  film  thicknesses.  A  possi¬ 
ble  model  is  proposed  for  this  contact  area. 

INTRODUCTION 

The  experimental  technique  for  investigating 
ink  transfer  requires  the  amount  of  ink  <x) , 
applied  to  the  letterpress  plate,  to  be 
varied  and  the  resulting  ink  transferred  (y) 
to  be  measured.  This  can  be  done  by  weighing 
the  plate  before  and  after  the  application  of 
ink  and  following  the  transfer  of  the  ink  to 
the  paper.  From  the  resulting  set  of  <x,y) 
co-ordinates,  two  graphs  can  be  plotted. 
Figure  1  shows  the  first  graph:  ink  trans¬ 
ferred  against  ink  available  (y  against  x) . 
Usually  this  has  the  typical  "S"  shape  shown 
and  the  upper  region  which  corresponds  to 
high  ink  film  thicknesses  is  normally  linear. 
Figure  2  shows  the  second  graph:  proportion 
of  ink  transferred  against  ink  available  (y/x 
against  x) .  This  has  a  distinct  maximum.  It 
is  this  curve  which  has  received  most 
attention  in  modelling  the  phenomenon. 


Figure  1 


Figure  2 


In  their  classic  paper  of  1955,.  Walker  and 

Fetsko2  laid  down  the  foundations  of  the 
theory  of  ink  transfer.  Since  then  various 
criticisms  have  been  made  of  the  model  they 
proposed  but  invariably  other  workers  have 
established  it  as -a  basis  and Jhave  .attempted 
to  improve  upon  it  by  minor  modifications. 
Also  it  has  been  extended  to  the  processes  of 
gravure  and  lithography.  For  an  excellent 
review  and  comparison  of  the  various  models 
proposed  over  the  years  for  letterpress,  the 
reader’  is  referred  to  the  1982  paper  by 

Mangin  et  al2. 

THE  WALKER  AND  FSTSKO  MODEL 

At  very  low  ink  film  thicknesses,  the  contact 
between  the  paper  and  ink  is  incomplete  due 
to  the  microscopically  rough  nature  of  paper. 
Walker  and  Fetsko  suggested  that  the  area  in 
contact,  A,  be  given  by  the  exponential 
function 

A  ■  I  -  e 
where  k  is  a  constant. 

Focussing  next  on  the  paper  in  contact  with 
the  ink,  Walker  and  Fetsko  proposed  that 
paper  has  a  limiting  capacity  for 
"immobilising"  or  absorbing  ink  during  the 
impression  time.  Again  they  suggested  an 
exponential  function  to  give  the  quantity  of 
ink  immobilised  per  unit  area  y2 

y2  “  (l  -  e'x/b>b 

where  the  constant  b  represents  the  maximum 
quantity  of  ink  per-  unit  area  immobilised  by 
the  paper. 

Consider  next  the  ink  which  is  not  immobil¬ 
ised  by  the  paper,  i.e.  the  free  ink  film. 
Since  the  total  amount  of  ink  available  for 
transfer,  y^,  is  given  by  «  x,  it  follows 

that  the  quantity  of  free  ink  is  given  by 
y1  -  y2<  Walker  and  Fetsko  proposed  that 

the  free  ink  film  splits  by  a  fraction  f 
which  is  constant  independent  of  x,  so  that 
the  fraction  of  the  free  ink  film  transferred 
to  the  paper  will  be  f • (y^  -  y2) . 

Combining  these  relationships,  it  is  seen 
that  the  quantity  of  ink  transferred  y  is 
given  by 

y  -  A[y2  +  f(y2  -  y2>] 

y  •  (1  -  e"**)  J(1  -  e'x/b)  bU  -  £>  i  fxj 

At  high  ink  film  thicknesses,  i.e.  as  x  -»#, 
y  **  b(l  -  f)  -f  fx. 

a  straight  line  with  intercept  b(l  -  f)  and 
gradient  f. 

DEVELOPMENT  OF  GENERAL  MODEL 
The  first  assumption  to  be  made  here  is  that 
the  printing  plate  is  capable  of  immobilising 
a  small  quantity  of  the  ink  ,  say  x0<  Define 

a  new  variable,  z,  the  maximum  amount  of  ink 
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available  for  transfer  to  the  paper  such  that 
z  =  x  -  x0  and  yt  =  z. 


Suppose  the  maximum  amount  of  ink  immobilised 
by  the  paper  is  given  by  b' ,  as  for  the 
Walker  Fetsko  model,  but  that  the  quantity  of 
ink  immobilised  by'the  paper  Is  given  by 
y2  ■  A(x  -  xo) -b'  *  A(z) .b' x 
where  A(x  -  x0)  is  the  fractional  area  of 
paper  in  contact  with  the  ink  and  if,  some  as 
yet  unspecified  function  of  x  -  xQ. 

Now  A(z)  *•  A (x  -  x0>  must  have  the  following 
properties 

As  X  ■»  x4,_  A(z)  0  0 
As  x  -t  «,  A(z)  •>  1 

The  gradient  of  A(0)  must  not  exceed  unity 
and  A(z)  has  the  familiar  "S"  shape. 

Again  adopting  the  Walker  Fetsko  technique, 
it  is  seen  that  the  free  ink  film  is  given  by 
yj  -  y2  and  assuming  that  a  constant  fraction 

"f"  of  this  is  transferred  to  the  paper, 
independent  of  x  it  follows  that  the  quantity 
of  ink  transferred 

y  ■  y 2  +*•<*!-  y2> 

*  A(x  -  x0)*b'  +  f((x  -  x0i  -  A<X  -  x0)*b'l 
*•  A(x  -  x0)‘b'(l  -  t)  *  £(x  -  x0) 

or  v  "  A  (z)  b*  (1  -  f)  +  fz 
At  higher  ink  film  thicknesses  x  ->  »  and  A(z) 
is  defined  to  approach  1  giving 
pb'U  ■  f)  +  fz 


ISOLATION  OF  NON-LINEAR  REGION 

In  order  to  focus  attention  on  the  type  of 
function  required  for  the  contact  area  A(z), 
it  is  possible  to  extract  the  A(z)  trend  from 
practical  data  by  isolating  the  non-linear 
region  of  the  ink  transferred  against  ink 
available-  graph  (y  against  x) .  It  is  possi¬ 
ble  to  determine  for  the  linear  region  the 
gradient,  f,  and  the  intercept,  b' (1-f) ,  by 
a  technique  such  as  least  squares.  It  is 
seen  that  v 

A<*>  - 

If  such  calculations  are  made  for  all  non¬ 
linear  data  points  then  a  graph  of  A(z) 
against  z  can  be  constructed.  This  will  give 
the  S  shape  to  be  modelled  by  the  function 
A(z) .  This  curve  lends  itself  to  numerical 
techniques  to  determine  the  function. 


CONTACT^ AREA -MODEL 

Several  models  were  tried  which  have  the 
necessary  properties,  defined  for  A(z) .  That 
which  gave  the  best  fit  to  practical  data  is 

based  on  the  work  of  BayrSung  Hsu*,  who 
investigated  the  variation  of  the  area  of 
contact  of  paper  with  the  thickness  of  the 
ink  film  applied.  Hsu  proposed,  that  paper 
obeys  a  relationship  of  the  form 

fractional  area  in  contact  0  A  B  c,n 
fractional  "area  untouched  1-A 

A  *  fractional  area  of  paper  in  contact  with 
the  ink 

c  and  n  are  constants  of  paper 

t  «  original  thickness  of  the  ink  film 
measured  from  a  plane  through  the  tops  of  the 
maximum  peaks 

From  this  it  follows  that  the  fractional  area 
of  paper  in -contact  with -ink  will  be 
C£n 

A’  *  ■ 

i  +  c  r 

Hsu  found  this  function  to  give  a  very  good 
fit  to  experimental  data  measured  by  previous 
investigators,  the  value  of  n  varying  between 
1.7  and  2.7  depending  on  the  type  of  paper. 
The  problem  in  this  context  is  how  to  relate 
the  ink  film  thickness  below  the  maximum 
plane  to  the  quantity  of  ink  on  the  printing 
plate.  It  is  Assumed  that  the  top  peaks  on 
the  paper  cannot  penetrate  the  layer  of  ink 
cf  quantity  xQ,  immobilised  by  the  plate. 

Accordingly  the  original  ink  film  thickness 
below  the  maximum  plane  will  be  z  *  x  -x*0  and 
the  new  model  for  the  fractional  area  of 
contact  becomes 


A(Z) 


Substituting. into  the  general  model  gives 
— n 

b' (1-f)  +  f *z 


1  +  Cz 


and  the  proportion  of  ink  transferred  is 
given  by  . 

1  .  — _  .  b'  <1-0  +  £ 

2  1  +  Cz" 

It  follows  that  at  the  maximum,  the  z  value 
and  the  area  of  contact  are  given  by 
m  h  B  n-1  ^  >  P**l, 

2m  ana  \  ” 
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ELECTROTHERMAL  RIBBON 

RICHARD  J.CAMERON  AND 
Manchester  Polytechnic 
Chester  St 

Manchester  Ml  5GD  UK 
Abstract 

A  mathematical  model  of  the 
electro-thermal  ribbon  printing  process  , 
has  been  developed.  The  process  and  the 
model  are  described  and  some  results  of 
the  heat-flow  calculations  are  given. 


PRINTING  PROCESS 
DAVID  N.M.IBRAHIM 

Heat  Generation 

Heat  is  generated  in  the  ribbon  by 
three  distinct  mechanisms : 

i.  in  the  body  of  the  polycarbonate  by 
Joule  heating 


Introduction 

The  electro-thermal  ribbon  (ETR) 
printing  process  was  developed  bv  IBM 
under  the  trademark  QUIETWRlTER. 
The  process  uses  a  matrix  of  dots  and  is 
capable  of  printing  up  to  200  characters 
per  second  with  high  quality.  Ink  on  the 
ribbon  is  heated  by  heat  generated 
electrically  within  the  ribbon.  The  current 
enters  the  ribbon  through  a  set  of 
continuously  moving  electrodes.  The 
paper  and  nbbon  are  in  contact  only  for 
about  5  ms.  The  heat  generated  flows  into 
the  ink  by  conduction,  the  ink  becomes 
sticky  with  the  heat  and  adheres  to  the 
paper  forming  a  black  dot.  A  lot  of  early 
work  is  described  in  the  IBM  Journal  of 
Research  and  Development,  Volume  29 
1985. 

The  aims  of  the  modelling  are  to 
predict  print  quality  from  the  material 
parameters  and  to  improve  control  over 
the  electric  heating  of  the  ribbon. 

Structure .  oltho.  Ribbon 


The  ribbon  has  three  layers,  as  shown 
in  Figure  1  and  is  traversed  by  an  array  of 
tungsten  electrodes  each  of  which  is  about 
50  microns  square  with  a  spacing  between 
electrodes  of  50  microns.  The 
polycarbonate  gives  the  ribbon  structural 
strength  and  is  doped  with  carbon  black 
to  make  it  electrically  conductive.  The 
aluminium  is  sputtered  on  to  the 
polycarbonate  and  acts  as  a  return  path 
for  the  electric  current.  The  ink  is  applied 
either  directly  onto  the  to  the  aluminium 
or  on  to  a  very  thin  release  layer. 


Electrode 


Polycarbonate  15pm 


Aluminium  1000  A  -4 


Ink 


Figure  1.  Structure  of  the  ribbon 


ii.  at  the  contact  between  the 
polycarbonate  and  the  Tungsten  contact 

iii.  at  the  boundary  between  the 
polycarbonate  and  the  aluminium. 

The  last  is  due  to  water  in  the 
polycarbonate  combining  with  the 
aluminium,  during  the  sputtering,  to 
produce  a  very  thm,  possibly 
monomolecular,  layer  of  oxide.  A 
significant  proportion  of  total  heat 
generated  is  generated  in  the  oxide  layer. 
This  is  beneficial  to  the  process  because 
heat  is  generated  close  to  where  it  is 
needed,  in  the  ink. 

The  contact  and  oxide  resistances 
behave  like  Zenner  diodes.  Their 
break-down  voltages  introduce  a 
non-linearity  into  the  boundary  conditions 
for  current  uow. 

In  order  for  the  process  to  work,  the 
ink  must  be  subjected  to  very  rapid 
heating  which  in  turn  requires  that  the 
energy  densities  in  the  heated  region  of 
the  nbbon  must  be  very  high. 

During  normal  operation  the 
polycarbonate  undergoes  a  glass 
transition  at  about  1600  which  requires 
the  addition  of  energy.  Below  15 OC  it 
behaves  like  an  ordinary  resistance  which 
of  course  varies  with  temperature.  The 
current  voltage  charactenstic  of  the 
ribbon  is  shown  in  Figure  2. 
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The  glass  transition  stresses  the  film 
with  the  result  that  the  film  buckles  after 
printing  and  is  usable  only  once.  The 
changes  in  the  state  of  the  polycarbonate 
and  of  the  ink  make  it  necessary  for  the 
model  to  be  able  to  take  account  of 
variationof  material  parameters  with 
temperature. 

Calculations 

The  current  flow  in  the  ribbon  is 
calculated  at  the  same  time  as  the  heat 
transport  using  finite  differences. with  “a 
variable  grid.  The  current  within  the 
polycarbonate  obeys  the  equation 

V.oV?  =0 

where  9  is  the  electric  potential . 

and  o  is  the  electrical  conductivity. 

The  normal  heat  conduction  equations 
are  complicated  by  the  glass  transition  in 
the  polycarbonate  and  melting  of  the  ink. 
Iterative  techniques  are  used  to  establish 
i.  the  boundary  conditions  for  current  flow 
at  the  oxide  layer  and  ii.  the  regions  of  the 
ribbon  undergoing  the  glass  transition. 

Topically,  currents  of  25  mA  are  used 
in  the  printing  at  40  characters  per 
second,  giving  a  potential  difference 
across  the  ribbon  of  TV.  Figure  3  shows 
the  pattern  of  the  electric  potential  in  a 
section  across  the  ribbon  below  the 
electrode.  In  Figure  3  the  electrode  is  at 
TV  and  the  aluminium  is  at  ground. 

Figure  4  shows  the  temperature  pattern 
after  current  has  flowed  in  the  ribbon  for 
0.6ms.  The  ambient  temperature  is  20C. 

It  can  be  seen  that  the  heat  is  produced 
largely  under  the  ’footprint’  of  the 
electrode  and  that  there  is  little 
interference  betweeen  adjacent  electrodes. 
Figure  4  also  shows  clearly  the 
importance  of  the  aluminium  oxide  layer 
to  the  heating  process. 


Figure  4.  Heat  pattern  after  0.5ms 
Validation 

One  of  the  major  problems  with 
modelling  printing  processes  is  calibrating 
and  validating  the  model.  Here,  the  small 
size  of  the  components  is  compounded  by 
the  speed  of  the'processes  and  the  wide 
range  of  temperatures  that  are  involved. 
Observation  of  temperature  in  the  nbbon 
has  only  been  possible  by  using  a  special 
printing  rig  with  infra-red  microscope 
observing  the  surface  of  the  ink. 
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Abstract:  A  semantic  framework  for  discrete  event  languages  has 
been  defined  and  built  up.  Its  use  as  a  tool  for  abstract  description 
and  implementation  of  such  languages  is  sketched. 

Semantic  description  of  discrete.event  languages 

Discrete  event  languages  have  been  traditionally  presented  by 
means  of  a  series  of  examples.  When  needed,  a  summary  account  of 
the  inner  operation  of  the  simulation  mechanism  is  provided,  usually 
to  prevent  model  misbehavior  due  to  this  very  mechanism.  On  the 
other  hand,  people  planning  to  build  some  specific  purpose  (restricted 
domain)  simulator  usually  must  reason  in  very  primitive  terms,  for 
lack  of  a  proper  level  of  abstraction.  In  both  cases  what  is  missed  is  a 
semantic  context  to  describe  precisely  the  meaning  of  the  langage 
constructs.  Given  a  proper  semantic  framework,  the  rigorous  and 
complete  description  of  the  behavior  of  each  language  construct  could 
be  provided  or  designed  in  a  very  compact  form. 

The  category  of  system  specifications 

The  formal  setting  wc  are  thinking  of  is  based  upon  the  formal 
developments  of  Zeigler  [4}.  The  hierarchy  of  system  specifications 
originally  proposed  in  (41  can  be  assumed  as  the  domains  of  a 
semantic  category  which,  provides  the  meaning  of  discrete  event 
languages  The  structure  of  such  a  category  is  given  in  fig.  1.  The 
arrows  in  this  figure'represent  two  kinds  of  functions:  semantic  (sfj) 
and  behavior  (b,)  funtions.  Semantic  functions  provide  the 
interpretations  of  objects  in  a  domain -as  objects  in  a  lower  domain. 
Behavior  functions  map  any  other  domain  into  the  distinguished 
domain  DS,  which  specifies  the  collection  of  traces  and  statistics  that 
constitute  the  behavior  of  systems.  So,  behavior  functions  provide  the 
behavior  associated  with  systems  specifications.  The  hierarchy  of 
system  specifications  is  a  hierarchy  of  operational  abstractions  and  the 
category  of  system  specifications  is  the  formal  interpretation  of  those 
abstractions. 


figure  1  The  system  specification  category 


Objects  in  the  IOS  domain  correspond  to  Input/Output  systems 
whose  specification  includes  their  input,  state  and  output  spaces, 
along  with  the  oansition  and  output  functions.  The  behavior  function 
ho  of  this  domain  involves  the  iterative  application  of  those  functions. 

Objects  in  the  DEVS  domain  correspond  to  the  discrete  event 
specifications  originally  proposedby  Zeigler.  The  semantic  function 
sf|  involves  the  composition  of  the  internal  and  external  transition 
functions  with  the  time  advance  function  to  construct  the  transition 
function  of  the  equivalent  IOS  object  The  behavior  function  bjean  be 
computed  as  bQ°sfj. 

Objects  in  MDEVS  and  HDEVS  domains  correspond,  respectively, 
to  the  modular  and  hierchical  specifications.  Their  respective  semantic 
functions  sf2  and  sf3  provide  respectively  the  mechanisms  to  couple 


components  and  submode!s.  Again,  the  respective  behavior  functions 
can  oe  computed ’as  compositions  of  semantic  functions,  that  is. , 

>2  *bo0sfi°sf2 
b3  =bq0tfl0sfifsf3 

in  the  OB,  EF,  MEF  and  HEF  domains  correspond  to  the 
specification  of  experimental  conditions  at  the  various  levels  of  the 
hierarchy,  that  allowthe  selecuon of  subsets  of  the  complete  behavior 
of  the  system  (empty  experimental  frame).  The  semantic  functions 
apply  also  to  thou.  "  '  '  *'  *  ‘ 

Implementation  of, the  semantic  framework. 

A  concrete  specification  of  the  domains  and  functions  involved  int 
this  category  of  system, specifications  was  given  in  (1].  This 
specification  can  be  straightforwardly,  implemented  using  any  .object 
onented  programming  Ianguage.  In  fact,  it  has  been  implemented 
using' SIMULA, (21. .In  brief,  domains  are  implemented  as  classes 
whose  bodies  realize  the  semantic  functions  oyer  their  respective 
domains.  The  structure  of  arrows  in 'fig.  1  is  implemented,  then, 
through  the  class  inheritance  structure.  Finally,  the  mechanism  of 
virtual  function  definitions  are  used  to  allow  any  final  system 
transition  functions  to  be  implemented.  The  -  user  of  .  this 
implementation  of  the  semantic  framework  can  abstract  from  any 
operational  consideration  and  concentrate  on  the  specification  of  the 

{> articular  mechanics  of  his  language.Hedoes  so  specifying  his 
anguage  constructs  as  MDEVS  or  HDEVS  objects. 

Our  approach  diverges  from  the  implementation  of  the  DEVS 
formalism  reported  in  [5]  in  the  sense  that  the  semantic  framework  is 
not  intended  to  be  a  language  by  itself,  but  as  a  mean  to  conceptualize 
and  realize  more  specific  languages. 

Applications  of  the  semantic  framework 

The  framework  has  already  been  used  to  implement  a  subset  of 
RESQ  (3J,  including  us  hierarchical  features.  It  is  also  being  used  as 
teaching  support  for  a  simulation  subject.  The  students  use  it  to 
implement  a  simulator  for  generalized  stochatic  Petri  Nets,  Other 
languages,  dialects  of  GPSS  and  SLAM  (including  MHEX)  are 
under  analysis  and  semantic  specification  headed  towards  their 
implementation  upon  the  semantic  framework.  The  final  goal  is  to  get 
a  collection  of  languages  such' that  for  each  of  them  a  rigorous 
semantic  description  has  been  worked  out,  and  an  implemetation 
following  it  has  been  realized. 

Extensions  to  the  semantic  framework 

The  inclussion  whithin  the  formal  framework  of  concepts  relative 
to  the  graphical  presentation  of  the  model's  behavior  and  state  could 
be  of  great  interest  in  order  to  provide  the  semantics  for  languages  that 
include  animation  capabilities. 

Semantic  specifications  of  the  languages  must  be  independent  of 
the  concrete  computational  device  (sequential  or  distributed),  they  are 
operationally  independent.  Distributed  realization  of  the  semantics  of 
the  upper  semantic  domains  means  the  distributed  realization  of  the 
languages  mapped  into  them.  This  is  obviously  independent  of  the 
precise  distributed  strategy  (whether  conservative  or  optimistic).  We 
are  also  searching  in  this  direction. 
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Abstract:  A  sales  of  expairoects  have  been  performed  applying 
Bolmnann  Machines  (B)»pio  the  resofetioa  of  the  SarisSatiEty  (SAT) 
problem  in  the  propositional  calculus  setting,  which  shows  some 
interesting  features:  resolution  time  insensitivity  to  clause  type  and 
size,  and  linearity  against  the  number  of  proposmoos  involved. 

BM  and  the  SAT  problem 

BM's  area  class  of  recurrent  neural  compoting  mechanisms  that 
have  been  proposed  originally  in  [3J.  BM's  are  defined  by  giving  a 
set  of- logical  units,  concretions  between  them  and  the  strength 
associated  with  each  conixxmomComputmg  in  the  BM  is  performed 
through  search  of  the  max/mi n  consensus  configuration  by  the 
simulated  annealing  algorithm.  A  configuration  of  the  BM  is  a  map 
from  units  to  their  local  states.  The  consensus  function  gives  a 
consensus,  value  for  each  configuration  which  is  computed  as  the 
summation  of  the  strengths  associated  with  the  set  of  active 
connections  in  the  configuration.-A  connection  is  active  when  the 
states  of  all  the  units  connected  by  it  are  *ON".  A  special  kind  of  cnit. 
the  sigma-pi  unit  (introduced  in  [5])  serves  to  model  higher  order 
connections  between  more  than  two  units,  allowing  higher  order 
expressions  for  the  consensus  function.  A  broad  discussion  of  die 
application  of  BM's  to  optimization,  classification  and  learning  can  be 
found  in  flj. 

On  the  other  hand,  the  SAT  problem  is  a  well  known  problem  in 
computer  science  f2J.lt  can  be  stated  as  a  particularization  of  the 
more  general  MAX-SAT  problem  (finding  the  maximum  subset  of 
clauses  sarisfiable  simultaneously),  where  the  maximum  searched  is 
the  whole  set  of  clauses.  Under  this  interpretation  of  the  SAT  problem 
we  dared  to  apply  BM  to  its  resolution.  As  already  prosed  by  Pinkas 
14],  any  propositional  SAT  problem  can  be  mapped  into  a  recurrent 
neural  network.  He  also  showed '  hov,  sigma-pi  units  can  be 
introduced,  reducing  the  number  of  hidden  units  involved,  and 
increasing  the  order  of  the  energy  function  (the  consensus  function  in 
BM). 

Our  approach  has  been  a  straighforward  one.  The  clauses  have 
been  modelled  by  variable  order  (the  size  of  the  clause)  sigroa-pi 
units,  and  the  propositions  by  common  (order  two) units.  Wc  built  up 
a  BM  to  mode!  a  given  SAT  instance,  and  a  maximum  consensus 
configuration  is  searched.  The  SAT  instance  is  positively  answered 
when  a  configuration  in  which  all  the  clauses  are  satisfied  is  reached. 
As  a  side  product,  a  random  truth  assignment  that  satisfies  them  is 
obtained.  The  negative  answer  is  produced  when  a  number  of  trials 
(annealings)  have  been  performed  without  success. 


Experimental  results 

Through  the  experiments  reported  here,  a  quite  conservative 
annealing  schedule  has  been  used,  because  we  were  more  interested  in 
the  cualitative  features  of  the  approach,  and  it  was  our  intention  to 
factor  out  the  effects  of  poor  annealing  strategies.  This  accounts  for 
the  high  values  of  the  performance  metric  shown  in  the  figures.  The 
basic  performance  metnc  (time  in  the  figures)  is  the  number  of  trials 
in  the  annealing  that  reaches  a  satisfaction  configuration  for  a  given 
instance.  We  realized  the  experiments  over  sets  of  30  instances 
generated  randomly  along  some  parameters:  size  of  the  clause 
(Uniformly  distributed.2,S,6,7),  number  of  clauses  (NC)  and 
number  of  propositions  (NP). 

The  figures  include  95%  confidence  intervals  and  interpolation 
lines.  Figure  1  shows  that  for  clause  sizes  distributed  uniformely  up 
to  7  propositions  per  clause,  die  time  needed  to  reach  the  satisfaction 
configuration  grows  linearly  with  the  n,.mber  of  propositions 
involved,  figure  2  and  figure  3  show,  respectively,  that  the  time  of 
resolution  is  icsentitive  to  the  distribution  of  clause  sizes  and  the 
number  of  clauses. 


Figure  1  linear  behavior  of  time  versus  XP 
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figure  2  Independence  of  time  from  the  distribution  of  clause  sizes 


NC 

figure  3  Insensitivity  to  the  number  of  clauses 
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Abstract.  A  method  of  algorithm  development  for 
dynamical  systems  based  on  input  and  output  sig¬ 
nals  decomposition  with  respect  to  basis  in  the 
functional  Hilbert  space  is  proposed.'  Application  of 
the  projection  theorem  permits  the  advance  eva¬ 
luation  of  the  precision  error  and  computing  effi¬ 
ciency  of  the  algorithms.  A  library  of  basis  allows 
for  their  most  suitable  choice  for  a  particular 
system,  further- enhancing  the  algorithms  precision 
and  efficiency. 

I.  INTRODUCTION 

In  the  modelling  and- simulation  of  systems,  the 
first  arising  problem  is  a  good  comprehension  of 
their  qualitative  functionning  followed  by  the  quan¬ 
titative  description,  both  validated  with  respect  to 
some  criteria  in' the  reference  to  a  real  system.  As' 
a  result,  a  mathematical  model  is  obtained  which  is 
then  implemented  on  a  computer,  most  ofte  1  nowa¬ 
days,  a  microcomputer.  The  mathematical  model 
must  undergo  transformations  into  a  "best’  algo¬ 
rithm,  meeting  the  needs  and  expectations  of  a 
design  or  field  engineer. 

The  development  of  mathematical  models  still 
remains  a  scientific  art,  relying  on  accumulated 
knowledge  in  a  particular  domain.  The  development 
of  the  algorithms,  though,  may  be  put  into  a  more 
defined  and  generalized  framework.  The  purpose  of 
this  paper  is  to  propose  an  approach  which  appears 
promissing  in  both  the  efficiency  and  precision  of 
the  resulting  algorithms. 

The  mathematical  model,  in  order  to  yield  a  com¬ 
puter  algorithm,  must  undergo  some  mathematical 
processing  such  as  time  discretization,  approxi¬ 
mation  of  noniinearities,  etc.  Currently,  such  pro¬ 
cessing  relies  mostly  on  the  use  of  polynomials  and 
is  based  on  the  Weierstrass  approximation  theorem. 

It  hardly  allows  for  an  advance  precision 
evaluation,  neither  doos  it  assure  the  algorithms' 
convergence.  This  paper  introduces  higher  order 
approximations  by  functions  in  the  Hilbert  space,  in 
conjunction  with  the  projection  theorem  for 
evaluation  of  the  approximation  precision, 

II.  OPTIMAL  MEAN  SQUARE 

APPROXIMATION  USING  FUNCTION  BASIS 

The  projection  theorem,  (1,  2],  needs 
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reformulation  for  osir  purposes  into  a  more  compact 
matrix  form.  Real  systems  and  their  models  when 
used  in  design  or  simulation  are  subjected  to 
different  kind  of  input  signals.  They  are 
deterministic  for  testing  -  and  stochastic  -in  control 
applications:  the  implementation  of  models  on  ! 
computer -requires  . a  representation  of  these  signals.  ■ 
in  numerical  form,  meaning  their  approximation.  We 
shall  consider  approximations  by  a  set  of  linearly 
independent  functions  {9,(0}  in  the  n-dimensional 
Hilbert  space  L2(t,  ,t2)  with  the  inner  product 
defined  over  it  by 

?  ! 

(9,,9j)  -  J"  9,(0  9,(0  dt.  (1) 

’>  j 

The  L^  space  has  the.  induced  norm 
t2  L. 

111(011  -  (J  |f(0|2dt)2.  (2) 

1. 

Any  signal  x(t)  may  -then  be  approximated  by  a 
linear  combination  of  basis  {q>j} 

*(t)  -  Xa  o  (t)  .  aT9(t),  (3) 

1-1  ’  '  t 

where 

9T(I)  -  91  -  [9,  92  -  9n)  (4>  ; 

is  the  basis  vector,  and 

aT  - 1  at  a2  ...  an )  (5) 

is  the  vector  of  approximation  coefficients, 
suitably  chosen. 

The  approximation  (3)  describes  the  initial  signal 
x(t)  with  some  error 

e(t)  -  x(t)  -  x(t),  (6) 

the  value  of  which  may  be  estimated  by  the  norm 
(2)  applied  to  (6)  The  value  of  the  mean  square 
approximation  error  is 

l|e(t)|f  -  ||x(t)  -  aT9(0ll2  P) 

or 


when  properties  of  the  inner  product  are  taken  into 
account.  In  short  hand  matrix  notation,  (8)  is  given 
by 

lle||2  -  IM|2  +  (aT9,aT9)  -  (x,aT9),  (9) 

or  by 

l|e||2  -  llxll2  +  9TaaT9  -  x9Ta.  (10) 

In  the  approximation  formula  (3),  the  coefficients  a 


1 


may  bs  chosen  to  provide  the  best 

approximation  possible  in  the  basis  '9-  They  must 
minimize  the  relation  (9),  which  requires  that  the 
first  variation 

5(||e||2)  -  2(99r)a  -  2(x,9)  (11) 

vanishes,  it  occurs  when 

<J>a-B,  (12) 

where  the  square. nxn  matrix  <J>  has  the  entries  given 
by 

9,-j  -  (<P1.«P|)  03) 

and  the  column  nxl  vector  B,  by 

bj  -  (f,9l).  (14) 

The  second  variation  -of  '(9)  indicates  That  the 

solution  (12)  indeed  minimizes  the  approximation 
error  (6)  with  respecFio  its  mean  square  value. 

Under-  condition  (12)  the  minimal  mean  square 
error  is  then  found  to  be 

llblL-IMf-aW  (IS) 

It  steadily,  decreases  with  the  increase  of  the 
approximation  order,  i.  e.  with  the  increase  of  the 
dimension  of  the  coefficient  vector. 

Equation  (15)  may  also  be  developed  from  geome¬ 
tric  relations  between  the  approximation  error  and 
basis  in  the  Hilbert  space.  In-order  to  assure  the 
minimization  of  the  error,  these  vectors  should  be 
orthogonal 

(e.9)-0.  (16) 

This  is  called  the  projection  theorem,  [1],  and  it 
also  yields  the  results  given  by  (12)  and  (15). 

The  effects  of  the  projection  theorem  are  shown 
in  Fig.  below,  where  the  approximation  of  the  ramp 
function  in  term  of  exponential  basis  is  illustrated 


Fig.  Mean  squaro  error  for  a  unitary  ramp  signal 
approximation  by  the  exponential  basis  9Me'irand 
two-dimensional  coefficient  vector  with  the  optimal 
value,  (12),  a*[1. 19232,  -0.99576). 


Iff.  MODELLING  AND  SIMULATION  OF  DYNAMIC 
SYSTEMS  VIA  FUNCTION  APPROXIMATION 

For.  dynamic  systems,  input  and  output  signals  are 
decomposed  with  respect  to  the  same  set  of  basis. 
Both  signals  and  their  approximations  are  related 
one  to  another  through  the  mathematical  model  of  a 
given  system.  Consequently,  the’  chosen-basis  should 
-represent  a'complete  and  closed  set-with  respect  to 
mathematical  operators  of  the .  model.  For  linear 
systems,  these  operators  are  integrators  and 
multipliers  by  a  constant.-  Mathematical  models  of 
systems-will  be  reflected  by  relationships  between 
the  input  and  output  approximation  coefficient 
vectors  and  given  by  integration  matrices  and  the 
systems'  coefficients  only.  Such  important  features 
as  precision,  convergence  and  efficiency  of  the 
algorithms  could  then  be  evaluated  in  advance' from 
an  imposed  desired  value  of  the  approximation 
errors. 

A  library  of  different  types  of  basis  may  be  const¬ 
ructed  allowing  for  the  most  suitable  choice  for  a 
particular  problem.  This  further  and  significantly 
enhances  the  efficiency  of  the  algorithms.  Indeed, 
it  has  been  noted  in  litterature,  [3),  the -many  fold 
reduction  of  computing  time  in  certain  situations. 
Simultaneously,  the  algorithms'  precision  may  also 
be  increased  since  the  approximation  error  (7) 
depends  upon  the  choice  of  basis. 

IV.CONCLUSIONS 

The  proposed  approach  of  the  algorithms'  const¬ 
ruction  using  function  approximations  offers  good 
potentials  for  the  increase  of  their  precision  and 
efficiency.  Function  basis,  especially  those  with 
orthogonal  or,  better  still,  orthonormal  properties, 
provide  proniissing  new  mathematical  tools  for  the 
computer  modelling  and  simulation  of  systems. 
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ON  THE  OPTIMAL  ALGORITHM  OF  SIMULATION  MODEL  OF  DEDS 
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Abstract-*?'!  his  paper"  tries  to -explore  the  approach  to 
solve  optimization  of  DED Stimulation  model  and  proposes  a 
heuristic  algorithm  that  is  specifically  designed  lothcDEDS 
simulation.  Its  bask  idea  is  to  approximate  the  optimal  point > 
gradually  in  the  processes  of  Iteration  using  a  set  consisted  of 
several  points  instead  6  f  using  -  one  point  as  in  conventional 
optimal  techniques.  Obviously,  such  set  'can  filter  more 
stochastic  errors  caused  by. the  simulation  itself  than  one  point 
can.  Finally  the -.paper  - testifies  the  algorithm  on  a  typical 
'  DEDS  exdnipis  and  the  results  show  very  satisfactory. 

Keywords::  Discrete  Eycnt.Systcm  Simulation, 
Optimization  .of  ‘System, Set.  Approximating 

Algorithm(SAA) 

(.Introduction 

Discrete’ E\enf  Dynamic  Systems  (DEDS)is  a  kind  of 
complicated  systems.  Such  system's  a  $  computer  networks, 
transported  manufactory  systems  are. vivid  DEDS  exam¬ 
ples  In  general  Analytical  method  and  Simulation  method 
are  two  main  methods  to  study  DEDS(Referencc  1 )  Here 
only  Simulation  method  is  discussed. 

"  At  first  the  optimal  problem  of  DEDS  simulation  dis¬ 
cussed  in  this  paper  is  described.  For  simplicity,  the  optimal 
proble  m  is  deft  n'ed  as: 

min  f(0)  (*  ) 

Here  f(0)is  the  object  function  of  the  simulation  model 
of.m-dimension  decision  variable  0=»( 0,,  •••  ,0„  )  It  is  on¬ 
ly,  after  running  the  simulation  model  in  computer  that  the 
value  of  f(0)can  be  obtained.  The  object  of  problem  (•?  )i$ 
to  select  suitable  parameters  0'and  reach  the  minimum  of 
fffiT)  Problem  (*  )i$  the  simpIcstTorm  of  all  optimal  prob¬ 
lems.  As  for  other  forms  of  optimal  problems  of  DEDS  such 
as  the  problems  with  constrained  conditions,  further  re¬ 
search  is  needed  although  the  principle  of  the  algorithm  dis¬ 
cussed  here  will  be  useful. 

Up  to  now,  there  arc  no  any  mature  methods  which  can 
manipulate  optimal  problem  ( *  )  easily.  In  general, several 
reasons  listed  below  hinder  the  development  of  optimal 
techniques  about  DEDS  simulation. 

(1) .  The  obfict  fQ)  usually  cannot  be  manifested 
analytically  and  even  no  conslce  and  beautlfiil  models  to  des¬ 
cribe  DEDS  now.  So  it  is  difficult  to  study  DEDS  theoretically 
and  there  are  almost  no  any  general  conclusions  on  DEDS 
simulation. 

(2) .  Because  the  valves  of  fid)  obtained  are  simulation 
results  of  some  random  variable  Junctions  and  which  errors 
cannot  be  neglected.  It  Is  not  probable  to  use  conventional 
non-linear  programming  techniques  on  DEDS 
directly  (Reference  4J). 

(3) .  It  is  rather  difficult  to  compute  the  gradient  values  of 
fiO)  in  DEDS  simulation.  In  one  hand  a  large  number  of 
Sfnulatlon  time  Is  needed  (especially  for  complicated  multh  va¬ 
riable  systems)  and  on  the  other,  hand  the  values  of  gradient 
got  in  simulation  usually  have  big  errors.  So  the  information 
about  gradients  which  plays  an  important  role  in  conventional 
optimal  process  cannot  be  simply  used  in  DEDS  simulation 
areas. 

In  spite  of  the  obstacles  mentioned  above,  because  of 
the  importance  of  the  optimal  techniques  of  DEDS 
simulation,  many  researchers  have  already  done  a  lot  ol 
work  and  made  much  progress  although  it  still  has  very  long 
way  to  go  to  be  able  to  call  them  practical! Reference 
2,3, 6 ,7  \ 


2.  Set  Approximating .  Aigorllhm(SA  A) 

This  section  describe  a  new  heunsuc  optimal  algorithm 
about  DEDS  simulation. 

At  first  a  new  concept  is  put  forward  which  is  in  fact  a 
set  denoted  by  .  Here  f  stands  for  object  function  f(0) 
and  1  for  the  number  of  elements  in  the  set  The  set  is 
consisted  of  I  values  >f(0,V**,f(£D  of  object  function  on  1 
points  0, 0{ , 

Intuitively  a  big  and  heavy  /novmg  object  cannot  be  in¬ 
fluenced  easily  by  random'  factors  in  environment  and  in  the 
same  reason  that  a  set  consisted  of-several  values  can  filter 
more  stochastic  errors  caused  by, simulation  itselfthan  single 
value. can  and  thus  corresponding  algorithm  will’ to  the 
optimal  point  more  accufatly  and  smoothly  than  others,  do. 
This  is  just  the  heuristic  principle  of  the  algorithm  proposed 
below  and  so  this  algorithm  is  called  Set 
Approximatting(SA)  algorithm. 

Until  now  there  is  no  any  general  conclusion  about  how 
to  select  1.  In  general,  the  bigger  I  is,  the  more  accurate  the 
optimal  solution  is  and  correspondingly  the  more  simulation 
lime  will  cost  Here  exists  a  trade-off  between  the  accurate 
and  the  time.  This  problem  is  certainly  important  to ’the 
optimal  theory  of  DEDS  simulation  and  worthy  careful  ex¬ 
ploring.  Here  temporiary  let  l«m+l,  arid  m  is  the  dimension 
of  decision  variable. 

Below  SA  algorithm  is  described  in  detail  and  at  first 
some  denotations  must  be  defined: 

(2.1) 

max^>»“maxff<01y,*>’f<$  (2.2  )• 

d$tj>i*max$<f)~mm<|tf)  (2.3) 

And  then  defining  6  rclatived  to  minfuj)  as  and 
max  fip  as  0^  0«**here  is  also  called  the  'worst'  point 
in  ,  It  will  be  replaced  by  better  one  according  to  the 
algorithm  The  improvement  of  the  relation  between  the  set 
and  the  optimal  point  will  guarantee  the  algorithm  to  reach 
the  optimal  point- gradually.  The  value  of4^(f)  isas  the 
stopping  criterion  of  the  algorithm.  When  its  value  is  very 
small  that  also  means  that  the  value  of  max  nears 
closely,  at  this  time  it  can  be  said  that  the 
algorithm  has  already  gotten  satisfactory  results. 

SA  algorithm  can  be  described  as  below: 

1) .  Initialize  the  set  <Jm4>  and  Idle*!; 

2) .  Let  >J ; 

3) .  If  d  <£‘*W  )<€ ,  then  stop  else  goto  2). 

(end  of  algorithm) 

In  the  algorithm,  e  is  a  positive  constant,  tf  is  the  map 
from  R‘  to  Rv .  This  map  should  guarantee  the  behavior  of 
S’Tpis  better  than  that  of  $<f> ,  That  is  just  the  condition  be¬ 
low;  1  1 

min^Vj^min^ty  (2.4) 

or  (2.5) 

In  general  the  key  to  construct  one  succcsful  algorithm 
is  to  construct  suitable  map  that  will  be  discussed  below. 

The  most  important  element  in  map  of  SA  algorithm  is 
how  to  get  a  new  point  from  previous  sets.  Recently  a  new 
approach  proposed  by  Y.C.Ho  et  al.  put  forward  a  conve¬ 
nient  and  face-to-simulation  way  that  is  called  Perturbation 
Analysis  (PA)method.  Its  main  advantage  is  to  be  able  to  get 
the  gradient  of  Performance  Measure  of  the  system  with  re¬ 
spect  to  parameter  using  only  one  Monte-Carlo  experi¬ 
ment,  Although  PA  method  is  not  well-developed  now  and 
still  needs  a  lot  of  work  to  do,  the  SA  algorithm  proposed 
here  will  use  PA  method  to  look  for  the  search  direction  to 
the  new  point  (Reference  8,9,10) 
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Map  5  is  constructed  as  be!ov/:_  # 

At  first  obtaining  a'  new  point  ©using  PA  .method,  then 
getting  f(0'  )by  simu'attori.  This  time  there  will  occur  one  of 
three  cases  listed  below: 

*T)  f(©')<minf(f)  (2,6) 

2)mihfrfjl)<f(e')<max  (2.7) 

3  )f(  6*  )>  max  4* <j>  (2.8 ) 

SA  algorithm  will  replace  6U,(wor$t-  point)  with  ©'• 
(better  one)if  case  1  )or  2) occurs' and  other  components' r£ 
main,  constant,  is  constructed  and  then  obviously ‘if 
will  satisfy  condition  (2.4  >r(2.5)  ‘ 

But  as, the  value  of  ft©')" got  ih  simulation  unavoidable 
has  error,  it  maybe 'cause  the  failure  of  the  algorithm.  For 
example,  the  simulation  value, of  f(8')i$  probably  much  less 
than  real  counterpart  and  maybe,  even -less -than  accurate 
optimal  solution  of  f(0‘)  If  the  occurence  of  such  cases  in 
Emulation  is  more  than  1,  it  is  probably  that  all  . elements  m 
setfty  ’are  less  than  f{0*)  after  Nthlterattori.  'A?  this  time" 
onlyvcase(2  8 ) is  occuredjand  the  algorithm  cannot  proceed¬ 
in' Tact  to  the  optimal.- point., So  the, real  optimal  solution 
cannot  be  obtained;  In  the  operation  of  SA  algorithm,  more 
simulation  length  and  time  are  used  to  decrease  the 
occurence  of  such  poor  cases  and  thus  the  probabhty  of  the 
failure  of  the  algorithm  is  much  less.  _ 

The  stopping  criterion, cHosed  in  SA'  algorithm  is  also 
suitable  to  the  simulation,"  which  can  decrease  the  effect  of 
random  errors  inherently  with, the  simulation, 

3.Ca<e  study 


Trie  effect  of,' S A  algorithm  is  testified,  on  a  typical 
DEDS(i.c  an  assembly  system  )  . 

The  assembly  system  ^expressed  in  figure  be  m: 


Assumming  the  number  of  workstations  in  system  is 
two,  the.  service  times  of  workstations  arc  subject  to 
exponential  distribution  and  the  mean  service  rime  arc 
respevrivelyftand©,,  the  number  of  pallets  in  the  system  is 
eight.  And  there  is  a  buffer  behind  the  workstation  !  which 
item  is  two.  In  any  time,  the  number  of  parts  processed  in 
system  is  constant- 8) 

The  objict  function  of  the,  system  is  defined  as: 
r'«c,(T/'N)^(c,/9i ) 

Here  T»T(S,&)i$  the  lenghth  of  the  simulation  time  at 
each  iteration,  N  is  the  number  of  parts  processed  In  the  pe¬ 
riod  of  time  T,  c.,  c„  c^rc  constant  numbers  relative  to  the 
cost  of  the  system,  hem  let  c,»  l  and  c. »  c*«  c, 

T/N  in  the  object  function  is  the  average  processing 
time  of  each  part  through  the.assembly  system.  It  is  required 
that  the  less  the  better.  And  the  costs  of  the  system 

itselffFor  simplify  only  the  costs  of  workstations  are  consid¬ 
ered* -also  the  less  the  better  obviously.  So  there  exists  a 
trade-off  between  the  effiency  and  cost.  For  example,  when 
0,  (i  •*  1,2  )is  smaller  then  T/  N  is  also  smaller  but  this  rime 
more  cost  is  required  and  vice-versa.  The  aim  of  system 
optimization  is  then  to  chose  suitable  parameter 
6f  and  minimize  the  object  function  f. 

Because  the  system  above  has  no  analytical  optimal  so¬ 
lution,  here  one  method  which  has  alredy  experimented 
successfully  by  Professor  R.Sury  et  al  (Sec  Reference  13,14) 
is  applied  to  compare  with  the -results  obtained  ' by  SA 
algorithm,  this  method  is  called  method  3  here. 

By  using- different  sample  of  random  series- the  SA 
algorithm  can  be  divided  into  twosuber-methods.  Method  1 
uses  a  variance  reduction  tecfiniqye(CRN)  So  the  same 


pseudo-random  number  generators  are  called  and  the  same 
pseudo-random  number  series  are  used.  So  there  are  enough 
reasons  to  believe  that  the  difference  of  any  obscvable  data  is 
caused  by;the,difference  of  system  design  instead  of  expert- 
merital'Condition.;Methqd*2,try  to  dynamically  update  the 
-system  parameters  in  the  process  or  simulation.  So  the  ran¬ 
dom,  series  used  in  each  iteration  is  one  section  of  a  long 
stochastic  .series.  This  method  is  especially  suitable  for 
on-line  simulation. 

In  'experiment,  the-  simulation  model  is  written  in 
GPSS-F  simulation  language  and  run  in 
IBM-PC  /  AT(Refcrence  11,12) 

After  analyzing -the'  results  of  the  experiments,  it  is 
showed  that  the  results  obtained  by  SA  algorithm  are  nearer 
the  optimal  point  than  those  by  method  3;  Moreover  from 
the  experimentes,  it  is  showed  that  in  the  neighbor  areas  of 
the  optima!  points,  the  optimal  processes  using  method  3  are 
badly  vibrate  but  those  using  SAA  can  approximate  optiaml 
points  smoothly. 

-4. Conclusion  and  Future  Developments 

This  paper  proposed  a  new  heuristic  algorithm  which  is 
specifically  designed'to  DEDS  simulation.  The  algorithm  is 
called  Set  Approximatting  (SA)algorithm.Tt  is  revealed  that 
this  method  can  get- better  optimal  results  than  some  other 
successful  algori  thms.  “  ‘ 

In  the  future,  some  further, research  work  *$  .  needed 
such -as  to  theorize  the  S A  algorithm  and  improve  the 
algorithm  itself' and  apply  the  SA  algorithm  to  more  real 
complicated  DEDS  and  so  on. 
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Abstract 

This  paper  presents  the  conceptual  design  of  an  intelligent  interface  to 
numerical  simulation,  which  deals  with  the  modelling  of  problems 
described  bys partial  differential  equations, (PDEs),  The  complete 
modelling  process  from  a  given  real  world  problem;  to  a  simulation 
code  is  analysed  and  three  intermediate-models  are  extracted.  These 
models, correspond' to  engineering,"  mathematical  and  numerical 
representations  of  the  problem.  The  system'  structure  incorporates  an 
entry  level  for  each  model  and  provides  transformations  between 
them  This  allows  users  to  specify  the  problem  in  the  terminology  of 
Mhe.model  most  suitable  to  their  own  expertise.  A  frame  based 
approach  to  knowledge  representation  reflects  the  hierarchical  nature 
of  domain  knowledge.  Concepts  which  form  the  building  blocks  of 
each  model  are  stored  as  frames  m  three  knowledge  bases.  Model 
transformations  arc  achieved  by  accessing  localised  rule  bases  bound 
to  each  frame. 


I.- Introduction 


Numerical  simulation  has  become  a  very  important  and  powerful 
technique  for  scientific  experimentation  and  design.  As  well  as 
numerical  analysts,  many  different  kinds  of  people,  including  design 
engineers,  experimental  physicists  and  applied  mathematicians  can 
find  use  for  it  in  their  everyday  work.  With  the  increasing  availability 
of  powerful  computer  resources,  there  is  a  genuine  need  to  provide 
numerical  simulation' software  toots  which  can  facilitate  these 
potential  users. 

To  design  Such  a  tool,  the  issue  of  model  comprehension  must  be 
addressed.  It  is  essential  to  cater  for  how  users  think  about  and 
express  their  problems.  Existing  numerical  simulation  packages  such 
as  DEQSOL  JUmetani  eial .  1985,  Korfno  etal  1986),  ELLPACK 
(Rice  1985J  and  FIDISOL  [Schonauer.and  Schncpf,  1987),  have 
greatly  reduced  the  effort  demanded  by  numerical  simulation, 
compared  with  direct  coding  in  FORTRAN  or  C,  by  providing  high* 
level  programming  languages  or  .drivers  of  suoroutine  libraries. 
However  ’^without  the  prerequisite  knowledge  from  the  mathematical 
and  numerical  analysis  domains,  the  use  of  these  software  tools  can 
be  formidably  difficult  In  other  words,  there  is  still  a  considerable 
conceptual  gap  between  the  input  level  of  these  packages  and  the 
understanding  of  a  large  proportion  of  potential  users.  Also,  while 
offenng  quite  a  lot  of  flexibility  to  the  knowledgeable  usei,  they  fail 
to  provide  sufficient  guidance  to  the  non-expert  towards  finding  the 
best  model,  from  a  potentially  large  space  of  possibilities. 

Several  research  projects  are  investigating  the  application  of  AI 
techniques  to  the  area  of  numerical  simulation.  The  Numerical 
Algorithms  Group  (NAG)  [Chelsom  ei  al ,  1990]  is  developing 
knowledge-assisted  numerical  routine  selection  tools  fur  the  diverse 
NAG  FORTRAN  library.  The  EVE  system  is  aimed  at 
mathematicians  (Barras  ei  al.  1990]  and  enables  users  to  create  PDEs 
from  pre-defined  primitive  mathematical  components.  Another  system 
which  makes  use  of  PDEs  as  the  interface  level  was  reported  by 
(Russo  et  al  1987],  in  which  some  of  the  numerical  stability  and 
efficiency  constraints  are  taken  into  account.  These  projects  mainly 
aim  at  users  who  are  knowledgeable  enough  to  make  the  right  choices 


during  the  decision  process  or  who  are  to  a  certain  extent  familiar 
with  mathematical  expression.  However,  little  work  has  been  done  to 
establish' a  complete  modelling  framework,  which' incorporates  the 
comprehension  levels  of  all  potential  users,  from  engineers  with  no 
numerical  or  mathematical  knowledge,  through  to  numerical  analysis 
specialists/ 

This  paper  addresses  the  conceptual  design  of  a  sophisticated  tool  for 
numerical  modelling  Specifically,  it  is  concerned  with'  the  modelling 
of  problems  which  can  be'  described  mathematically  by  a  set  of 
PDEs.  These  include  heat  transfer,  fluid  dynamics,  structural 
analysis  and  electric  field  analysis  problem's.  The  complete  modelling 
process,  from  a  given  real  world  problem  to  the  final  simulation  code, 
is  examined,  and  three  intermediate  conceptual  models  are  isolated' 
Each  model  can  be  considered  as  a  valid  description  of  the  problem 
from  the  perspective  of  the  engineer/ the  mathematician  or  the 
numerical  analyst.  A  complete  modelling  tool  should  incorporate 
each  model  as  a  possible  input  level.  The  paper  discusses  how  each 
model  can  be  represented  in  a  simulation  modelling  system,  and  how 
transformations  between  models  can  provide  expert  guidance  to  the 
user  through  the  intermediate  models  to  the  final  code.  Thus,  a 
modelling  tool  which  reduces  conceptual  distance  and  guides  the 
search  for  the  best  model  is  proposed. 

The  remainder  of  this  paper  is  outlined  as  follows. 

Section  2  discusses  the  conceptual  models  between  a  real-world 
problem  and  simulation  code,  and  based  on  this,  an  overall  system 
architecture  is  outlined  A  knowledge  ontology  by  which  the 
conceptual  models  can  be  realised  ^described  in  Section  3.  Section 
4  discusses  inference  and  model  transformation  and  finally,  the 
current  status  of  a  prototype  system  which  implements  the  above 
proposals  is  reported. 


2.  Solution  .Perspective -for.,  Numerical. Simulation 

2.1  User  Group  Classification 

Potentially,  numerical  simulation  may  be  of  intei  est  to  three  different 
user  groups,  numerical  analysts,  mathematicians  and  engineers.  Each 
of  these  groups  has  different  motivations  for  using  simulation,  has 
different  expertise  levels  and  has  different  requirements  of  the 
simulation  results.  Design  ol  any  potential  modelling  environment  for 
numerical  simulation  therefore,  should  place  emphases  on  integrating 
these  various  perspectives  and  trying  to  develop  a  system  that  allows 
exploitation  by  all  user  groups. 

2,2-Us.ci.CiQAtp.RgquiKmcms, 

The  primary  concern  of  the  numerical  analyst  is  the  development  of 
strong  numerical  algorithms  to  deal  with  difficult  numerical  propemes 
of  the  system  of  equations  being  analysed,  A  modelling  tool  for  this 
type  of  user  should  remove  the  arduous  task  of  coding,  while  still 
offering  the  flexibility  to  choose  or  create  algorithms  and  modity 
important  parameters.  Mathematicians  are  interested  mainly  in  the 
properties  of  the  equations  themselves,  and  may  not  be  familial  wuh 
the  techniques  used  to  simulate  the  solution.  These  users  will  expect 
flexibility  in  specifying  mathematical  problems  Engmeeis  and 
scientists  are  concerned  with  the  modelling  of  physical  problems 
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Figure  1  Modelling  Perspectives  of  Real  World  Problems 


bui  may  not  be  aware  of  the  underlying  mathematical  formulation. 
Thus  there  are  three  conceptual  levels  of  understanding, 
corresponding  to  the  levels  of  expertise  of  the  different  user  groups, 
A  real  world  problem  may  be  modelled  from  an  engineering, 
mathematical  or  numencal  perspective  (see  Figure  1)  In  fact,  these 
perspectives  represent  the  stages  through  which  that  problem  must  be 
transformed,  before  it  can  be  simulated  on  computer, 
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Figure  2  System  Architecture 


may  be  defined  as  slot  variables  within  each  frame  or  more  generally 
as  functions  of  slot  variables.  Moreover,  structural  information  is 
stored  through  hierarchical  links  between  these  frames. 


Three  types  of  links  arc  found  to  be  of  use  in  the  knowledge  bases, 
namely,  taxonomic  links,  component  links  and  possible  component 
links.  The  latter  two  arc  different  types  of  substructure  links. 


2.3  A  modelling  environment  for  numerical  simulation 

Figure  2  illustrates  a  practical  proposal  of  a  system  architecture  which 
provides  a  modelling  environment  suitable  for  numencal  analysts, 
mathematicians  and  engineer;.  The  system  structure  is  based  on  the 
conceptual  levels  outlined  above.  Three  representations  are  allowed, 
referred  to  as  a  physical  model,  a  mathematical  model ,  and  a 
numerical  model.  If  the  user  is  a  numerical  analyst,  the  input  is  a 
numencal  model  in  the  form  of  a  set  of  numerical  algorithms  to  be 
solved  by  a  numerical  simulation  engine,  If  thevu$cr  is  a 
mathemauciari;  the  input  consists  of  a  mathematical  representation  of  a 
PDE  based  problem.  The  system  has  an  incorporated  knowledge 
base  (KB)  of  numerical  expertise  which  automatically  transforms  this 
mathematical  model  representation  to  a  suitable  numerical  model 
capable  of  being  solved  by  the  numerical  simulation  engine.  If  the 
user  is  an  engineer,  the  input  consists  of  a  physical  model 
representation  which  must  oe  brought  through  a  two  stage 
transformation  using  the  expertise  of  both  mathematicians  and 
numerical  analysts  in  two  knowledge  bases, 

3.  Knowledge  Representation 

This  section  presents  a  knowledge  ontology  by  which  the  conceptual 
models  of  the  previous  section  arc  described.  The  issue  is  to  find  an 
appropriate  internal  representation  of  domain  knowledge,  so  that 
models  entered  by  the  user  can  be  checked  for  consistency  and 
transformations  to  lower  level  models  can  be  obtained.  A  frame* 
based  approach  to  knowledge  representation  provides  the  most 
natural  solution  |Tello  1990].  Three  knowledge  bases,  the  Physical 
KB,  Mathematical  KB  and  Numencal  KB,  hold  a  representation  of 
the  problem  domain  from  the  perspective  of  the  corresponding  model. 
Conceptual  entities  which  form  the  basic  building  blocks  of  any 
particular  model  arc  stored  as  frames  in  each  knowledge  base  Each 
enuty  has  a  number  of  attnbutes  associated  with  iu  These 


0)  Taxonomic.  links 

Domain  concepts  fall  naturally  into  categones  or  classes.  For 
example,  while  there  arc  many  types  of  PDEs,  all  PDEs  can  be 
expected  to  share  some  common  features.  It  makes  sense  to  define  a 
superclass  of  PDEs  and  allow  all  PDEs  to  inherit  common 
information  from  this  superclass.  Taxonomic  links,  therefore,  allow, 
for  an  efficient  storing  of  information. 

(»1  Component  links 

Component  links  express  the  usual  component  relationships,  for 
example,  a  region  must  always  consist  of  a  number  of  boundaries 

(iii)  Possible  Component  links 

Possible  component  links  express  that  it  makes  sense,  m  a  particular 
model,  for  a  certain  entity  to  be  a  component  of  another  entity  For 
example,  a  source  term,  may  be  a  component  of  a  heat  equation,  but 
this  is  not  always  the  case,  and  depends  on  the  particular  problem 
being  modelled.  These  links  play  an  essential  role  in  inference,  by 
restricting  the  search  space  dunng  model  transformation. 

12 The  Knowledge  Bases 

Each  knowledge  base  is  now  described  in  more  detail 

(»)  Physical  KB 

Figure  3  shows  pan  ol  the  hierarchy  of  the  Physical  KB  Frames 
represent  engmeei  ing  concepts  and  terminology  There  are  five  main 
classes,  namely,  physical  legion,  physical  boundary ,  phy^t^al 
boundary  fiondawn,  physical  phenomenon  and  geometry  The 
boundary  condition  fiamc*  correspond  to  engineering  descriptions  of 
boundary  conditions,  for  example,  inflow  or  outflow  in  fluid  flow 
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Figured  .PhysicallKB* 


problems,  or  fixed  temperature  in  heat  transfer  problems.  The 
physical  region  frame  contains  a  material  slot  corresponding  to  thc 
material  of  which  the  region  is  composed.  Th  t  physical  phenomenon 
frame  contains  a  slot  for  initial  conditions 


All  information  corresponding  to  the  representation  and  structure  of 
PDE  problems  is  contained  in  this  KB.  Classes  include  PDE  group, 
PDE,  Term  and  Variable .  Many  physical  phenomena,  such  as  fluid 
flow  or  structural  analysis,  can  be  described  by  a  particular  set  of 
PDEs.  These  sets  are  stored  as  subclasses  of  PDE  group. 
Substructure  links,  to  PDEs,  terms  and  variables,  follow  in  the 
natural  way.  It  should  be  noted  that  the  substructure  links  are 
possible  component  links.  Depending  on  the  particular  problem, 
some,  but  not  necessarily  all  of  these  links,  will  be  instantiated  in  the 
actual  mathematical  model,  e  g.  time' terms  are  included  only  if  the 
problem  is  transient.  Figure  4  shows  part  of  the  KB  corresponding 
to  the  Navier  Stokes  equation  group. 


Figure  4  Mathematical  KB 


f»»)  Numerical, KB 

Frames  in  the  Numerical  KB  represent  the  basic  numerical  algorithms 
from  which  a  simulation  code  can  be  generated.  Note  that  these  arc 
not  the  algorithms  themselves,  but  rather  representations  which  hold 
essential  mfotmation  about  their  selection  and  use.  The  discretisation 
technique  may  be  fixed  as.  say,  the  Finite  Element  Method,  although, 
if  the  simulation  engine  can  cope  with  different  techniques,  (hen  the 
knowledge  base  should  be  expanded  to  include  these.  Algorithms  are 
organised  into  classes  according  10  iheir  purpose  in  the  simuiauon 
code.  Thus,  there  is  a  class  of  algorithms  to  deal  with  time* 


Figure  5  Numerical  KB 


dependence,  a  class  to  deal  with  non-lmeanty  and  a  class  of  inversion 
algonthms  which  solve  the  decreased  equations.  A  numerical 
scheme,  that  is,  a  complete  numerical  method  to  solve  a  particular 
PDE  problem,  may  possibly  consist  of  algonthms  from  each  of  these 
classes  (see  Figure  5). 

To  generate  a  model  corresponding  to  a  particular  problem,  the 
appropnate  frames  arc  chosen  from  the  coriesponding  KB,  arid 
instantiated  as  components  of  the  model.  Slots  in  the  frame  must  be 
filled  according  to  problem  data.  In  the  next  section,  the  process  of 
transformation  from  a  given  model  input  to  other  lower  level  models, 
is  discussed; 


4,I.LpgaUscd.Rulc.Bascs 

In  order  to  obtain  a  lower  level  model  from  a  high  level  model,  tt  is 
necessary  to  provide  bridges  between  concepts  in  different  knowledge 
bases.  These  bndges  arc  realised  through  localised  rule  bases  bound 
to  each  frame  definition  in  each  knowledge  base.  The  rule  base 
contains  the  conditions  under  which  the  frame  should  be  instantiated 
as  part  of  the  model.  Antecedents  of  each  rule  relate  to  characteristics 
of  the  previous  model.  For  example,  the  rule  base  bound  to  the 
advecnon  term  in  the  Mathematical  KB  may  contain  the  rule 
IF  peclet  number  is  not  negligible  THEN  select  advection  term. 
The  peclet  number  is  a  function  of  attributes  of  the  physical  region 
such- as  material  and  dimensions.  Functions  to  calculate  such 
characteristic  values  are  bound  to  the  frame  definitions  in  the 
knowledge  bases, 


The  primary  goal  during  model  transformation  is  to  find  a  lower  level 
representation  of  the  given  high  level  model.  Substructure  links  in  the 
knowledge  base  corresponding  to  the  lower  level  model,  provide 
subgoals  to  the  primary  goal.  For  example,  one  subgoal  in  the 
physical  to  mathematical  model  transformation  is  to  generate  a  PDE. 
To  achieve  this,  »he  further  subgoal  of  generating  the  PDE's 
component  terms  anscs.  The  possible  component  links  restrict  the 
space  of  possibilities  for  this  subgoal.  By  evaluating  the  rules  in  the 
rule  base  bound  to  each  possible  term,  the  selected  terms  are 
instantiated  in  the  mathematical  model  and  component  links  are 
established  between  these  and  the  parent  PDE. 


In  genera*  there  is  a  one  to:one  correspondence  between  physical 
model  concepts  and  mathematical  model  concepts  e  g  one  PDE  group 
describes  one  particular  physical  phenomenon  Generating  the 
mathematical  model  then  follows  the  inference  method  outlined 
above.  It  is  also  necessary  to  link  material  names  from  the  physical 
model  with  numerical  values  of  constants  m  the  mathematical  model 


A, material  properties .  database  provides  the  primary  relationship 
between  these  descriptive  engineering  material  terms  (e.g .  copper, 
air)  and  the  precise  numerical  property;  values  (e.g.,  copper 
conductivity  400 ,W/mK).  . 

4.4  Mathematical  to  NumericalModel  transformation 

In  the  mathematical  to  numerical  model  transformation;  dedsionsare 
closely  inter-related  and  cannot  be  made  in  a  simple  serial  fashion. 
For  example,  if  the  inversion  algorithm  is  always  chosen  before  the 
rime. algorithm,  and  the  time  algorithm's  always  choren  bcforc’the 
non-linear  algorithm,  it  may  beTmpossible  to  decide' the' best' 
combination  of  algorithms.  To  cope.wjth  this.difficulty,  the  inference 
mechanism  is  allowed  to  leave  a  number  of  possibilities  at  each 
decision  stage  Thus  a  solution  space  of  all  reasonable  algorithm 
combinations  is  generated.  In  this  context,  the  local  rules  serve. to 
eliminate  bad  selections,  rather  than  select  the  best  candidate.  Each; 
branch  of  the  solution  space  represents  one.  possible  numerical 
scheme.'  v 

An  appropriate  spatial  mesh  and  possibly.a  time  step  must  also  be 
derived.  ,Onc  approach  is. tq. select  the  spatial  mesh  from  a  fixed, 
family,  parameterized  by  a  variable  representing  the  mesh  size.  The 
issue  of  selecting  mesh  and  time  step  then  becomes'  one  of 
optimization,  choosing  the  best  mesh  size  and  time  step  according  to 
accuracy,  stability  and  efficiency, entena.  Each  solution  branch  is. 
rated  according  to  these  criteria,  but  the  final  choice  is  left  to  the  user. 

4.5  Creation  of  Numerical  Simulation  Code 

A  numerical  model  contains  sufficient  information  to  generate  the 
simulation  code.  Local  code  generation  procedures  arc  bound  to  each 
algorithm  frame  m  the  Numerical  KB,  and  a  global  procedure  controls 
how  each  part  is  entered  into  a  code  template.  These  procedures  can 
be  tailored  depending  on  the  particular  simulauon  code  required.  -  In  a 
prototype  system,  the  DEQSOL  simulation  code  is  generated.  This 
onlyrequires  manipulation  of  the  mathematical  representation  of  the 
problem,' for  example;  the  EXPLICIT  algorithm  frame  contains  a 
procedure  which  replaces  the  object  variable  in  each  differential 
operator  with  the  variable  which  holds  the  old  value  (e.g.  replace 
div[k  grad  U)  with  div(k  grad  Uq]  ). 


^.Implementation  and.Currfiat.Stalus 

A  first  prototype  of  the  proposed  system  has  been  implemented  on  a 
Macintosh  using  Object  Lisp,  an  object-oriented  extension  to 
Common  Lisp.  -The  final  output  of  this  system  is  a  Finite  Element 
simulation  code  in  the  syntax  of  DEQSOL.  Currently  only  a  physical 
model  enuy  level  has  been  implemented  and  the  usei  enters  the  model 
through  a  graphical  interface. 

The  knowledge  bases  are '-limited:  to  steady  state  and  transient 
problems  in  the  heat  transfer  and  fluid  domains,  with  several  typical 
algorithms  provided  Only  two-dimensional  geometries,  with  straight 
line  boundaries  are  allowed. 


6.  Conclusions 

The  complete  modelling  process  involved  in  transforming  a  real  world 
problem  into  a  form  suitable" for  numerical  simulation,  has  been 
analysed  and  on  this  basts), three  conceptual  models  (physical 
mathematical  and  numerical  models),  corresponding  to  the  different 
levels  of  expertise  expected  of  different  types  of  users,  have  been 
identified.  A  system,  based  on  these  conceptual  models,  in  which 
each  type  of  use*. can  express  the^problem  using  familiar 
teiminology,  has  been  proposed.  To  realise  this  system,  a  frame- 
based  knowledge  representation  approach  has  been  adopted.  Domain 
concepts  which  form  the  building  blocks  of  each  model  have  been 
isolated  and  represented  as  frames.  For  a  particular  problem,  each 
model  is  instantiated  from  a  knowledge  base,  consisting  of  these 
frame  concepts,  stored  in  a  hierarchy.  Hierarchical  links  express  the 
intrinsic  structures  which  any  valid  model  must  obey. 
Transformations  between  motjels  are  achieved  through  localised  rule 
bases,  which  form  bridges  between  concepts  in  different  knowledge 


bases.  Transformations  may  generate  more  than  one  possible  model, 
in  which  case  the  user  makes  the  final  decision.  Guidance  is  offered 
by  rating  each  possibility  according  to  predicted  accuracy  as  well  as 
-  efficiency  oitcria. 

The  authors  believe  that  the  proposed  system  forms  the  basis  of  a 
totally  integrated  problem  solving  environment.  However,  in  order  to 
realise  a  truly  powerful  system,  the  next  stage  must  include  intensive 
knowledge  acquisition,  in  order  to  develop  rules  that  can  deal  with  the 
complexities  of  real  world  problems. 


Acknowledgements- 

The  authors  would  like  to  express  their  appreciation  to  Dr.  J.B 
Grimson  and  Professor  J.G.  Byrne  of  Trinity  College,  Dublin  for 
their  advice  and  encouragement  We  would  also  like  to  thank  N 
Hataoka,  Hitachi  Dublin  Laboratory,  for  his  helpful  comments. 

References 

(B  arras  et  al.  1990]  Barras,  P.,  Bluni,  J ,  Paumier,  J.C ,  Witomski, 
P  and  Rechenmann,  F  "EVE.  An  Object-Centred  Knowledge-Based 
PDE  Solver".  In  Proceeding  of  the  Second  International  Conference 
on  Expert  Systems  for  Numerical  Computing,  Purdue  University, 
USA.  1990. 

(Chelsom  et  al  1990]  Chelsom,  J.,  Cornali,  D  and  Reid,  I 
"Numerical  and  Statistical  Knowledge-Based  Front-ends,  Research 
and  Development  at  NAG".  In  Proceeding  of  the  Second  International 
Conference  on  Expert  Systems  for  Numerical  Computing,  Purdue 
University,  USA,  1990, 

[Kon’no- et  al  1986)  Kon’no.C  ,  Saji,  M  ,  Sagawa  ,  N  and 
Umetam,  Y.  ''Advanced  Implicit  Solution  Function  of  DEQSOL  and 
its  Evaluation''.  In- Proceedings  of  IEEE  Fall  Joint-Computer 
Conference,  pp.  1026- 1033,  Dallas,  USA,  1986 
[Rice  1985]  Rice,  J  R.  "ELLPACK  An  Evolving  Problem  Solving 
Environment  for  Software  Computing’’  In  Proc  of  IF1PTC2i  IVG 
2J  Working  Conference  on  Problem  Solving  Environments  for 
Scientific  Computing,  pp.  233-245, 1985. 

(Russo  et  al  1987]  Russo, M.,  Peskin,  R  ,  Kowalski,  A.  "A  prolog- 
based  expert  system  for  modeling  with  partial  differential  equations" 
In  Simulation,  Vol  49,  No  4;  pp.150-158,  1987. 

(Schonaucr  and  Schncpf.  1987]  Schonauer,  W,  and  Schnepf,  E. 
"Software  Considerations  for  ’Black  Box'  Solver  FIDISOL  for 
Partial  Differentia)  Equations".  In  ACM  Trans,  on  Maths .  Soft ,  Vol 
13,  No.  4,  pp.  233-245,  1987. 

[Tcllo  1990]  Tello;  E  R  '-’Object  Oncnted  Programming  m  Artificial 
Intelligence".  Addison-Weslcy,  New  York,  1990. 

(Umetam  et  al  1985]  Umetam,  Y  el  al  "DEQSOL  A  numerical 
Simulation  for  Vector/Parallel  Processors"  In  Proc  of  IFIPTC2> 
WG  2  J  Working  Conference  on  Problem  Solving  Environments  for 
Scientific  Computing,  pp.  147-164,  1985. 


1861 


An  evaluation  tool, 
for 

CCND  language 
applications 

Rajiv  Trehan 

Information  Systems  Laboratory 
Toshiba  Research  and  Development  Center 
Toshiba  Corporation- 
1  Komukai  Toshiba- cho,  Saiwai-ku 
Kawasaki-shi,  Kanagawa  210,  Japan 

Abstract 

Currently  the  language  and  execution  models  being 
adopted  for  the  .Committed  Choice  Non-deterministic 
(CCND)  languages  are  settling  down  to  subsets  of  the 
possible  models.  In  general  the  subsets. being  adopted  are 
governed  by  implementation  issues  rather  than  applica¬ 
tion  requirements.  To  give, an  applications  perspective  on 
alternative  execution  and  language  models  we  develop  a 
new  evaluation  system.  The  system  allows  us  to  consider 
how  an  application  behaves  on  alternative  language  and 
execution  models.  Hence  providing  some  applications  ra¬ 
tionale  for  the  design  and  implementation  alternatives  for 
this  class  of  languages. 

1  Introduction 

Currently  the  language  and  execution  models  being 
explored  for  the  Committed  Choice  Non-Deterministic 
(CCND)  logic  languages  (10)  [17]  (5)  are  settling  down  to  a 
subset  of  the  possible  models.  For  example,  the  language 
executions  are  being  restricted  to  only  investigate  clauses 
sequentially.  In  general  the  subsets  being  considered  are 
governed  by  implementation  issues  rather  than  applica¬ 
tion  requirements.  To  give  an  applications  perspective  on 
alternative  execution  and  language  models  we  develop  a 
new  evaluation  system.  The  system  allows  us  to  consider 
how  an  application  behaves  on  alternative  language  and 
execution  models.  Hence  providing  some  applications  ra¬ 
tionale  for  the  design  and  implementation  alternatives  for 
this  class  of  languages. 

The  execution  alternatives  considered  are  for  the  entire 
class  of  CCND  languages,  rather  than  any  given  subset, 
and  represent  extremes  in  the  implementation  options,  for 
instance  the  alternatives  for  scheduling  are  busy  and  non- 
busy  waiting.  Of  courr.s  this  approach  allows  us  to  also 
compare  given  subsets  of  the  CCND  languages,  like  flat 
guards  (7),  (3).  To  compare  alternative  language  and  ex¬ 
ecution  models  we  propose  an  extended  set  of  evaluation 
parameters,. which  rationalise  currently  accepted  parame¬ 
ters  for  evaluating  various  sub-classes  of  the  CCND  lan¬ 
guages. 

As  with  current  evaluation  s> stems  we  attempt  to  mea¬ 
sure  the  inherent  parallelism  available  in  the  evaluation 
of  programs.  A  measure  of  the  inherent  parallelism  has 


several  uses:  it  gives,  a  theoretical  measure  of  parallelism 
against  which  particular  implementations  can  be  gauged; 
and  it  provides  information  for  programmers  on  the  rela¬ 
tive  merits  of  various  programming  techniques. 

For  typical  Computer  Science  application  areas,  such  as 
matrix  multiplication,  it  is  often  possible  to  obtain  theo¬ 
retical  measures  for  the  inherent  parallelism.  However,  for 
AI  type  problems  the  parallelism  depends  on  several  fac¬ 
tors,  such  as  data  structures  (knowledge  representation), 
inference  mechanisms  and  irregular  search- spaces.  The- 
irregular  nature  of  AI' problems  makes  a  theoretical  mea¬ 
sure  of,  parallelism  difficult.  Another  approach  to  obtain¬ 
ing  measures  of  inherent  parallelism  for  both  regular  and- 
irregular  problems  is  to  simulate  the  computation  on  an 
infinite  processor  model.  The  simulated  processor  utilisa¬ 
tion  gives  a  measure  of  the  inherent  parallelism.  It  is  this 
second  approach  we  adopt  in  this  work.  To  obtain  a  mea¬ 
sure  of  the  inherent  parallelism  available  in  program  exe¬ 
cution  we  adopt  a  breadth-first  execution  model  assuming 
an  unlimited  number  of  processors. 

We  first  consider  current  models  of  execution  and  the- 
parameters  collected  during  program  execution  and  dis¬ 
cuss  their  limitations  in  measuring  inherent  parallelism. 
The  limitations  highlighted  are  then  addressed  in  two  re¬ 
spects.  Firstly,  we  develop  a  new  system  which  allows 
us  to  more  accurately  measure  the  inherent  parallelism  in 
the  execution  of  a  program.  Secondly,  we  consider  possible 
alternatives  open  to  language  implementors,  for.  instance 
different  scheduling  policies.  This  provides  the  basis  for  a 
set  of  parameters  which  can  be  used  to  indicate  the  relative 
merits  of  the  alternatives.  The  measurement  of  the  pro¬ 
posed  parameters  is  carried  out  for  one  CCND  language, 
Parlog  (5).  We  then  present  a  profiling  tool,  developed 
to  graphically  display  profiles  of  the  various  proposed  pa¬ 
rameters  over  time  (cycles).  Evaluations  using  this  tool 
are  given  for  a  small  set  of  simple  example  programs  and 
the  results  analysed.  Finally  we  consider  limitations  of  the 
system  and  areas  of  future  work. 

2  Current  measurements  and 
their  limitations 

The  evaluation/comparison  of  application  level  programs 
tends  to  be  based  on  course  grained  metrics/parameters, 
like  logical  inferences  for  Prolog  [20).  For  the  CCND  lan¬ 
guages  three  parameters  are  usually  quoted  when  com¬ 
paring  applications,  namely  cycles,  suspensions  and  re¬ 
ductions  [10]  [11]  [8]  In  general  these  parameters  appear 
to  be  collected  using  interpreters  on  top  of  Prolog  [10]  [5] 
[12],  although  they  could  be  collected  using  compilers  [5] 
[18]  [9]  or  abstract  machine  emulators  [1]  [2]  [6]  [19], 

The  evaluation/comparison  also  tends  to  be  based  on 
the  inherent  parallelism  available  at  the  applications  level 
[10]  [11]  [8],  This  is  obtained  by  using  a  breadth-first  eval¬ 
uation  model  assuming  an  infinite  number  of  processes. 
There  are  two  main  limitations  with  the  current  parame¬ 
ters.  Firstly,  the  actual  evaluation  models  employed  make 
several. approximations  to  a  full  breadth-first  evaluation 
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•  'the  AND-parallel  goals  are  represented  as  a  list  of 
goals  to  be  evaluated; 

•  as  each  goal  is  reduced,  the  resulting  body  goals  are 
added  to  the  goal  list  and  any  appropriate  bindings 
are  made; 

•  variable  bindings  are  produced  in  the  order  that  the 
» goals  are  evaluated; 

•  'guarded  goals  are  evaluated'  as  a  single  reduction 
which  incur  no  cycle  overheads; 

•  the  interpreters  make  no  distinction  between  suspen¬ 
sion  and  failure  of  guarded  goals; 

•  the  interpreters  model  OR-paralielism  by  backtrack¬ 
ing  through  alternative  clauses;  and 

•  the  first  textual  clause  in  a  predicate  whose  guarded 
goals  succeed  is  committed  to. 

The  second  is  that  the  inherent  parallelism  depends  on 
choices  made  about  scheduling,  guard  termination  and 
suspension  mechanism,  however,  the  effect  of  these  al¬ 
ternatives  are  not  considered  or  reflected  in  the  current 
evaluation  parameters1. 

In  the  following  sub-section  we  consider  how  the  approx¬ 
imations  to  a  breadth-first  model  may  introduce  erroneous 
data  into  our  evaluation  parameters.  We  then  develop  an 
improved  system  which  addresses  these  execution  limita¬ 
tion.  Finally,  we  propose  and  collect  an  extended  set  of 
evaluation  parameters  which  reflect  execution  alternatives. 


2.1  Cycles 

The.. cycles  parameter  attempts  to  measure  the  depth  of 
the  breadth-first  execution  tree.  A  cycle  corresponds  to 
reducing  all  the  goals  in  the  system  once  in  parallel,  that 
is  assuming  an  infinite  number  of  processors. 

The  evaluations  of  guarded  goals  are  carried  out  by  a 
call  to  the  top-level  of  the  interpreter  and  for  simplicity  is 
assumed  to  take  zero  cycles  to  evaluate  (the  guard  evalu¬ 
ation  is  assumed  to  be  part  of  the  commitment).  Hence, 
the  cycle  count  can  only  claim  to  measure  the  depth  of  the 
evaluation  tree  when  evaluating  flat  code  (7),  (3).  More¬ 
over,  Jn  the  case  of  deep  guards,  any  goals  suspended 
awaiting  the  evaluation  of  a  guard  will  only  suspend  for 
one  cycle  and  not  the  number  of  cycles  it  takes  for  the 
deep  guard  to  be  evaluated.  This  distorts  the  breadth- 
first  evaluation  tree,  reducing  the  cycle  count. 

Another  limitation  is  that  the  goals  are  maintained  as  a 
list  which  is  processed  in  a  left-to-right  order,  any  bindings 
made  in  the  evaluation  of  goals  taking  place  immediately. 
So,  these  bindings  will  be  available  to  any  remaining  goals 
in  the  goal  list.  This  will  allow  goals  that  require  these 
bindings  to  reduce  in  the  current  cycle.  Hence  the  evalu¬ 
ation  is  dependent  on  goal  order. 

*Of  count  there  are  also  several  other  limitations,  like  the  use 
of  an  infinite  processor  model.  However,  this  model  provides  a 
yard  stick  against  which  particular  processor  implementations  can 
be  judged. 


2.2  Reductions 

The  reductions  parameter  attempts  to  measure  the  com¬ 
mitments  performed  by  the  system  in  solving  a  query, 
which  indicates  the  number  of  parallel  goal  evaluations 
that  can  take  place. 

The  first  limitation  is  that  current  interpreters  try  eval¬ 
uating  the  alternatives,  for  a.  given  predicate,  top-down, 
committing  to  the.  first, clause  whose  guarded  goals  suc¬ 
ceed.  Therefore  reductions  can 'only  be  counted  for  the 
clauses  that  have  been, attempted.  Hence  the  reduction 
count  depends  on  the  clause  order. 

Another  problem  occurs  if  a  goal  evaluation  fails,  it  is  re¬ 
scheduled  and  may  be  re-attempted  (if.  the  computation 
goes  on  for  further  cycles  before  deadlocking).  The  re- 
evaluation  of  failed  goals  may  introduce  erroneous  statis¬ 
tics  into  the  reduction  count. 

Finally,  the  evaluation  of  system  calls,  supported  by 
calls  to  the  underlying  Prolog,  are  not  accounted  for. 
These  do  contribute  to  the  overall  work  done  in  evaluat¬ 
ing  programs.  By  ignoring  their  contribution  the  compar¬ 
ison  of  programming  techniques  which  make  use  of  system 
primitives  can  be  misleading. 

2.3  Suspensions 

The  suspensions  parameter  attempts  to  count  the  num¬ 
ber  of  suspended  evaluations  in  the  evaluation  of  a  query. 

The  number  of  suspensions  depends  on  when  sus¬ 
pended  evaluations  are  re-scheduled.  When  evaluations 
suspend  in  the  existing  interpreters  they  are  immediately 
re-scheduled  for  evaluation;  this  is  known  as  busy  wait¬ 
ing.  However,  a  non-busy  waiting  strategy  could  have 
been  used7.  Using  an  ideal  non-busy  waiting  strategy 
each  evaluation  will  only  suspended  once. 

Another  point  to  note  is  current  interpreters  process  the 
goals  from  left-to-right,  generating  bindings  as  the  goals 
are  processed.  These  bindings  may  allow  goals  to  reduce 
in  the  current  cycle  which  would  suspend  if  the  goals  were 
evaluated  in  a  different  order. 

Finally,  as  no  distinction  is  made  between  failed  and 
suspended  goals,  erroneous  statistics  may  be  introduced' 
into  the  suspension  count. 

3  An  improved  execution  system 

3.1  Requirements 

Many  of  the  limitations  and  inaccuracies  of  the  statistics 
obtained  using  current  implementations  are  due  to  fea¬ 
tures  of  the  execution  model  employed.  The  collection  of 
more  meaningful  statistics  requires  the  development  of  an 
improved  implementation.  Such  an  implementation  would 
have  to  exhibit  the  following  features: 

.  it  must  distinguish  between  suspension  and  failure  of 
a  goal  evaluation. 

3Tlie  suspended  evaluations  are  hooked  (or  tagged)  to  the  vari- 
ables  that  were  required  when  they  suspended,  when  sufficient  vari¬ 
able.  become  instantiated  they  are  re-scheduled. 


•  It  must  mote  accurately  measure  ..tee  depth’ of  the 
evaluation  tree:  the  depth  of  the  evaluation  tree 
should  not  be  dependent  on  goal  or  clause  order;  and 
the  depth  of  the  evaluation  tree  must  account  for  the 
use  of  deep  guards. 

•  It- should  realise  AND-parallelism"  each  of  the  goals 
in  the  conjunction  should  appear  to  be  evaluated  in' 
parallel;  ■  each  .-AN  D-parallel  goal:  should  be  reduced' 
once  in  each  cycle.  The  reduction  may  take  place  in 
the  guarded  evaluation  in*  the  case  of  deep  guards; 
and  the  simulated  reduction  of  each  goal  in  a  given 
cycle  should  be  indeperident  of  the  actual  order  in 
which  the  goals  are  processed. 

•  -It, should  realise  ORrparallelism:  each  of  the. clauses 
that  a  goal  could  use  to  reduce, should  appearto  be 
explored  in,  parallel;  in  a.parallel  evaluation  a  goal 
should  commit  to  the  firsts  clause  whose  guard  suc¬ 
cessfully  terminates;  and^the  evaluation  of  a  goal  sus¬ 
pends  if  no  committable  clause  exists  and  at  least  one 
clause  evaluation  suspends. 

3.2  Idealisations 

We  now  consider  idealisations  made  in  the  design  of  our 
improved  execution  model. 

3:2.1  Guard  evaluation  idealisations 

We  considered  two  alternative  models  for  incorporating 
the  effect  of  the  guard  evaluation  into  the  overall  cycle 
based  evaluation3.  The  first  assumes  that  in  a  cycle,  a 
goal  can  be  head  unified  with  a  clause  and  the  guarded  goal 
evaluation  instigated.  This  model  assumes  that  guarded 
system  goals  (flat  guards^  will  evaluate  in  1  cycle.  The 
second  assumes  that  in  a  cycle,  a  goal  can  be  head  unified 
with  a  clause  and  either  the  guarded  evaluation  instigated 
or  a  system  guard  evaluated.  This  model  assumes  that  sys¬ 
tem  goals  incur  no  cycle  costs.  We  adopt  the  second  model 
as  this  model  has  a  similar  notion  of  depth  to  the  previous 
implementations  when  executing  flat  guarded  programs. 

3.2.2  AND-parnllcl  idealisations 

A  fully  accurate  model  would  be  able  to  determine  exactly 
when  a  goal  makes  a  binding,  how  long  it  would  take  for 
this  binding  to  reach  another  goal  and  whether  this  would 
be  in  time  for  the  goal  to  use  it.  Clearly  the  inherent 
parallelism  should  not  be  dependent  on  goal  order.  We 
make  the  assumption  that,  in  a  cycle,  a  goal  can  only  use 
bindings  available  to  it  at  the  start  of  the  cycle.  Such  a 
model  may  not  display  all  the  parallelism  that  could  be 

3It  »hould  be  noted  that  mo*t  cycle  based  models  for  obtaining  a 
measure  of  the  depth  of  the  evaluation  tree  will  be  prone  to  giving 
distorted  results,  in  that  they  will  tend  to  associate  fixed  costs  with 
the  various  components  of  the  evaluation,  like  head  unification,  sys¬ 
tem  call  evaluation  and  commitment.  Although  an  elaborate  model 
of  cost  could  be  developed,  these  costs  would  tend  to  be  implementa¬ 
tion  dependent.  Moreover,  such  a  model  is  unlikely  to  give  a  radically 
different  view  of  the  general  feature*  of  the  evaluation  compared  with 
a  fixed  cost  model. 


achieved  in’ a  given  implementation,  but.at  least  it  gives 
a  measure  which  is  not  dependent  on-how, the  goals  are 
ordered. 

3.2.3  OR-parallel  idealisations 

The  modelling  of  inherent  OR-paraUelism. requires  the 
evaluation  to  commit  to  the  clause  whose  first  guard  suc¬ 
cessfully  terminates.  In  our  system  the  duration  of  a  guard 
evaluation  is  approximated  by, the  depth, of  its  evaluation 
tree.  So  the  evaluation  should  commit  to  the  guard  with 
the  shallowest  evaluation  tree. 

3.2.4  System  call  idealisations 

System  calls  contribute  to  the  overall  work  done  in  the 
evaluation  of  a  goal.  However,  the  complexity  of  each  sys¬ 
tem  call  may  vary.  This  makes  the  matter  of  system  calls 
difficult  to  address;  as  it  will  be  implementation  dependent 
and  there  is  no  agreement  on  the  nature  of  such  calls.  As 
a  general,  principle  we  count  system  calls  as  reductions,, 
as  they  contribute  to  the  overall  work  done  (although  our 
evaluation  system  is  easily  adapted  to  ignore  all  or  some 
of  the  system  calls,  see  section  5).  In  suspending  the  eval¬ 
uation  of  a  system  call  we  assume  it  behaves  like  &  goal 
with  one  clause. 

3.3  Development/Realisation. 

We  could  implement  the  required  improved  execution 
model  by  an  abstract  machine  emulated  in  ’C\  The  ab¬ 
stract  machine  could  then  be  instrumented  to  collect, 
dump,  dynamic  information  like  the  cost  of  various  op¬ 
erations  and  the  reference  characteristics  of  our  abstract 
machine  and  implementation,  in  the  same  vein  as  [13]. 
However,  this  approach  was  not  feasible  for  this  work  be¬ 
cause  there  was,  and  still  is,  no  representative  abstract 
machine  for  the  entire  class  of  CCND  languages  available 
for  instrumentation. 

Alternatively,  we  could  implement  an  improved  inter¬ 
preter  which  could  either  dynamically  collect  statistics  like 
the  previous  evaluation  interpreters,  or  dump  information 
about  the  program  evaluation  as  in  a  'O’  emulator.  We 
have  chosen  to  implement  an  interpreter  which  will  dump 
data  about  the  program  execution.  The  dump  data  is 
rich  enough  to  allow  flexible  analysis  assuming  alternative 
models* of  execution.  Hence  we  are  able  to  support  an 
extended  set  of  evaluation  parameters. 

Using  an  interpreter  allows  us  to  collect  coarse  grained 
information  like  number  of  commitments  or  size  of  sus¬ 
pension  queues.  These  coarse  parameters  are  similar  to 
the  coarse  grained  parameters,  like  logical  inferences  [20], 
collected  for  sequential  logic  programming  languages  (Pro¬ 
log)  and  to  the  currently  accepted  evaluation  parameters 
used  for  the  CCND  languages  (10)  [11]  [8].  However,  an 
interpreter  does  not  easily  allow  us  to  measure  fine  detail, 
like  (13). 

The  incremental  design  and  development  of  the  im¬ 
proved  interpreter  can  be  found  in  [16].  The  features  of 
this  interpreter  are  as  follows: 
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*  both 'AND  and;ORrparallelism  are  modelled; 

.♦  each  of  the  guarded  goals  for  a  given  predicate  is  tried, 
and  relevant  statistics  collected; 

•  the  statistics'from  the  evaluation  of  the  guarded 'goal- 
are  used  to  pick  the  solution  path  (currently  this  is  the- 
shallowest  successful 'guard,  i.e.Uhe  first  guard  that, 
would  have  succeeded  in  a  breadth-first  execution); 

♦  the  goals  that  form  the  goal-list  undergo  one  reduc¬ 
tion  in  a  cycle  (any  bindings  made  as  the ’goal  list 
is  processed  occur  only.- when  all  the  goals  have  been 
attempted); 

•  the  evaluation  of  a  system  goaf  which. makes  a  call 
to  the  underlying  Prolog  system  is  counted 'as  .one 
reduction; 

♦  bindings  made  using  calls  to  the  underlying  Prolog 
system  are  made  only*  when  all  the  goals  have  been 
attempted; 

•  the  interpreter  makes  a  distinction  between  suspen¬ 
sion- and  failure; 

♦  although  guard  evaluations  are  carried  out  to  comple¬ 
tion  in  one  go,  the  commitment  of  a  goal  to  a  given 
clause  is  prevented  for  the  number  of  cycles  the  guard 
took  to  evaluate;  and 

♦  the  suspension  of  all  the  guarded  goals  causes  suspen¬ 
sion  of  the  goal  being  evaluated. 

4  New  evaluation  parameters 

Apart  from  having  inaccuracies  in  measuring  the  inherent 
parallel  behaviour  of  programs  introduced  through  limita¬ 
tions  in  the  execution  model,  the  parameters  proposed  by 
Shapiro  (cycles,  suspensions  and  reductions)  do  not 
give  any  indication  of  the  effects  that  alternative  imple¬ 
mentation  models  may  have  had. 

In  this  section  we  consider  the  extremes  of  the  current 
implementation  models.  Our  new  parameters  aim  to  re¬ 
flect  the  effect  of  adopting  different  alternative  execution 
models  and  so  give  us  a*  better  understanding  of  our  ap¬ 
plications, 

4.1  Basis  for  new  parameters 

4.1.1  Pruning  OR-branclics 

The  parallel  evaluation  of  a  goal  invokes  several  guard 
evaluations,  one  for  each  clause  that  the  goal  successfully 
head  unifies  witfi.  The  evaluation  commits  to  the  first 
clause  whose  guarded  system  successfully  terminates.  On 
commitment,  the  other  guarded  systems  invoked  by  the 
goal  evaluation  can  be  terminated  or  ignored.  Terminat¬ 
ing  the  alternative  clauses  (pruning)  requires  the  system 
to  stop  the  computation  being  carried  out  in  the  alterna¬ 
tive  branches.  This  may  prevent  these  branches  carrying 
out  needless  computation.  Ignoring  the  other  alternative 
clauses  (non-pruning)  when  a  goal  commits,  requires  the 
system  to  disregard  any  commitment  requests  from  the 
other  alternatives  should  their  guarded  systems  also  ter¬ 
minate  successfully.  This  assumes  that  guard  evaluations 
terminate  and  certainly  do  not  diverge.  This  may  save 


some  computation  (in  sending  a  terminate  message  to, the 
other  guard  evaluation)  if  the  guards  .are  balanced  or  in 
cases  where  only  one  clause. can  be  committed  to. 

4.1.2  Suspension- mechanisms 

A;  goal  evaluation  .suspends  if,  there  is  no  committable 
clause,  and  at  least  one  of  the  guard;  evaluations  or  head 
unifications  suspends.,  Suspending  the  evaluation  can  be 
achieved  in  several  ways;  the  two  extremes  being  goal  sus¬ 
pension  and  clause  suspension.  Goal  suspension  involves- 
suspending  the  parent  goal  of  a  computation  when  each 
of  the  clause  evaluations  suspend. 

Alternatively  each  of  the  clauses  (guarded  computations 
and  head  unifications)  could.be  suspended,  which  is  known 
as  clause  suspension.  As  there  may  be  recursive  guard 
evaluations  invoked,,  clause  suspension  may  result  in  a 
tree  of  suspended  evaluations,  representing  the  guard  call 
structure.  The  trade-off  between  these  two  extremes  is 
basically  a  space-time  consideration.  Suspending  a  goal 
requires  less  space  tlian  suspending  the  evaluation  of  each 
of  the  clauses.  However,  if  some  computation  is  performed 
in  the  evaluation  of  the  guard  before  suspension  then  this 
computation  will  be  lost,  and  repeated,  if  the  goal  is  sus¬ 
pended. 

4.1.3  Scheduling  policy 

Another  choice  is  how  and  when  suspended  evaluations 
are  re-scheduled.  When  an  evaluation  suspends  it  could 
be  tagged  the  variables  which  are  required  and  un¬ 
bound,  and  re-scheduled  when  they  become  bound,  this 
is  known  as  non-busy  waiting.  It  should  be  noted  that 
some  predicates,  like  nerge/3,  only  suspend  on  one  vari¬ 
able  whereas  others,  like  equals/2,  require  both  argu¬ 
ments  to  be  bound.  The  other  extreme  would  be  to  im¬ 
mediately  reschedule  the  suspended  evaluation,  known  as 
busy  waiting; 

Employing  a  non-busy  waiting  suspension  mechanism 
is  appropriate  if  suspended  evaluations  remain  suspended 
for  several  cycles,  for  example  in  generating  prime  num¬ 
bers  by  sifting  (5)  most  of  the  filter  processes  will  be  sus¬ 
pended  most  of  the  time.  Employing  a  busy  waiting  sus¬ 
pension  mechanism  is  appropriate  if  suspended  goals  are 
only  likely  to  be  suspended  for  a  short  period  (see  [14], 
(15)).  as  with  Layered  Streams  (8). 

4.2  Proposed  profiling  parameters 

Our  new  profiling  parameters  aim  to  reflect  the  effea  of 
various  alternative  execution  options  We  propose  col¬ 
lecting  suspensions  and  reductions  using  combinations  of 
the  following  alternatives.  OR-branches  may  be  pruned 
or  non-pruned,  suspended  evaluations  may  be  goals  or 
clauses  and  scheduling  may  be  busy  or  non-busy  The 
new  parameters  are  suspensions  and  reductions  using 

-  busy  waiting,  non-pruning  and  goal  suspension, 

-  busy  waiting,  non-pruniug  and  clauso  suspension, 

-  busy  waiting,  pruning  and  goal  suspension; 


-  busy  waiting,  pruning  and  clause  suspension; 

-  non-busy  waiting,  non-pruning  and  goal  suspension; 

-  non-busy  waiting,  non-pruning  and  clause  suspension; 

-  non-busy  waiting,  pruning  and  goal  suspension;  and 

-  non-busy  waiting,  pruning  and  clause  suspension. 

A' full  description  of  all  8  models  of  execution  and  addi¬ 
tion  parameters  we  also  consider,  tH  depth  of  the  eval¬ 
uation  and  the  minimum  reductions,  is  given  in  [14]. 

5  Profile  tool 

As  mentioned  earlier,  see  section  3.3,  our  hew  interpreter 
provides  information  about  the  execution  by  creating  a 
dump-data  file.  This  dump  file  contains  tokens  which 'al¬ 
low  us  to  build'a  parallel  picture  of  the  execution  under 
a  range  of  al  ter  native  models  of  execution.  The  tokens 
in  this  dump  file  are  used  by, our, post  analysis  to  extract 
the  parameters  we  put  forward.  The  main  features  of  this 
post  analysis  are: 

•  The  profiler  maintains  an  aggregate  of  each  of  the  pro- 
posed  profile  parameters  per  cycle.  This  provides  us 
with  a  break  down  of  the  various  profiling  parameters 
which  can  be  used  to  give  a  dynamic  picture  of  the  ex¬ 
ecution.  Moreover  this  mechanism  also  provides  the 
means  by  which  we  are  able  to  collect  pruned/non- 
pruned,  busy /non-busy  and  goal/clause  data. 

•  Profiling  the  guard  .data  produces  a  cycle  by  cycle 
aggregate  of  each  of  the  parameters  as  collected  in 
the  guard  evaluation. 

•  On  completing.the  profiling  of  some  guard  data  the 
next  token  indicates  that  the  parent  goal  either  com¬ 
mits,  suspends  or  fails. 

-  If  the  goal  comets,  the  clause  number  and 
depth  of  the  commitment  are  also  returned. 
This  provides  information  which  is  used  to 
prune  those  profiling  parameters  which  adopt 
a  pruned  execution  model.  The  pruned  guard 
profile  is  then  combined  (spliced)  into  Us  par¬ 
ent’s  profile  data. 

-  If  the  goal  evaluation  suspends  for  the  first  time 
the  suspension  parameters  of  the  guarded  goal 
evaluations  are  spliced  into  the  parent’s  profile. 
An  additional  suspension  is  also  added  to  all 
the  goal  suspension  parameters,  at  the  depth 
at.  which  the  guard  suspended. 

-  If  the  goal  evaluation  re-suspends,  the  busy  sus¬ 
pension  parameters  of  the  guarded  goal  eval¬ 
uations  are  spliced  into  the  parent’s  profile. 
An  additional  suspension  is  also  added  to  the 
busy  waiting  goal  suspension  parameters,  at 
the  depth  at  which  the  guarded  evaluation  sus¬ 
pended. 


We  have  also  implemented  .a  profile  tool  which  exe¬ 
cutes  under  SaJNVIEW^.  This  tool  allows  us  to  see 
any,  or  several,  profiling  parameters  in  graphical,  form. 
The  tool  .also  provides  information  on  .the  totals  of  the 
various  *  parameters.  -32  The  .dump  file  could  be  used*  to 
collect. several  other  parameters..  For  example,  .we; count 
system  calls  as  reductions;  which  we  use  as  a  measure  of 
parallelism.  However,  the  .dump  file  contains  different  to¬ 
kens  for  .commitments  and  system  call  evaluations  and  so 


different  measures  for  the  parallelism  could  be  obtained 
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Figure  1:  Example  of  the  Profiletool  for  Quick-sorting 


Figure  1  contains  an  example  screendump  of  our  tool. 
The  tool  shows  plots  of  the  number  of  reductions  and  sus¬ 
pensions  (y-axis)  in  each  cycle  (x-axis).  Such  plots  can  be 
given. for  any  combination  of  suspension  mechanism, 
scheduling  strategy  and  pruning  option.  The  options 
selected  are  indicated  by  the  toggle  buttons  on  the  right 
hand  side.  The  tool  also  presents  information  on  the  total 
number  of  reductions  and  suspensions  using  a  given  exe¬ 
cution  model,  next  to  the  toggle  buttons  for  each  option. 
Finally,  the  tool  also  contains  some  more  general  infor¬ 
mation,  such  as;  the  goal  that  was  evaluated;  the  elapsed 
time  of  the  evaluation,  and  the  minimum  number  of  reduc¬ 
tions  required  to  evaluate  the  goal,  assuming  the  existence 
cf  an  oracle  to  pick  the  correct  clause. 

6  Example  executions  and  mea¬ 
surements 

In  this  section  we  consider  the  behaviour  of  two  simple 
example  programs,  “quick  sort”  and  “prime  number  gen¬ 
eration”.  These  programs  are  simple  enough  to  be  con¬ 
sidered  theoretically  and  so  we  use  them  to  highlight  the 
improvement  and  use  of  our  profile  tool;  The  evaluation  of 
other  programs  including  a  collection  of  AI  programs  can 
be  found  in  (14)  and  (15). 

6.1  Quick-sort 

The  quick-sort  example  highlights  the  various  differ¬ 
ences  in  the  various  suspension  parameters  and  how  reg- 
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mode  quicksort  (?,“). 
quicksort (Onsort *d .Sortmd) 

qsort(0nsorted.Sort*d-£]). 

mods  qsort(?,~). 

qfort(CllUnsort*d3  .Sortsd-kcst):- 

- part it ion(0nsort#d,X. Smaller, Larger), 
qsort (Smaller .Sorted- [X 1 SortedTeap] ) , 

qsort (Larger, Sort edTemp-Rest) . 

qsort  (□, Sort  ed-Rest):-  Sorted  =  Rest, 
mode  jartitioa(?,?,“ 

partition (  EX  I  Xs] ,  A  .Smaller ,  EX  I  Larger]  ) :  - 
A  <  I  : 

partition(Xs , A .Smaller, Larger) . 
partitionCEXfXs] ,  A,  EX  I  Smaller]  .Larger) 

A  >=  X  : 

partition(Xs, A, Smaller, Larger) . 
partitionC 


Figure  2:  Quick-sort  program  in  Parlog 


ular  and  irregular  queries  result  in  differing  dynamic  fea¬ 
tures  of  the  computation.  Firstly,  we  consider  how  this 
program  behaves  if  the  list  to  be  sorted  is  already  ordered 
and  then  if  the  input  list  is  unordered. 

6.1.1  Quick-sort  of  an  ordered  list 

Consider  the  program  m  Figure  2  with  the  following  query 

quicksort  (Cl  .2,3,4, 5, 6, 7, ft, 9, 10]  ,L) 

The  regular  nature  of  the  data  for  this  query  al¬ 
lows  us  to  reason  about  its  evaluation.  Basically, 
quicksort  (El  ,2, 3, 4, 6, 6, 7, 8, 9, 10],  L)  will  be  reduced 
to  the  initial  qsort/2  goal.  This  goal  is  then  reduced 
to  a  partition/4  and  two  new  qsort/2  goals.  The 
partition/4  goal  will  partition  the  input  list  (based  on 
the  current  first  element;  the  pivot )  into  two  output  lists. 
One  output  list  contains  elements  greater  than  the  pivot, 
the  other  elements  less  than  the  pivot.  As  the  input  list  is 
already  ordered  the  partition/4  goal  will  only  add  ele¬ 
ments  to  one  of  the  output  lists.  The  two  qsort  processes 
will  initially  suspend  awaiting  the  output  lists  from  the 
partition/4  to  be  generated.  In  the  following  cycle  one  of 
the  qsort  goals  will  be  able  to  reduce,  as  the  partition/4 
process  constructs  the  output  lists.  The  reduction  of  this 
qsort  goal  will  again  result  in  a  partition/4  process  and 
two  qsort/2  processes.  The  other  qsort/2  process  re¬ 
mains  suspended  until  the  entire  list  has  been  partitioned, 
i.e.  until  the  partition/4  processes  complete. 

These  processes  will  behave  as  before,  the  partition/4 
process  will  only  add  elements  to  one  of  its  output 
lists,  the  two  qsort  processes  will  initially  suspend,  one 
of  which  will  be  able  to  reduce  in  the  following  cy 


de.  The  partition  processes  wiB  be  spawned  in  cy¬ 
cles  2,4,6,8,10,12,14,16,18,20  respectively.  The  duration 
of  tfce  processes  will  be  S.8,7,6,5,4,3,2,1  cycles  respec¬ 
tively.  Hence,  these  processes  will  terminate  in  cycles 
11, 12, 13,14,15,16,17,18, 19,20,21.  After  cycle  11  there  will 
be  one  less  suspended  process  each  cycle  (a  qsort/2  pro¬ 
cess)  until  cycle  21. 

This  effect  has  to  be  combined  with  the  spawning  pat¬ 
tern  of  the  qsort/2  goals,  i.e.  initially  2  suspensions,  of 
which  1  reduces  in  the  next  cycle.  The  overall  goal  sus¬ 
pension  pattern  is,  initially  in  every  other  cyde  there  will 
be  two  new  suspensions,  one  of  which  is  able  to  reduce  in 
the  following  cyde.  After  cyde  11  the  pattern  will  invert. 
In  every  other  cyde  there  will  be  one  new  suspension  fol¬ 
lowed  by  two  of  the  suspended  goals  being  re-scheduled 
and  reducing. 


Figure  3.  Quick-sorting  an  ordered  list  (goal  and  clause 
suspensions) 


Figure  4.  Quick-sorting  an  ordered  list  (busy  and 
non-busy  suspensions) 

Figure  $  gives  profiles  for  goal  and  clause  suspensions 
using  busy  waiting  and  non-pruning  Using  busy  wait¬ 
ing  gives  us  a  measure  of  the  total  number  of  processes 
suspended.  Note  that  the  goal  suspension  profile  (the 
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figure  5:  Quick-sorting  an  ordered  list  (reductions  and 
suspensions)  _ 


lower  graph)  is  as  predicted,  that  is  the  total  number  of 
suspended  processes  will  initially  increase  in  a  step  wise 
■'manner  (steps  of  +2,  -1)  and  from  cycle  11  onwards  will 
reduce  in  a  step  wise  manner  (steps  of  +1^  2). 

Moreover  comparing  the  goal  and  clause  suspension 
profiles  indicates  the  number  of  clauses  that  each  sus¬ 
pended  evaluation  could  be  reduced  by  in  the  dynamic 
program  This  gives  a  ratio  of  exactly  2  clause  suspen¬ 
sions  for  every  goal  suspension.  This  is  also  confirmed  by 
our  analysis,  in  that  the  only  processes  to  be  suspended 
are  qsort/2,  which  could  be  evaluated  via  two  clauses. 

Figure  J  gives  profiles  for  goal  suspension  using  busy 
and  non-busy  scheduling  strategies.  Busy  waiting  gives 
a  measure  of  the  total  number  of  suspended  processes 
while  non-busy  gives  a  measure  of  the  new  suspended 
processes  in  each  cycle.  The  profiles  fit  the  analysis  of 
this  execution.  In  every  other  cycle  there  will  be  two  new 
suspended  goals,  one  of  which  will  reduce  in  the  next  cycle. 

Finally  we  give  a  profile  of  the  reductions  in  Figure  5: 
The  number  of  reductions  increases  by  1  each  cycle,  as  new 
partitien/4  processes  are  spawned  and  reduced.  After 
cycle  11  the  partition/4  goals  begin  to  succeed  (termi¬ 
nate)  and  the  processes  begin  to  collapse.  At  the  peak 
there  will  be  10  partition/4  and  1  qsort  process  reduc¬ 
ing  in  parallel. 

6.1.2  Quick-sort  of  an  unordcrcd  list 

We  now  turn  our  attention  to  the  behaviour  of  quick-sort 
on  an  unordcred  list.  Consider  the  the  following  query: 

4  There  U  a  difference  betwe  4  counting  the  trailer  of  clauses 
for  each  predicate  statically,  and  the  dynamic  nature  of  the  pro. 
gram,  a*  tome  predicates  may  be  used  more  often  than  others,  hence 
weighting  the  results.  Of  course  comparing  suspensions  for  goal 
and  clause  suspension  mechanisms  only  provides  the  dynamic  in¬ 
formation  for  impended  evaluations  and  not  the  whole  evaluation. 
This  comparison  still  provides  useful  information  about  the  space- 
time  considerations  for  the  suspension  mechanism  Suspending  the 
clauses  may  save  head  unifications  and  possibly  some  reductions 
(for  deep  guard  examples)  but  requires  more  space  in  that  there 
may  be  several  clause  states  to  suspend  rather  than  a  single  goal 
state. 


mo 


Figure  6:  Quick-sorting  an  unordered  list  (goal  and 
clause  suspensions) 


Figure  7;  Quick-sorting  an  unordered  list  (busy  and 
non-busy  suspensions) 


quicksort^  [4, 6, 2, 9, 5, 5, 1,10, 3, 7j,L) 

The  irregular  nature  of  the  data  for  this  query  makes 
reasoning  about  its  evaluation  difficult.  However  some 
global  features  can  be  predicted,  namely: 

♦  The  unordered  query  will  result  in  the' part  it  ion/4 
process, adding  elements  to  both  output  lists.  This 
will  result  in  both -the  qsort/2  processes  reducing 
•before  the  partition/4  processes  have  terminated. 
Compared  with  the  ordered  list  example  this  should 
show  an  in;rease  in  the  average  number  of  reductions 
and  reduce  the  total  length  of  the  computation. 

♦  As  the  partition/4  processes  add  elements  to 
both  output  lists  the  qsort/2  processes  reduce  to 
three  suspended  processes,  i.e.  the  newly  spawned 
partition/4  goal  may  suspend  because  no  further 
elements  have  been  added  to  its  input  list.  This  will 
be  indicated  by  the  ratio  between  goal  and  clause 
suspensions  increasing,  as  the  partition/4  processes 
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can  be  evaluated  via  3  clauses,  whereas  the  qsort/2 
processes  can  be  evaluated  by.  2  clauses. 

•'In  both  the  ordered  and  unordered  list  examples  10 
partition/4  processes  are  spawned  (one  for.  each  el¬ 
ement  of  the  input  list).Tn  the  ordered  example  each 
partition/4  process  will  partition  the  remainder  of 
the  input  list,  i.e.  the  partition  process  with  a  pivot 
of  3  will  have  to  partition  the  remainder  of  the  input 
list,  namely  f 4 , 5,6,7,8,9,10];  For  the  unordered 
example  each  partition/*  process  will  only  have  to 
partition  a  subset  of  the  remaining  input  list  because 
the  remaining  input  list 'will  be  partitioned  into  two 
lists.  Hence,  there  will  be  less  reductions  performed 
in  the  sorting  of  the  unordered  list  example. 

We  now  compare  the  data  collected  using  our  new  pro¬ 
filing  system  with  the  theoretical  evaluation 'given  above. 
Figure  6  gives  profiles  for  goal  and  clause  suspensions 
using  busy  waiting  and  non-pruning.  The  first  point 
to  note  is  that  the  ratio  of  goal  to  clause  suspensions 
changes  from  1:2  for  the  ordered  example,  to  90.205  for 
the  unordered  example.  From  this  we  can  conclude  that 
some  partition/4  processes  suspend.  Furthermore,  we 
can  see  that  the  overall  length  of  the  computation  has 
been  reduced  from  23  cycles  (for  qsorting  an  ordered  list) 
to  18  cycles  for  this  example. 

Figure  7 gives  profiles  for  goal  suspensions  using  busy 
and  non-busy  scheduling  strategies.  Firstly,  we  can  see 
that  the  duration  of  suspended  processes  is  more  complex 
to  predict.  Secondly,  we  see  that  the  ratio  of  busy  to 
non-busy  suspensions  is.  90:33  for  this  query,  whereas  it 
was  65:20  for  the  ordered  list  example.  This  indicates  that 
the  suspended  goals  remain  suspended  for  less  time  in  the 
unordered  example,  which  is  intuitively  the  case. 

Finally,  comparing  the  total  reductions  performed  for 
the  ordered  list  (122  reductions)  and  the  unordered  list 
(72  reductions)  we  see  that,  as  predicted,  there  is  a  marked 
decrease  in  the  required  number  of  reductions. 

6.2  Prime  number  generation  by  sifting 

The  prime  number  example  illustrates  how  our  model 
for  AND-parallclism  gives  a  more  realistic  indication  of 
ihe  depth  of  the  computation.  The  prograin  (sec  Figure  8) 
used  generates  prime  numbers  by  sifting  a  stream  ofinte- 
gers  [17].  As  each  prime  number  is  produced  it  results  in 
a  filter  process  being  spawned',  each  filter  proems  removes 
multiples  of  itself  from  the. remainder  of  the  stream.  So 
the  algorithm  involves  generating  a  pipeline  of  filter  pro¬ 
cesses  one  for  each  integer  that  is  unfiitered  (new  prime) 
by. the  previous  get  of  filter  processes.  We  consider  the 
generation  of  primes  up  io  50  and  primes  up  to  500. 

This  algorithm  gives  a  good  indication  of  how  the  exe¬ 
cution  model  affects  the  collection  of  meaningful  statistics 
The  technique  involves  generating  a  stream  of  integers,  say 
fifty,  these  integers  being  generated  in  fifty  cycles.  This 
stream  of  integers  then  under  goes  a  sifting  stage,  this  will 
require  further  cycles.  For  example,  consider  the  number 
47.  This  will  be  generated  in  the  forty-seventh  cycle.  It 


primes 

integers(2,I),  sift(I,J). 

j  ^  node  integers (?,“  ). 
integ«rs(If  till]  j 

II  is  1*1, integers (II fI). 

.  .mode  sift(?,~  ). 

sift(D,0). 
k  sift([P|I],tPlRl]) 

lilter(I,P,R),  sift(R,Ri). 

mode  filter (?,?,*  ). 
filter 

filter([M I I] ,P,R) 

0  =:=  H  mod  P 

filter(I,P,R). 
filter(fK I I] »P,[IlR]) 

0  =\=  I  mod  P 

filter(I,P,R  ). 


Figure  8:  Prime  number  generation  by  sifting 

will  then  be  processed  by  filters  representing  the  follow¬ 
ing  prime  numbers.  2,3,5,7,11,13,17,19,23,29,31,37,41,43 
This  takes  at  least  fourteen  cycles,  one  for  each  filter  pro¬ 
cess. 

Previous  statistics  give  a  cycle  count  of  only  fifty ,  this 
is  because  the  goals  in  the  process  queue  are  evaluated 
in  a  left-to-nght  fashion.  Hence,  an  integer  that  is  pro¬ 
duced  in  a  given  cycle  is  able  to  propagate  through  the 
filter,  processes  in  the  same  cycle,  (the  filter  processes  in 
the  queue  are  set-up  in  a  left  to  right  fashion).  Our  sys¬ 
tem  gives  67  cycles  to  produce  the  first  50  prime  numbers. 
Fifty  of  these  cycles  can  be  attributed  to  producing  the  50 
integers,  another  fifteen  can  be  attributed  to  propagating 
the  last  integer  through  the  fifteen  filter  processes  and  the 
remaining  irvo  axe  due  to  spawning  the  first  filter  process 
and  terminating  the  output  stream.  The  effect  of  account 
mg  for  the  propagation  vf  the  integers  through  the  filter 
processes  results  in  the  number  of  suspended  goals  being 
higher. 

7  Limitations  of  the  new  mea¬ 
surements 

The  limitations  of  our  new  system  c  in  be  classified  in  two 
ways.  Those  associated  with  modelling  the  execution  and 
the  collection  ot  our  proposed  parameters,  and  those  as¬ 
sociated  with  information  we  do  not  or  cannot  collect 

•  We  adopt  a  fixed  cost  model,  in  terms  of  cycles.  How 
ever,  the  cost  of  the  given  operation  may  depend  on 


1869 


.Busy  and  Noa-Busy  Suspensions  using 
Hca-Pruned  and  Goal-suspand  execufeicq 


Figure  9:  Prime  numbers  up  to  50  (busy  and  non-busy 
suspensions) 


Busy  and  Non-Busy  Suspensions  using 
Non-Prunod  and  Goal-suspend  elocution 


Figure  10.  Prime  numbers  up  to  500  (busy  and  non-busy 
suspensions) 

several  factors,  such  as  its  complexity.  It  would  be 
better  to  adopt  a  functional  cost  model,  where  the 
cost  of  an- operation  is  calculated  based  on  its  com¬ 
plexity.  Such  a  model  would  however  require  the  costs 
of  the  variou  s  operations  to  be  accurately  quantified. 
However  such  a  cost  model  would  be  difficult  to  con¬ 
struct  without  reference  to  an  actual  implementation 

♦  We  assume  that,  in  a  cycle,  a  goal  can  only  use  bind¬ 
ings  available  to  it  at  the  start  of  the  cycle.  While 
this  is  an  improvement  over  the  current  interpreters, 
in  modelling  the  inherent  parallelism.  Our  model  may 
not  display  all  the  parallelism  that  could  be  achieved 
in  a  given  implementation,  but  at  least  it  gives  a  mea¬ 
sure  which  is  not  dependent  on  how  the  goals  are  or¬ 
dered. 

♦  The  new  interpreter  (like  previous  interpreters)  uses 
goal  suspension.  Unlike  previous  interpreters  our  sus¬ 


pension  ^  record  contains  information 'about  the  dura* 
-iron  of  the  guard  evaluation  before  the  evaluation  sus¬ 
pended;  This  information' is  use d . to  ind i cate ' i f  the 
-Process,] would /have" been ^evajuated^using  clause  sus¬ 
pension.. Whilst  this  is  ah  improvement  oyer,  the  pre- 
, .  .yious  system’our-new.parameters  may  be  in  ..error  for, 
< certain  classes  of  program  [3t4].' 

•>The  system,  does  not  consider  possible. compile  time 
-optimisations,  such  as : clause  indexing.  For  example, 
.  theTour  clauses  for  .a  merge  goal  canbe  indexed  via 
tworswitch  statements.  Of  course,  th  is ,  leads  ,fco  -  the 
-results,  for  clause  suspensions, not  being  directly ''ap¬ 
plicable;  to;  compiler  implementations  using^this  such 
indexing.  However, it  should  be  noted. that" the  com¬ 
parison  of  application -programs  is  still  possible,  as 
the  program  expects  four  suspended  clauses-to  result 
in;four  clause  suspensions. 

8  Conclusions 

Currently  the  language  and' execution  models  being 
■adopted  Tor  the  Committed  *  Choice  Non-deterministic 
(CCND)  languages.are  settlingdown  to  subsets  of  the 
possible  models.  The  resulting  languages  appear  to  have 
flat  guards  and  adopt. non-busy  waiting  scheduling  pol¬ 
icy.  There  is  still  some  discussion  on  whether  the  lan¬ 
guages  should  be  safe  or  unsafe.  In  general  the  sub¬ 
sets  being  adopted  are  governed  by  implementation' issues 
rather  than  application  requirements.  This  work  aims  to 
place  some  applications  rationale  for  the  design  of  lan¬ 
guage  features  and  their  usage.  Considering  applications 
is  important  because  the  classes  of  applications  that  lan¬ 
guage  implementors  aim  to  support  and  the  models  of  ex¬ 
ecution  that  they  provide  may  not  be  those  required  by 
applications  programmers. 

In  this  article  we  present  an  evaluation  system  for  com¬ 
paring  applications  oh  alternative  language  and  execution 
models.  The  evaluation  system  addresses  two  classes  of 
limitations  with  previous  systems:  The  first  is  that  previ¬ 
ous  systems  claim  to  execute  the  object  code  (CCND  pro¬ 
gram)  breadth-first  (hence  allowing  the  inherent  parallel 
features  to  be  measured),  however,  the  actual  evaluation 
models  used  make  several  approximations.  The  second  13 
that  the  parameters  quoted  in  the  evaluations  of  a  pro¬ 
gram  do  not  reflect  possible  alternatives  open  to  language 
implementors,  like  scheduling  policy.  This  results  in  mis¬ 
leading  and  distorted  measurements. 

The  new  evaluation  system  comprises  two  stages: 
an  AND/ORrinterpreter,  which  evaluates  the  program 
breadth-first  producing  a  dump  file,  and  an  analyser  pro¬ 
gram  which  reconstructs  a  parallel  view  cf  the  program 
execution  for  different  execution  models.  The  statistics 
obtained  are  more  accurate  in  two  respects.  The  first  is 
in  the  modelling  of  a  parallel  AND/OR  execution,  which 
allows  us  to  measure  the  inherent  parallel  features  of  our 
algorithm.  The  second  is  in  observing  the  affect  of  al¬ 
ternative  models  of  execution,  pruned  or  non-pruned 
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guard  evaluations;  busy  or  non-busy  waiting;  and  goal 
or  clause  suspension. 

Finally,;  we  highlight  ;the  use  and 'improvement  of  our 
system' by  considering  the  evaluation  of  two  simple  pro¬ 
grams.  The  nature  of 'these  programs 'allows  us  to  con- 
sider  the  theorctical.behaviour  of  their  execution  which 
we  compare  with  that, predicted  by  our  system. .However; 
this  system  was  built  to  allow,  us  to  compare  AI  applica¬ 
tions,  a  detailed  study. of  three,  classes  of  AI  application, 
using  the  system  is  given  in  (14). 
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Abstract  -  A  new  numencal  method  for  delamination  of  composite 
laminates  based  upon  a  Damage  Mechanics  approach  is  jproposed. 
Degradation  of  the  interface  between  two  adjacent  layers  is  introdu¬ 
ced.  This  approach  allows  an  accurate  prediction  of  both  initiation 
and  propagation  of  delamination  with  a  single  model.  A  program  has 
been  realized  to  treat  delamiriatioq  in  the  vicinity  of  weakly  curved 
edges  with  low  numerical  cost.  First  results  arc  presented. 

I.INTRODUCTION 

In' the  design  of  carbon-epoxy  laminates  such  a$,T306-914  the 
mode  of  damage  and  rupture  can  be'  split  into  two  classes; The  first 
class  is  due  to  fiber  nipture,  matrix  and  fiber-matrix  interface  degra¬ 
dations.  The  modelling  of  these  degradation  phenomena  at  the  single 
layer  scale  tan  be  described  using.  Damage  Mechanics 
(Ladevizc,1991),  (Allcn.1987).  (Taljera.1985).  The  second  class  of 
rupture  is  delammation.  This  phenomenon  which  consists  in  the  de¬ 
bonding  of  laycrs.'appears  in  the  vicinity  of  the  edges  of  structures. 

In  the  analysis  of  this  phenomenon,  distinction  is  made  between 
a  initiation  phase  and  a  propagation  phase  of  an  existing  delaminated 
area.  At  present  the  simulation  of  both  phases  relics  on  the  hypothesis 
of  a  perfect  bonding  between  clastic  layers.  The  analysis  of  initiation 
is  quite  qualitative,  in  view  of  the  normal  stresses  obtained  from  an 
elastic  computation  a  more  or  less  important  tendency  is  assigned  io 
different  stacking  sequences  Empirical  criteria,  such  as  ‘’point- 
stress"  or  "average- stress"  are  also  used  (Kim  and  Soni.1984).  They 
seem  to  give  good  results  on  specimen  under  tension,  but  they  relic 
on  weak  physical  basis.  The  propagation  of  an  existing  delaminated 
area  has  been  analyzed  numerically  and  experimentally  through 
Fraciurc  Mechanics  (Bcnzeggagh.1980),  (Wang,  1983).  However 
propagation  modelling  through  Fracture  Mechanics  uses 
computations  of  Energy  Release  Rates  whose  values  are  different 
from  the  experimental  values  under  the  assumption  of  perfect 
bonding  between  layers  (Ne$a,1990). 

The  proposed  method  is  based  upon  the  Damage  Mechanics  ap¬ 
proach  proposed  in  (Ladevfcze,1990),  The  modelling  at  a  structural 
analysis  scale  of  gradual  degradation  phenomena  in  a  material  -  such 
as  T300-914  -  due  tv  the  occunrence  and  the  development  of  micro¬ 
cracks,  allows  an  accurate  prediction  of  both  initiation  and  propaga¬ 
tion  of  delammation  m  a  single  motel.  The  laminate  is  modelized  as  a 
>tackmg  of  single  layers  and  of  interferes  connecting  the  layers.  The 
interface  is  a  two  dimensional  entity  which  ensures  displacement  and 
traction  transfers  from  one  layer  to  another.  Its  influence  is  located 
near  edges  or  defects  where  a  three  dimensional  stress  state  may  oc¬ 
cur  and  lead  to  delamination.  Its  behavior  depends  on  the  angle  bet¬ 
ween  the  fibre  directions  of  adjacent  layers. 

Following  a  "shell"  computation,  delamination  analysis  is  done  in  the 
vicinity  of  the  edge  with  appropriate  boundary  conditions.  Full  ana¬ 
lysis  of  delamination  around  a  circular  hole  has  been  developed 
(Allix,1987).  In  this  work  all  the  non  linear  phenomena  of  degrada¬ 
tion  and  plasticity  inside  the  layers  and  the  interfaces  between  the 
layers  arc  introduced.  This  approach  allows  to  take  into  account  pos¬ 
sible  important  gradients  parallel  to  the  edge  but  it  needs  to  solve  a 
three  dimensional  time-dependant  problem  which  can  be  expensive 
with  a  great  number  of  layers  or  in  the  case  of  parametered  studies. 

The  described  approach  can  be  qualified  of  “simplified"  in  com¬ 
parison  with  the  previous  one  A  Finite  Element  program  has  been 
realized  to  predict  initiation  and  propagation  of  delammation  at  a  low 
numerical  oost.  The  main  assumptions  of  this  method  are : 


-  delammation  is  the  mam  mode  of  degradation ,  only  the  inter¬ 
faces  between  the  layers  can  damage 

-  the  edge  is  weakfy  curved  allowing  to  consider  it  is  locally 
straight ,  the  initial  three  dimensional  problem  becomes  a  two  dimen¬ 
sional  problem  set  into  a  band  perpendreular  to  the  edge, ' 

First  results  are  presented.  The  analysis  of  delamination  in  the 
neighbothood  of  the  free  edge  of  a  specimen  under  tension  has  been 
'treated.  This 'simple  example  allows  comparisons  with  experimental 
results  (KIM  and  SQNI*1984). 

I1.LAMINATE  MODELLING 

The  laminate  is  modelized  at  the  mcso-scale,  l-c  at  the  scale  of  the 
elementary  componcnts_  which  arc  the  single  layer  and  the  interface 
connecting  adjacent  layers. 
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The  single  layer  -  The  single  layer  is  homogeneous  in  the  thickness. 
Its  behavior  is  clastic  with  no  damage. 

The  interlace  -  The  interface  is  a  surface  entity  (of  zero  thickness) 
which  ensures  stress  and  displacement  transfers  from  one  ply  to 
another  (Allix,1987).  In  elasticity  the  interface  is  schematized  by 
means  of  a  model  which  has  been  used  in  older  to  modclize  the  fiber- 
matnx  interface  (Un6  and  lreguillon.l9S  1).  It  depends  on  the  relative 
orientation  of  the  upper  and  lower  plies. 

The  displacement  discontinuities  arc  denoted  by ; 


(UJ  -  U*  -  U“  -  (U|jj  N\  ♦  (U21  54  4  (U31 

The  (F?|,  F?2)  axes  arc  associated  with  the  bisector  of  the  fiber  direc¬ 
tions.  The  damaged  energy  of  the  interface  is  ( >+ denotes  the  po¬ 
sitive  pan ) : 
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k°,  k^,  k^  arc  initial  elastic  characteristics.  If  k°-  k®,  k^  — >  -too  thus 
the  interface  is  perfectly  bonded,  l-e  the  modelling  which  is  generally 
used.  The  conjugate  vanables  associated  with  the  dissipation  are ; 


Model  Mode  II  Modem 


A  simple  modeling  is  to  consider  that  the  damage  evolution  is  gover¬ 
ned  by: 
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i=sup|,?(Y<l+Yi-Yi)1+Y2Y<12) 

where  Yj,  y2  arc  coupling  constants  In  term  of  delamination  modes, 
the  first  term  is  associated  with  the  first  opening  mode,  the  other  two 
with  the  second  and  third  modes  The  damage  evolution  law  is  defi¬ 
ned  through  a  material  function  w,  such  that : 

d  .awQ}  if  d  <1  I  d  a  i  otherwise 

dj  =Yi  w00  ifdi  <  I  andd<  l  ;  d,  =  1  otherwise 

d2  =  Y2w(X)  ifd2<landd<l  ;  d2 s  I  otherwise 

v 

A  first  choice  for  the  material  function  w  is  w(Y)  * 
where  Ye  is  a  critical  energy. 

A  first  delamination  analysis  -  Link  between  Damage  Mechanics  pre¬ 
diction  and  Fracture  Mechanics. 

Under  the  assumption  of  a  damageable  interface  connecting  two 
clastic  layers  under  a  pure  mode  I  loading,  an  analysis  of  a  Double 
Cantilever  Beam'(DCB)  specimen  has  been  realized.*  Because  of 
interface  deterioration  there  exists  a  maximum  value  of  the  applied 
load  P  denoted  by  Pc.  This  limit  is  reached  for  d«l  at  the  tip  of  the 
delaminated  area.  This  means  that  there  is  delamination  propagation 
This  allows  us  to  calculate  the  critical  energy'release  rate  ,  that  is 

G(PC).  For  a  long  enough  crack  one  obtains .  #  2Ye 

A  similar  result  has  been  obtained  for  the  second  mode.  Within  this 
frame  work.  Fracture  Mechanics  appears  as  a  simplified  tool  to  study 
delamination,  under  the  assumption  of  clastic  layers  for  an 
established  delamination. 

HI.NUMERICAL  SIMULATION  OF.  DELAMINATION 

Geometrical  assumption  -  This  study  deals  with  the  problem  of  dc- 
lamination  in  the  vicinity  of  a  weakly  curved  edge.  Thus  the  varia¬ 
tions  of  the  displacement  or  stress  solution  along  the  direction  parallel 
to' the  edge  are  less  than  those  in  the  perpendicular  directions. 
Therefore  the  three  dimensional  problem  is  reduced  into  a  two  di¬ 
mensional  problem  in  a  strip  perpendicular  to  the  edge, 

Numerical  solving  -  The  problem  to  solve  is  non  linear  and  time-de¬ 
pendant.  It  is  due  to  the  damage  modelling  of  the  interfaces  between 
the  plies.  Tlic  contact  between  two  delaminated  plies  is  described  as 

an  unilateral  contact  without  friction.  Critical  points  may  occur  and 
then  a  Riks  algorithm  is  used  to  detect  critical  points  and  to  go  further 
with  a  good  convergence. 

The  numerical  treatment  for  fixed  delaminated  areas  -  the  problem 
being  purely  elastic  -  is  based  upon  the  conjugate  gradient  method. 
The  problem  to  solve  is : 

KX  -  F  with  K  -  Klty+  Kim 

Ki«y  and  Ktet  stiffness  matrix  of  the  layer  and  of  the  interface.  At 
each  step  it  is  solved: 

Kl*y  V  »  R  with  KUy  ■  XUy  i  denotes  the  layer 

Thus  the  computation  reduces  to  parallel  and  separated  computations 
on  the  layers. 

AjmmeAcal  example  •  Free  edge  delamination  of  a  specimen  under 
tension  - 

The  analysis  of  delamination  near  the  free  edge  of  a  specimen 
T300-5208  (30/30,90)$  submitted  to  tension  has  been  chosen  to  il¬ 
lustrate  the  proposal  method.  For  that  example,  it  is  experimentally 
known  that  delamination  occurs  prior  to  transverse  matrix  cracking  or 
Fiber  breaking  (KIM  and  SON!,  1984).  Delamination  first  occurs  and 
propagates  on  the  (90,90)  interface,  an  important  damage  also  ap¬ 
pears  on  the  (-30,90)  interface.  These  results  confirm  the  experimen¬ 
tal  observations. 


^.CONCLUSION 

A  simplified  method  to  analyze  delamination  near  a  weakly 
curved  edge  has  been  presented  .This, engineering  tool  allows  to 
simulate  both  "initiation  and  propagation  in  post-processor  of  an 
clastic  shell  computation .through  Damage  Mechanics.  First  results  are 
encouraging'  but  it  is  necessary  to  make  others  comparisons  with 
experimental  results  to  valid  the  model  and  the  numerical  program. 
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Abstract:  This  paper  presents  a  nunericat  method  to  predict-the 
macroscopic  strength  of  a  composite  in  terms, of  .the  strength  of 
the  Individual:  constituents  and  of' their  arrangement. 


I.  THE  YIELD  OESIGN  PROSLEK 

In  ■  this  paper  we  discuss  the  strength  of  composite  materialsvln 
terms  of  the  strength  of1  Individual  phases-  and  of  their 
geometrical  arrangement.  A  composite  structure,  which  occupies  a 
domain  ft  in  R*,  Is  made  of  a  material  which  is  assumed  to  exhibit 
a  periodic  micro-structure  generated  by  repetition  of  a  unit  cell 
€Y,  where  €  Is  a  small  parameter  measuring  the  site  of. the 
heterogeneities  and  Y  Is  the  tnlt  cell  (see  (1)  or  (2J  fora 
detailed  exposure  of  these  classical  notations  In  homogenization 
theory).  This  composite  structure  Is  submitted  to  bod/  forces  A/Q 
proportional  to  a  loading  parameter  A  and  we  search  for  the 
adalssibte.values  of  A  At  the  levet  of,  the  u\lt  cell  the  strength 
domain  of  each  constitutive  phase.  In  which  the  stress -tensor  Is 
constrslned  to  remain,  Is  known:  to  simplify  ve  essune  thst  It  can 
be  defined  by'the  Von  Mlses  criterion 


“(e(u))  dx;  u  *  0  cnTl  ,  l(u)«  t 


}■ 


%<Y)  <k(y)  where 


(r«i, "!,)"■ 


i*ere  c P  is  the  devlatortc  part  of  <7,  k(y)  takes  constant  values, 
different  from  one  phase  to  the  other.  The  classical' theory  of 
yield  design  (31  leads. to  the  following  dual  definitions  of  ihe 
limit  load  A  e: 

A*  ■  Sup  (  Al  there  exists  <7swh  that: 

div(0>  *  Afg  a  0  In  ^  (x)  <k^  a.e.  x  In  (1> 

■Inf  ^J^iTe(e(u»  dx  ;  dlv(u)*0  In  1}  u  ■  0  on  ,  l(u)  ■  lj(2) 

■  “§•«,.  eii'u>  * 

L(u)  ■  (^u  d tx. 

2.  THE  MACROSCOPIC  STRENGTH  DOMAIN 

Characterizations  analogous  to  (1)  or  (2)  of  Ahc*  -  tin  A*  can 

€  -0 

be  given  with  the  help  of  the  theory  of  I-convergence  CAT: 

A  to*  a  $*p  (  Al  there  exists  ersuch  that; 

dlv(O)  ♦  >f„  .  0  Inf},  <7(x>  Gp  Kca  ....  *  JnJiy  (3) 


>  horn. 


such  that  there  exists  &.  div(0)»  0  In  Y,  O.n  is 
anti-periodic  on  8y,  <  <7>  ■  Z,  cr^y)  <k(y)  a.e.  In  Y>,  <4) 

lTh”<E)  -  Inf  <  Tl(y, £•«<»))>,  <.>  m  -ip  .  dy.  <5) 
w  periodic  -1'1  ’ 

1r(E<«(w»>0 

TTha®  Is  the, support  function  of  P  *!**:  irhctt<E>  ■  S\*>  <2:S>. 

zeph«* 

Expressed  -  in  physical  terms  (3)  states  that  the  macroscopic . (or 
homogen lzed)-  strength  of  the  composite  material  Is  defined  by  the 
strength  domain  P  ho*  ,  which  can  •  be  computed,  at  least 
theoretically,  by  (4)  or  (5).  It  should  be  noted  that  in  absence 
of  any, further  Information  on.the  precise  constitutive  law  of  the 
phases,  Pho*  is  only  an  upper  bound  of  the  strength  capabilities 
of  the  ccmposite.  However  if  the  individual  phases  are  elastic 
perfectly  plastic,  It  can  be  shown  that  the  overall  behavior  of 
the  composite  is  elastic  plastic  (with  hardening)  and  that  every 
stress  state  Z  In  the  domain  Pho™  can  effectively  be  reached,  for 
this  reason  the  use  of  this  homogenization  theory  is  permissible 
for  djctlle  constituents  (such  as  netat  matrix  composites)  but 
should  be  considered  with  care  for  brittle  constituents. 

The  numerical  determination  of  ?hon  can  be  performed  directly  on 
the  variational  definition  (5)  of  its  suport  function,  but  the 
functional  to  minimize  ha*  only,  a  linear  growth  and  Is  not 
differentiable  everywhere.  To  overcome  the  difficulty  we  use 
another  method  based  on  the  solution  of  an  elastic  perfectly 
plastic  problem  IS)  ,  for  which  several  f EH  codes  are  available, 
fix  an  arbitrary  corplianca  tensor  s  end  solve  the  evolution 
problem: 

dlv(<7(y,t)>  «  0,  <<7(y,t)>  •  2<t),  <7.n  anti-perlodic  on  8y^ 
e(u* (y,t)>  ♦  E(t)  ■  s<7(y,t)  ♦  ePfy.t),  u*  periodic  r (6) 


o'  <y,tl  <l(y>,  ep(y,t)  ■  ><y.OCTt'(y.f),  A£o, 


(6)  can  be  solved  along  at  least  two  paths.  First,  it  can  be 
solved  along  a  radial  path  fn  the  space  of  macroscopic  strains: 

E(t)  Is  constant  and  prescribed,  and  (6)  is  solved  for  u*.  <7  (fra* 
which  Z  Is  deduced).  Second,  (6)  can  be  solved  along  a  radial  path 
In  the  space  of  macroscopic  stresses:  2<t)/Z  ^  (t)  and 

2{t):E(t)/2^<t)  a  e  constant  and  prescribed  and  (6)  yields  u* ,  <7, 

E  and  Z„.  In  both  cases  the  solution  converges  when  t  ~**CCland 

the  limits  e<oo>  satisfy: 

£«>)  £r  hc«,  <£(oo y£  -  2<oo »  <o  for  every  £  in  P  hoa.  (7) 

Each  run  of  the  algorithm  determines  a  point  2<«>)  on  the  boundary 
of  pk°ato3Cther  W|tf,  the  outer  normal  vector  E(°°)  to  Phcoat  this 
point. 
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3;  unidirectional  co<posii£s- 

UhTdlrectlonal  composites  exhibit-  gecmetrical  And  eaterlal 
properties  which  ere  Invariant  u-ider  translation  along  a  specific 
direction.  In  the .,11  near  setting,*  this- Invariance  can  be  used  to 
reduce  the  3-d.c<wputetions  to  2*d  computet  Ions  (plane  strain  and- 
antl:plabe,pfcbl«>s)-  In  the, nonlinear  setting  considered  here- the 
superposition  principle  is  r»  more  valid  or^- the  problem  remains 
3-deerofonaU  In  Its  full  generality.  However  for  macroscopic 
stresses-in  the  fora  (where  ej,ls  the  invariant  direction) 

*  %2 !  ^2<e1®c2  ?  '2®'l>><9> 

«  j  r* 

problem  <6>  has  a*  solution  E  of  the  same  for*  (9)  and  u  Is 
2-di  mens  local  (u^  ■  0,  u^and  u^,  depend  on  yt  and  y2  only).  (6>  Is 
well  posed  In  the, context  of  generalized  pltne  strains.  A  state  of 
strain  Is  said  to  be  «  generalized  plane  strain  If  <ef  16]): 


%.*  Wt'V'  CCm  1,2»  Hi  *  e33y3  *  vhcrc  hi  ,s  8  constant. 

This  non  classical  framework  of '  generalized  plane  strain  gives 
sufficient  flexibility  to  consider  macroscopic  stress  Stales  with 
independent  components  In  the' longitudinal  direction  and  In  the 
transverse  direction,  which  U  not,  .possible  with  usual  plane 
strain  computations. 


4.1  Unidirectional  metal  matrix  composites:  u«  consider  a  eetol 

matrix  composite  with  unidirectional  fibets.  The  matrix  Is  assured 
to  be  plastic  with  yleld'stress’C^yiUe  the  fibers  are  assumed  to 
be  elastic.  The  fibers  are  arranged  along  a  triangular  array  (the 
emit  cell  Is  hexagonal).  The  yield  stress  In  transverse  sfsple 
tension  has  been  computed  by  the  Above  oethod.  figure  l  shows  this 
yield  stress  versus  the  angle  between  the  tensile  dliectlon  and 
one  axis  of  the  lattice.  The  reinforcement  effect  In  the 
transverse  direction  due  to  the  fibers  Is  rather. small  end  depends 
only  weakly  on  the  volume  fraction  of  the  fibers  vf.  This  f* 
to  the  presence  of  planes  of  maximal  shear  Into  the  matrix  as  long 
as  vf  does  not  exceed  0.68. 


l.i  A  *  Mode  60° 

I  o  vf*  0,25 

\  ♦  vf*  0,50 


iure  1:  yield-stress  of  a  uildln 


i  versus  orientation  for  vf«0.2S.  yf»Q. 


4.2  Cylindrical  cavities:  Ue*  consider  a  matrix  material,  with 
yield'  stress  O q  ,  weakened  by  a  periodic  array  of  cylindrical 
cavities  (radius  R)  arranged  along  a  square  lattice  (spacing  2a). 
Ue  Investigate  the  yield  locus  of  this  danaged  material  under 
macroscopic  stresses  ■  ^2*°'  ^y^O,  other  "  °*  f*9ure  2 
shows  these  yield  loci  I  for  different  values  of  the  pattern  ratio 
R/a.  The  prediction  ofsthe  Gurson's  model  |7].has  been  reported  in 
dotted  line. 


«  Gurson 

o  R/a*  0,3  ./ 

*  R/a- 0.5 

A  R/a- 0.7  *\ 

// 

/fx*\  *■  ■ 

«V  /  *  •  9 

.///  f  / 


V  f  /  211 /cO 
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Abstract . 

The  mixed  FEM  proposed  in  order  to  compute  accu¬ 
rately  the  3D  stress  field  in  a  composite  plate  are  based, 
on  modifications  of  the  principle  of  the, complementary 
energy.  The  main  difficulty  of  mixed  methods  is  to,  con¬ 
struct  admissible  fieldfsuch  that  the  final  linear  system  is 
invertible  and  the  method  converges . 


We  first  use  the  3D  complementary  energy  principle 
and  we  suppose  that  the  in-plane  stresses  arc  linear  func¬ 
tions  in  each  *ayer(this  is  the  cruder  physical  hypothesis 
we  can  do).A$  the  admissible  stress  fields  must  satisfy  the 
equilibrium  equations  in  the  composite.wc  get  a  2D  for¬ 
mulation  with  16  generalized  stress  variables  per  layer, 
defined  on  the  middle  plane  of  the  composite  (to). 

Let  0*  be  a  family  of  triangulations  of  to.  In  order  to 
satisfy  the  equilibrium  equations, we  have  to  construct 
symmetrical  tensors  T  a  and  fields 

q  €  (L ,  div  q  e  L*(o>)  (1) 


T«(L ,  divTs(LJ(<o))J ,  <iiv(divT)eLJ(o>) .  (2) 


In  order  10  satisfy  (I),we  can  use  the  Ramn-Hiomsj 
space  ((2)) : 


D,(K) 


OVi(K))J  + 


*1 

ft 


Pjt-t(K)  . 


(3) 


(  K  is  a  triangle,  k  is  an  integer  21  ,  Pt.,(K)  is  the  space 
of  polynomials  of  degree  a- 1  defined  on  K ), 

In  order  to  satisfy  (2),  we  can  use  the  following  lemma: 
Lemma 

Lei  T  belong  to  (P2(K))/  » then  T  is  uniquely  determined 
by: 

a)  the  moments  of  order  0  of  on  K  (  )  » 

b)  the  moments  of  order  0,1,2  of  T^nanfi  on  the  edges 

c)  the  moments  of  order  0,1  of  T^natfi  on  the  edges  of 
K , 

(n  3  »  t  c  (OaaU  are  the  unit  normal  and 

tangent  vectors  along  the  edges  of  K) . 


If  we  choose  the  spaces : 

E»f  =  {T«  <X-2<m}  I  T|K  e  (P2(K)),4 .  K  s  e* 


jt4) 


<■  )- 

E33 = ju  s’l>)? U|K  €  Pq(K)  ,Ks8,  j-  '(6) 

for  the  variables  related  to  c0p  ,  <s^d .  cr33 ,  and  if  we,  use 
Lagrange  multipliers  to  satisfy  the  continuity  of  the  d  o  f. 
'normal  traces'' characterizing; the  spaces  (P2(K))/'and 
Dj(K)  we  can  prove  that  the  final  linear  system  is  inverti¬ 
ble  . 

The  admissible  spaces  can  be  enlarged.  Let 
if** 

(WoP  =  ^tM“iP)^|“W  (a'P=1>2)  ’ 
if  rwe  replace 

<Pj(K»/  by  (P2(K)),‘  +  R<,1  +  R<,j  m  (4) 

Dt(K)  by  D2(K)  in  (S) 

Po(K>  by  P,<K)  in  (6) 

(method  number  2) 
or 

(P2W)/  by  ADG(K)  +  R$i  +  R$j  in  (4) 

D,(K)  by  D2(K)  in  (5) 

Pp(K)  by  P,(K)  in  (6) 

(method  number  3) 

(here  ADG(K)  is  the  Amold-Douglas-Gupta  spacc(l)), 
after  slightly  modifying  the  lemma  and  the  Lagrange 
multipliers,  the  same  conclusion  holds. 

In  method  number  3, we  arc  able  to  construct  3D  stress 
fields  verifying  exactly  the  equilibrium  equations  in  the 
composite , 

If  we  simplify  the  formulation  in  method  number  2  by 
considering  only  plane  elasticity  problems,  or  in  method 
number  3  by  considering  only'  bending  of  plates  prob¬ 
lems,  we  can  prove  convergence  results. 

As  far  as  the  implementation  of  the  method  is  con¬ 
cerned,  all  the  d.o.f,  related  to  the  stresses  can  be  elim¬ 
inated  locally.  As  the  d.o.f.  on  the  edges  of  the  triangles 
are  related  to  2  elements  we  can  divide  the  mesh  m  a  few 
clusters  of  elements  and  use  an  element  by  element  itera¬ 
tive  method,  in  order  to  solve  the  final  linear  system.  This 
technique  is  very  efficient  on  parallel/vector  supercomput¬ 
ers  . 

Some  tests  have  been  made  (plane  elasticity 
problcms.bcnding  of  plates  problem$),and  the  results  are 
all  excellent,  on  both  mechanical  and  computational 
efficiency  points  of  view . 

Ill  Arnold  D„Dougla$  J., Gupta  C.,  *A  family  of 
higher  mixed  finite  element  methods  for  plane  elasticity ", 
Num.  Math.-45-1984 . 

(2)  Thor. us  J.M.,  Thesis,  Universite  de  Paris  VI- 
1977  . 
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Abstract . 

Domain  decomposition  methods  have, very, interest* 
ing  features  to  solve  three  dimensional  composite,  struc¬ 
tural  analysis  problems.  Indeed,  . the  composite  feature  of 
the  material  leads  to  a  natural  mesh  substructuring,  with 
subdomains  that  have  similar  stiffness  matrices,  *  that 
entails  optimal  efficiency  with  domain  decomposition 
solvers  on  parallel  computers 

In  this  paper,  we  present  some  tests  performed  with  a 
domain  decomposition  method  via  Lagrange  multipliers, 
developped  in Jl),  on  a  distributed  memory  parallel  com¬ 
puter,  the  Intel  iPSC2 , 

The  method  is  based  upon  a  hybrid  variational  formula¬ 
tion  of  the  linear  elasticity -equations  that  consists  in 
enforcing  the  continuity  constraint- along  the  interface 
between  the  subdomains  by  introducing  the  Lagrange 
multiplier  of  this  constraint . 

The  Lagrange  multiplier  appears  to  be  equal  to  the 
interaction  force  between  the  subdomains . 

The  solution  process  consists  in  solving  the  con¬ 
densed  problem  related  to  the  Lagrange  multiplier  on  the 
interface  by  the  conjugate  gradient  algorithm.  So,  the  con¬ 
densed  interface  operator  does  not  need  to  be  actually 
assembled,  and  the  computation  of  the  matrix-vector  pro¬ 
duct  involves  the  solution  oMocal  independant  problems 
in  each  subdomain.  The  use  of  a  direct  skyline  solver  in 
the  subdomains  allows  a  good  vector  efficiency  . 

As  the  Lagrange  multiplier  just  enforces  the  continuity 
constraint,  the  Ladyzenskaia-Babuska-Brczzi  condition 
does  not  need  to  be  satisfied,  as  far  as  the  discrete  dis¬ 
placements  fields  arc  continuous  at  the  interface  . 

We  analyze  the  performance  obtained  with  different 
splittings  for  a  three  dimensional  composite  cantilever 
beam,  on  both  numerical  and  parallel  processing  points  of 
view.  These  tests  prove  that  this  method  performs  much 
better  than  the  classical  substructures  method.  At  last  we 
compare  the  results  obtained  with  various  implementation 
strategies  in  order  to  define  the  optimal  one . 
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Abstract 

The  homogenization  techniques  are  particulary  powerful  for 
studying  tint  modehzing  the  composites  materials  behaviour.  They 
lead  to  ihe'sesolution  of  problems  depending  on  a  basic  cell  which 
characterizes  the  medium  studied.  These  problems  are  usually 
solved  by  use  of  a  finite  element  method.  In  this  study  we  show  how 
to  solve  the  celt  problems  by  using  the  boundary  integral  method. 
Contrary  to  classical  finite'  element  methods  formulated  in  dis - 
placements,  the  unknowns  are  here  the  fields  of  displacements  and 
stress  vector  on  the  boundary  of  the  basic  cell  and  at  the  inter¬ 
faces  between  its  components.  Thus,  the  boundary  integral  method 
appears  as  a  well  suited  tool  to  analyse  the  damage  phenomenons 
which  may  appear  at  the  interface. 

Introduction 

The  main  difficulty  with  studying  composite  materials  is  their 
high  level  of  heterogeneity  compared  with  the  dimensions  of  the 
structures,  making  any  numerical  computation  (for  example,  by  fi* 
nite  element  method)  prohibitive  if  not  impossible.  The  technique 
commonly  used  to  get  around  this  obstacle  consists  of  substituting 
for  this  highly  heterogeneous  material  a  homogeneous  one  having 
’’average*’  mechanical  properties  that  depend  on  the  mechanical 
properties  and  geometry  of  the  various  constituents.  This  is  the 
homogenization  process  (Duvaut  (4j,  Sanchez  Palencia  (11)).  Fur¬ 
thermore,  by  a  localization  procedure,  the  method  allows  air  easy 
computation  of  the  microscopic  field  of  stresses  and,  in  particu¬ 
lar,  of  stress  forces  at  the  boundaries  between  heterogeneities  and 
matrix.  One  can  then  show  the  overstresses  phenomenon  may 
initiate  decohesions  at  the  microscopic  level. 

In  the  first  part  of  this  paper  we  show  the  F.E.M.  results  in¬ 
duced  by  these  techniques.  This  application  refers  to  the  example 
of  the  woven  composites  ^Paumelle  (9)).  In  order  to  obtain  a  good 
approximation  of  the  local  fields,  the  cost  of  the  computations 
may  be  expensive  in  this  case. 

The  second  part  is  devoted  to  the  implementation  of  the 
boundary  integral  method  (Mikhlin  [8],  Brebbia  (2),  Dautray  (3], 
Hartmann  (6j)  for  homogenization.  We  show  that  it  allows  to  ob¬ 
tain  a  good  approximation  of  the  stress  vector.  Thus,  it  may  be 
a  useful  tool  to  develop  models  which  take  in  account  a  damage 
localized  at  the  interfaces  between  the  components  of  the  basic 
cell. 


1.  Homogenization  by  F.E.M.  : 
application  to  woven  composites 


A  lot  of  computations'  about  composites  materials  (unidirec¬ 
tional  composites,  syntactic  foams,  . .)  have  been  made  in  our 
laboratory.  They  have  been  realized’by  use  of  F.E.M.  and- the 
When  it  is  a  matter  of  considering  composites 
whose  basic  cell  needs  to  be  tridimensional  (as  woven  composites), 
the  induced  computations  may  be  expensive.  Moreover  our  aim 
being  to  predict  the  sensitivity  to  damage  near  the  interface  of 
the  components,  the  mesh  has  to  be  sufficiently  refined  in  order 
to  obtain  a  very  good  approximation  of  the  stress  vector. 

As  the  discretization  of  these  problems  is  made  by  use  of 
linear  pentahedral  and  hexahedrab  Lagrange  finite  elements,  the 
stresses  we  obtain  are  constant  over  each  of  the  elements.  So,  it 
is  difficult  to  obtain  a  satisfying  approximation  of  the  interface 
stress  vector.  This  one  was  nevertheless  well  computed  by  use  of 
the  numerical  technique  developed  by  G  Duvaut  and  F.  Pistre  [5). 
To  this  way  we  have  used  the  adapted  post  processor  developed 
in  our  laboratory. 


Some  results  presented  in  figure  I  show  the  norm  of  the 
tangential  traction  introduced  by  an  imposed  macroscopic  stress 
(o'12)  ®  100  Mpa.  An  analyse  of  ♦he  results  we  have  obtained,- ab 
lows  a  prediction  of  the  damage  sensivity  by  sliding  or  decohesion 
at  the  interfaces. 


Figure  1, 


•  A  damage  model  by  sliding. 


In  order  to  modelize  a  sliding  phenomenon  between  the  jarn 
and  the  matrix,  which  is  often  experimentally  observed,  we  have 
introduced  a  behaviour’s  law  of  the  interface  allowing  a  tangential 
elastic  sliding  without  normal  unstiking  (Lene,  [7])  It  writes  . 


(|vtf])=x0 

where  [((v,)r)]  (respec  H^n)])  is  the  tangential  jump  of  the 
displacement  (normal  jump)  and  tt  the  tangential  stress  vectoi 
at  the  interface. 

This  phenomenon  has  been  taken  in  account  inside  a  software 
we  have  developed. 

We  present  ou  figure  2.  the  values  of  the  homogeneous  equiv 
alent  modulus  versus  the  parameter  of  sliding  k  It  appears  that 
there  exist  limit  values  noted  and  Am  of  this  scalar  which 
determine  three  regions.  The  first  ( k  >  hw)  corresponds  to  a  be 
haviour  without  sliding;  the  third  (k  <  JfcM)  to  a  total  decohesion 
between  the  yam  and  the  resin;  the  middle  region  (JtOT  <k<  kj,t) 
to  a  partially  debonded  yam. 
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Figure  2, 

In  order  to  develop  models  more  sophisticated  than  the  pre¬ 
vious  one  m  the  framework  of  the  F.E.M.,  we  should  be  led  to 
consider  complex  problems  with  expensive  resolutions.  Our  aim 
is  to  develop  realistic  damage  models  at  the  interface,  constructed 
above  the  stress  vector.  Thus,  it  appears  necessary  inside  the 
framework  of  the  homogenization  to  develop  a  more  adequat  me¬ 
thod. 

2.  Integral  equations  for  homogenization 

To  describe  the  homogenized  behaviour  of  composite  mate¬ 
rials,  we  have  used  the  integral  formulation  of  the  cell  problems 
(Paumelle  |10)).  The  numerical  method  associated  with  the  in¬ 
tegral  formulation  is  called  Boundary  Elements  Method,  The  un¬ 
known*  functions  of  the  formulation  are  the  displacement  u  and  the 
stress  vector  r(n)  on  the  boundaries.  These  one  are  the  external 
side  of  the  basic  cell  and  the  interface  between  the  components. 

In  order  to  numerically  implement  this  technique,  we  have 
constructed  three  bidimensional  elements  with  a  constant,  lin¬ 
ear  and  quadratic  interpolation  and  one  three  dimensionnal  con¬ 
stant  element.  One  can  note  that  in  the  case  of  homogenization 
we  have  introduced  some  specific  boundary  conditions  and  man¬ 
age  the  presence  of  some  domains  which  need  a  special  numerical 
treatment. 

The  results  shown  here  are  related  to  unidirectional  compos¬ 
ite  materials  and  are  carried  out  on  a  SUN  4  workstation.  As 
one  can  see  they  are  quite  satisfying  The  approximations  of  the 
homogenized  coefficients,  the  microfields  and  the  stress  vector  on 
the  interfaces  are  as  good  as  the  same  results  obtained  by  finite 
element  method.  One  test  is  interesting  to  discern  the  difference 
obtained  between  constant,  linear  and  quadratic  interpolatkrs 
It  consists  in  studying  the  convergence  speed  of  the  homogenized 
coefficients  versus  the  number  of  the  nodes  of  the  mesh  (Fig  3)  It 
shows  that  the  results  with  constant  elements  are  as  much  power¬ 
ful  than  the  others.  One  can  notice  that  over  200  nodes  the  results 
tend  to  the  same  values  which  are  the  finite  element  values. 


chanical  properties.  The  analyse  of  results  obtained  by  F.E.M 
shows  a  nearly  pe'rfecl'agreemenl  (less  than  1.3  %)  with  those  ob¬ 
tained  by  the  boundary  integral  method.  The  agreement  of  th< 
results  is  nearly  perfect , 

This  study  has  emphazed  the  high  level  of  quality  of  constant 
element.  ‘Moreover,  we  have  seen  that  the  number  of  nodes  doesn’t 
need  to  be  high.  Thus,  the  number  of  degrees  of  freedom  stay 
weak.  One  can  also  note  that  it  is  very  easy  to  add  some  nodes 
in  a  part  of  the  mesh  (interface  for  instance)  without  altering  the 
whole  mesh.  And  theJdircctly  computation  of  the  stress  vector  on 
the  interfaces  presents  a  real  advantage  for  a  fuither  analyse  oi 
the  microphenomcnon  of  damage. 
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tatahllahsoat  at  th#  aathaaatloal  baala  at  thaa#  alaaaata  haa 
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Abstract 

The  highly  nonlinear  behaviors  of  steel  structures  axe  sim¬ 
ulated  by  the  finite  dement  method  which  employs  a  bilinear  de¬ 
generated  shell  dement  with  the  reduced  integration  technique. 
The  calculated  problems  are  as  follows  :  1)  a  ash  wort  hi  ness  of 
axially  compressed  square  tubes  with  various  thickness  ratios  2) 
ultimate  strength  of  tabular  column  to  H-beam  connections  3) 
rei"  for  cement  effect  on  transverse  collision  collapse  of  box  beams. 
The  obtained  results  are  compared  with  the  experimental  results 
or  the  empirical  formula  in  order  to  discuss  the  quantitative  va¬ 
lidities  of  the  present  analysis. 

1.  Outlines  of  the  Present  Finite  Element  Code  {!] 

The  present  finite  dement  code  for  the  quasi-static  large 
deformation  analysis  is  based  on  the  following  algorithm :  1)  in¬ 
cremental  theory  by  the  updated  Lagrangian  approach  2)  zero 
normal-stress  projection  [2]  3)  orthogonal  hourglass  control  [3]  4) 
additional  stiffness  resisting  in-plane  rotation  (4].  In  the  present 
formulation,  the  updated  Kirchhoff  stress  increments  calculated 
at  each  loading  step  are  transformed  into  Jaumann  rates  of  Eu¬ 
ler  stresses.  ’Zero  normal  stress  projection’  is  the  technique  to 
maintain  the  plane  stress  condition  during  this  transformation. 
It  should  be  noted  that  the  experimental  true  stress-true  strain 
relation  must  be  used  as  the  constitutive  equation. 

2.  Crashworthiness  of  Axially  Compressed  Square  Tubes  [5] 

Finite  dement  analysis  for  crashworthiness  of  square  tubes 
under  axial  compression  has  been  conducted.  A  square  lube  un¬ 
der  axial  loading  collapses  with  a  progressive  buckling,  but  it  will 
cost  too  much  to  simulate  this  phenomena  directly.  Therefore, 
assuming  that  progressive  buddings  occur  simultaneously  on  the 
tube,  the  mean  crushing  stress  for  a  unit  periodic  area  of  buck¬ 
ling  patterns  has  been  calculated  with  a  minimum  computing 
cost.  In  Fig.  1,  the  calculated  mean  crushing  loads  expressed 
in  terms  of  the  tensile  strength  are  compared  with  the  experi¬ 
mental  values  for  various  thickness  to  edge  length  ratios.  The 
bold  line  in  the  figure  is  the  empirical  formula  given  by  Magee 
et  al.  (6).  The  calculated  mean  crushing  stresses  are  about  25 
%  smaller  on  the  average  than  the  experimental  values.  This 
difference  is  caused  by  the  above  assumption  as  well  as  the  limi¬ 
tation  of  ’thin  walled  assumption’,  because  the  crush  analysis  is 
always  accompanied  with  fairly  larger  strains  than  those  in  the 
ordinary  ultimate  strength  analysis.  However,  there  is  a  good 


agreement  between  the  calculated  and  experimental  results  from 
a  qualitative  point  of  view.  Fig.  . 2  and  Fig.  3  are  the  experimen¬ 
tal  and  the  numerical  crushing  deformations  respectively,  which 
agree  well  with  each  other. 


Fig.  1  Mean  crushing  stresses  of  axially  compressed  square 
tubes 


Fig.  2  Axially  compressed  square  tubes 


Fig  3  Calculated  crushing  deformations  of  axially  compressed 
square  tubes 
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3.  Ultimate  Strength  of  Tabular, Column  lo'Htbcam  Connec¬ 
tions  [7]  '  - 

Influences  of  a  dianxtcr-tbickcess  ’ralio  cf  a  circular- col¬ 
umn  on  the  cltimste  strength  and  the  ccilapse  inode  have  been 
studied.  Dimensions  and  boundary  conditions  for  the  calculated, 
connection  are  shown  in  Fig.  4  and  Table  1.  Calculated  four 
specimens  hare  different  diameter-thickness  ratios  for  the  col¬ 
umn.  The' half  region  of  the  connections  was  subdivided  into  , 
eleroents-  The  calculated  load-displacement  curves  are  shown  in 
•  .Fig.  5.  The  predicted  maximum  loads  by  the  empirical  formula 
[8]  for  Case.l  aad'Cas*>-£  are  34.4  tons  and  47.9  tons  respec¬ 
tively  which  agree  well  with  the  calculated  values.  In  Fig.  6  the 
calculated  coUapse,modesiK>  shown.  Depending  on  diameter 
•thickness  ratios,  the  four  specimens  have  collapsed  in  different 
modes.  In  Case  1  and  2  where  columns  are  relatively  thin,  tj;e- 
circular  columns  locally. buckled.  In  Case  3,  both  the  flanges  of 


ribtd) 


Fig.  4  Ttabulai  column  to  H-beam  connection 


the  H-beam  and  the  circular  column  yielded  at  the  same  time. 

14$ 

Therefore,  it  can  be  said  that  the  strength  of  the  circular  column 
and  the  3-beaxm  arc  well  balanced  in  Case  3.  In  Case  4  where 

121 

i  i 

the  column  is  relatively  thick,  the  torsional-flexural  buckling  with 
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a  local  buckling  of  the  flanges  have  occurred,  while  the  column 
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has  deformed  little.  The  present  method  would  be  useful  for  the 

!/ 

optimal  design  of  these  connections. 

(0 

;/ 

i 

Cm*  2 

_ _  »./.»,•  41.9 

4.  Reinforcement  Effect  on  Transverse  Collision  of  Box  Beams 

49 
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The  reinforcement  effect  on  the  transverse  collision  col¬ 

20 

r  c*** 1 

t  1 

lapse  of  box  beams  has  been  studied.  A  spot  welded(Case  3) 
or  a  line  welded(Ca$e  4)  reinforcement  plate  is  attached  on  the 
colliding  surface  of  box  beams  and  their  strength  has  been  com- 

•  »  49 

Fig.  5  Load-displacement 

<9  «9-  ICO  a  (••) 

curves  for  the  connections 

j  /  pared  with  that  of  box  beams  with  no  reinforcement  plates.  The 

loading  apparatus  and  the  calculated  box  beam  are  shown  in  Fig. 
7  and  Fig.  8,  respectively.  The  box  beams  are  assumed  to  col¬ 
lide  with  a  rigid  circular  column  at  a  speed  of  25  km/h.  In  this 
section,  the  quasi-static  algorithm  used  in  the  former  sections  is 
extended  to  the.  dynamic  transient  analysis  based  on  the  explicit 
time  integration  scheme.  In  addition,  the  influence  of  a  strain 
,  rate  is  taken  into  account  by  the  elasto-viscoplastic  flow  theory, 

'  and  the  nodal  degrees  of  freedom  are  shifted  from  the  clement 

i  mid-surface  to  the  outer  surface  at  the  welded  points.  The  ob¬ 

tained  load-displacement  curves  are  compared  with  experimental 


T*We  1  Dimeado&s  of  calculated  connection 
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(c)  Case  3  (d)  Case  1 

Fig.  6  Collapse  modes  for  the  connections 
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maximum  loads  in  Fig.  9.- It  is  obvious  that  the  present  analy¬ 
sis  explains  the  difference  of  the  ultimate  strength  of  four  tested 
specimen*. 

5.  Concluding  Remarks 

The  numerical  results  for  highly. nonlinear  behaviors  of 
steel  structures  subjected  to  large  deformations  have  been  briefly 
described.  It  is  obvious  that  the  finite  element  code  developed  in 
the  present  study  can  be  a  powerful  tool  for  the  basic  study  of 
crush  phenomena  as  well  as  the  optimum  design  of  actual  struc¬ 
tural  components.  Other  results  are  published  in  {l),(5]  and  [10]. 
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Fig  -7  Transversely  colliding  box  beam 
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Fig.  8  Calculated  box  beams 


Fig.  9  Load-displacement  curves  for  the  box  beams 
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NUMERICAL  ANALYSIS  OF  TECTONIC  DEFORMATION  BY  MAGMA' 
LNTRUSION  AND  PREDICTION  OF  VOLCANIC  ERUPTION 
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TADAHIKO  KAWAIL  .  - 
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l-3,Ka'giira2aka,ShinjyuIcu<  • 

Tokyo  162yJapan 

Abstract-Numerical'  analysis  by  discrete  models  -  was  used 
to  reveal  the  shape. and  depth  of  underground  magma 
body  from  pattern  of  ground  movement  of  the  Izu- 
Oshima  volcano,  Japan.  Measurement  of  areal  pattern  ot 
ground  movement  of  volcanic  surface  is  useful  for  tracing' 
of  underground  movement  of  magma -body  and  for 
prediction  of  volcanic  eruption. 

1  INTRODUCTION 

Discrete  models  are  suitable  for  numerical  simulation-^  of 
tectonic  deformation  of  the  Earth’s  crust  which  is 
essentially  discontinuous  with  faults,  joints  and  various 
boundaries  of  rocks.  The  Japanese  Islands  locate  near  of 
boundaries  between  major  platcs:Eurasia  plate, 

Philippine  Sea  plate,  Pacific  plate  and  North.  American 
plate.  Active  interaction  between  plates  induces  frequent 
earthquakes  and  volcanic  activities,  in  Japan.  On  the 
November  15th,  1986,  the  first  volcanic  eruption  since 
1974  started  in  the  summit  crater  of  the  Izu-Oshima 
volcano,  Japan,  Earthquake  swarms,  volcanic  tremors, 
etc,  signaled  the  earlier  subsurface  magmatic  activity,  but 
the  subsidence  of  central  surface  of  the  volcano 
prevented  the ,  short-term  prediction  of  the  eruption  as 
people  believed  that  upraise  of  subsurface  magma  body 
induces  upheaval  of  ground,  only. 

In  many  cases,  the  inflation  of  central  surface  of  volcano 
proved  of  the  most  reliable  precursor  of  volcanic 
eruption.  However,  Koide  and  Bhattachazji( 1975) 

^pointed  out  that  rock  mass  could  be  depressed  in  the 
central  area  by  the  increase  of  magma  pressure  above  the 
magma  body  which  has  long-vertical  depth  with  narrow 
horizontal  section. 

n  NUMERICAL  ANALYSIS  BY  DISCRETE  MODELS 

In  this  paper,  the  authors  applied  the  rigid-body-spring 
method  (RBSM)  by  Kawai(1980)  for  a  realistic  simulation 
model  of  tectonic  effect  of  subsurface  magma  intrusion 
where  differential  movements  along  fractures  have  vital 
role  for  deformation  of  the  crust.  In  the  two  dimensional 
calculation,  an  elliptical  hole  is  assumed  in  a  rectangular 
plate(Fig.l),  The  v/all  of  hole  is  applied  with  magma 
pressure.  At  the  start  of  calculation,  magma  pressure  is 
equal  to  the  Iithostatic  pressure  at  the  top  of  magma 
reservoir.  Then,  magma  pressure  increased  step  by  step 
with  the  rate  of  0.5  Mpa.  The  aspect  ratio(b/a)  of  magma 
body  were  taken  as  1,  5  and  10.  The  results  of  calculation 
show  that  fractures  start  at  the  top  of  magma  body  of 
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higher  aspect  ratios  (that  is  dike-like  magma  body)  and 
extend  upwards  forming  fracture  zone  of  funnel  shape 
(Fig  2).  The  calculated  vertical  displacement  of  earth’s 
surface  indicates  that  a  simple  dome  uplift  is  formed  over 
a  circular  magma  body,  but  that  a  central  trough-like 
depression  with  flank  uplifts  is  formed  over  magma 
bodies  of  higher  aspect  ratio$(Fig.3) 


Fig.l  Model  for  calculation  of  magma  intrusion. 


Fig  2  Extension  of  fractures(h«*  100m, b« 200m) 
around  the  magma  intrusion.  Waved  fines  indicate 
tensile  fractures  and  thick  lines  denote  shear  fractures. 
The  numbers  in  circle  indicate  steps  when  fractures  are 
formed. 
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Fig.3  Vertical  displacement  of  surface  over  subsurface 
magma  iritru$ion(h*»  100m, b«= 200m). 


SI  TROUGH-LIKE  SUBSIDENCE  AS  INDICATOR 
OF  DIKE-LIKE  MAGMA  INTRUSION 

In  the  Izu-Oshima  volcano,  trough-like  subsidence  during 
the  time  span  which  includes  the  eruption  event,  is  clearly 
shown  from  the  comparison  of  leveling -.data.  The  result 
of  calculation  indicates  that  the  width  of  trough-like 
depression  is  nearly  twice  of  the  depth  of  top  of  magma 
intrusion(F)g.4).  As  the  width  of  trough  in  the  Izu- 
Oshima  is  about  4  Km  in  the  narrowest  part,  the  top  of 
magma  intrusion  is  expected  at  the  depth  of  about  2  Km 
in  the  south-eastern  part  of  Izu-Oshima  volcano(Fig.5). 

The  maximum  subsidence  of  about  30  cm  is  expected  by 
the  magma  pressure  of  about  6  MPa(Fig.6). 

The  results  of  three-dimensional  calculation,  also,  clearly 
indicate  that  the  trough-like  subsidence  in' the  Izu- 
Oshima  volcano  is  a  kind  of  "keystone  graben"  where  the 
trough  depression  is  formed  -by  splitting  of  rock -mass  by 
the  intrusion  of  dike-like  magma  wcdge(Fig.4,S). 

Therefore,  the  subsidence  of  summit  of  Izu-Oshima 
volcano  indicated  the  surface  intrusion  of  magma  and 
increase  of  magma  pressure,  that  is,  the  subsidence  is  also 
the  precursory  phenomena  of  volcanic  eruption  in  some 
cases. 

E  CONCLUSION 

Numerical  analysis  by  discrete  models  is  used 
successfully  to  simulate  the  formation  of  Keystone  graben 
by  subsurface  intrusion  of  dike-like  magma  body  beneath 
the  Izu-Oshima  volcano,  Japan.  Measurement  of  areal 
pattern  of  ground  movement  of  volcanic  surface  is 
effective  for  the  prediction  of  volcanic  eruption,  only  with 
adequate  numerical  model  for  magma  intrusion. 


Fig.4  Sketch  of  vertical  displacement  of  volcanic  surface 
due  to  a  subsurface  dike-like  intrusion  'calculated  from 
the  three-dimensional  RBSM  model. 


Fig.5  Extension  of  fractures  around  a  dike-like  magma 
intrusion  estimated  from  the  three  dimensional  RBSM 
model. 


Fig.6  Model  of  graben  subsidence  over  a  large  dike-like 
magma  intrusion  in  the  Izu-Oshima  volcano.  The  upper 
diagram  shows  the  vertical  displacement  measured  in 
the  Izu-Oshima  volcano  with  the  theoretical  curve 
where  the  magma  pressure  is  6MPa. 
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application  of  discrete  limit  analysis  to  reinforced  concrete  beams 


Tadahiko  KAWAJ,  Masatoshi  UEDA,  Norio TAKEUCHL  Haninon  HIGUCHI,  Hiroaki  KITO 

Science  Univ.  of  Tokyo  TakenakaCo.  MeiseiUniv.  Abe-Kogyo  Co.  Osaka  City  Univ. 


Reinforced  concrete  beams  subject  to  shear  fracture  were  analyzed  by  a  new 
discrete  limit'analysis  method,  A  comparison  was  made ,  between  the  results  for 
the  failure  mechanism,  failure  process  and  maximum  strength  obtained  through 
analysis ,  by  two  types  of  i  ncremental  load  methods, 

I.  INTRODUCTION 

A  family  of  new  discrete  models  were  proposed  by  Prof.  Kawai  in  1977  on  the 
basis  of  the  experimental  evidence  for  solids  under  the  ultimate  state  of  loading  (1  J. 
These  models,  named  RBSM,  consist  of  rigid  bodies  and  two  types  of  connection 
springs,  one  of  which  resists  dilaiatioria!  deformation,  while  the  other  resists  shear 
deformation. 

The  failure  mechanism  of  reinforced  concrete  structures  is  complicated  and  it  is 
extremely  difficult  to  analyze  their  behaviour  using  existing  analysis  methods, 
which  are  usually  based  on  the  continuum  mechanics  approach.  The  authors  have 
been  engaged  in  the  analysis  of  various  types  of  reinforced  concrete  shear  walls 
using  RBSM's  of  reinforced  concrete  clemem$(2>3,4). 

Modifications  are  r  equired.  in  the  analysis  algorithm  for  the  incremental  load 
method  in  analyzing  deep  beams,  for  example,  in  which  the  shear  cracks  in  the 
concrete  form  severe  fracture  mechanisms.  There  arc  two  types  of  algorithms  for 
the  incremental  load  method.  One  is  Macal's  method,  in  which  the 
unbalanced  forces  are  redistributed  during  the  iteration,  and  the  other  \ 
is  Yamada’s  method,  in  which  the  yielding,  failure  and  unloading  are  a( 

judged  for  each  integration  point.  In  this  paper,  a  proposal  Is  made  for  — 
a  new  incremental  load  algorithm,  which  is  that  of  Yamada’s  method 
modified  to  accommodate  cracking  and  recontact,  and  the  results  of  the 
analysis  for  a  deep  beam  using  the  constitutive  relation  for  reinforced 
concrete  obtained  with  the  previously  developed  RBSM’s  are  compared 
with  test  results  to  show  its  validity.  ** 

2.  STRESS-STRAIN  RELATION  OF  CONCRETE  11 
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Fig.  2  Yield  Failure  Surface 
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Fig.  3  Calculation  oflncremcntal  Load 


The  stress-strain  relationships  of  concrete  in  uniaxial  compression 
arc  approximated  by  a  trilinear  curve  as  shown  in  Fig.l.  The  stresses  at 
the  first  and  second  yielding  levels  are  Fc/2  and  0,95  Fc,  respectively. 
The  tension-stiffening  effect  is  also  taken  into  consideration. 

The  failure  surface  of  concrete  loaded  in  its  triangular  plane  is 
divided  into  7  regions  as  shown  in  Fig.2  according  to  the  states  of 
stress.  Stages  2  and  10  are  decided  according  to  the  normal  strain  Ca 


<*)  Model- 1  (wfchom  Shoe  RemXwced  Bin)  (b)  Model- 2  (w*h  Shew  Rcnforcod  Bin) 

Fig.  4  Reinforced  Concrete  Beam  Used  in  Analysis  (Umf  cm) 
Table  1  Concrete  Material  Constants  (Units,  kgf.  cm) 
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Table  2  Reinforcement  Material  Constant  (Units,  kgf.  cm) 
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3.  ANALYSIS  ALGORITHM 

When  stress  is  released  by  cracking  and  crashing,  the  force 
released  may  in  turn  cause  cracking,  compression  failure  and 
slips.  Here  a  proposal  is  made  for  an  algorithm,  .which  is  a  - 
modified  version  of  Yamada’s  incremental  load  method  and 
which  can  add  the  released  force  to  the  remaining  load  while  - 
counting  the  load  and -can  simultaneously  take  account  of  slip 
failure,  cracking  and  compression  failure.  As  shown  in  Fig.  3, 
the  minimum  load  increase  r- corresponding  tp  various  failure- 
.  conditions  at  a  given  load  increase  P  is  obtained  and  the  released  - 
force}F'is  added  to  the  remaining  load.  The  remaining  load  at 
step  n  is  expressed  by-  the  following  equation. 


p>*'  -  “'rfu-DfP  +  ,4  t*Su<-ri>ru->*l  (I) 
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The  process  is  repeated  until  tlie  given  load  increase  P  and 
■the  released  force  are  all  used  up. 


Fig,  5  Relation  between  Working  Load  and  Deflection  at  Span  Centre 
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4.  NUMERICAL  EXAMPLE 

The  deep  beam  test  piece  models.  Models  1  and  2,  with  and 
without  stimips  were  analyzed  by  two  types  of  incremental  load 
methods,  the  normal  Macal’s  method  (Algo.  I)  and  the  method 
proposed  here  (Algo.  2).  The  material  constants  used  ,  in  the 
analysis  are  given  in  Tables  1  and  2.  The  load-displacement 
relationships  for  Models  1  and  2  arc  shown  in  Fig.  5,  It  can  be 
seen  from  the  figure  that  the  differences  between  the  incremental 
load  methods  originate  in  the  cracking  zone  and  the  proposed 
algorithm  (Algo.  2)  gives  solutions  that  correspond  well  to  the 
failure  load  maxPexp  obtained  in  the  lest.  The  deformation  mode 
of  Model  1  around  the  time  of  collapse  indicate  thatthc  results 
obtained  with  Algo.  1  diverge  from  the  test  results  neanhe 
centre  of  the  span  (Fig.  6),  The  failure  charactenstlcs  at  the 
time  of  collapse  given  by  Algo.  2  are  shown  in  Fig.  7,  The 
formation  of  the  resistance  mechanisms  in  the  deep  beams  is 
well  represented  in  both  Models  I  and  2. 


Fig.  6  Deformation  Mode  of  Model  1  around  Time  of  Collapse 
(pn  16  tons,  1/2  Zone) 
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5.  CONCLUSION 

It  was  confirmed  through  analyses  of  the  behaviour  of 
reinforced  concrete  beams  using  RBSM's  that  they  are  capable 
of  closely  representing  maximum  shear  strengths  and  failure 
characteristics  of  reinforced  concrete  structures.  In  particular, 
when  combined  with  the  analysis  algorithm  proposed  here, 
it  was  made  possible  to  cany  out  satisfactory  evaluation  of  the 
conditions  surrounding  the  progress  of  failure  such  as  the 
occurrence  of  released  force  due  to  cracking  and  recontact. 
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APPLICATION  OF  THE  RIGID  BODIES-SPRING  MODELS  TO  THREE  DIMENSIONAL  FRACTURE  ‘MECHANICS 

At suslii  KIKUCHI*,  Tadahiko  KAWAI  *  *  and  Noriyuki  SUZUKI* 

♦Plant-Engineering  &  Technology  Bureau,  Nippon  Steel  Corporation,  Otemachi  1,  Tokyo,  Japan 
•**lst  Faculty  of  Engineering,  -Science  University  of  Tokyo,  Shinjuku-ku,  Tokyo,  Japan 


Abstract-The  Rigid  Bodies-Sprmg  Models  (abbrevi¬ 
ated  as  the  RBSM  hereafter)  are  applied  to  the 
phenomenon  of  crack  growth  m  the  three-dimensional 
field  and  demonstrated  the  effectiveness  of  the  mod¬ 
els.  The  potential  fracture  criteria  for  the  stable 
and  unstable  crack  growth  are  discussed.  Then  nu¬ 
merical  examples  for,each  fracture  problem  are  pre¬ 
sented. 

X.  INTRODUCTION 

A  family  of  new  discrete  models  named  as  the 
RBSM,  which  was  proposed  by  one  of  the  authors  Kawai 
in  1976  (1),  has  been  proved  to  be  effective  and 
suitable  especially  for  analyses  of  nonlinear 
problems  with  large  plastic  deformation .  With  respect 
to  the  constitutive  law  of  the  connection  springs, 
it  i3  believed  that  the  Mohr-Coulomb's  law  is  the 
most  suitable  criterion  for  any  material  if  the  size 
of  tho  elements  is  taken  reasonably  small  accord¬ 
ing  to  accumulated  results  of  numerical  analysis. 

In  this  paper,  the  practical  application  of 
the  three-dimensional  RBSM  based  on  the  previous 
work  (2)  is  presented,  and  the  potential  fracture 
criteria  are  also  discussed. The  stable  crack  growth 
in  an  arbitrarily  shaped  initial  crack  and  the 
unstable  crack  propagation  of  brittle  material  are 
simulated  using  these  criteria. 

II.  OUTLINE  OF  THE  RBSM 

The  detail  of  the  RBSM  has  been  already  de¬ 
scribed  m  tho  previous  paper  {1} .  The  outline  of 
the  theory  for  the  three-dimensional  problems  is 
as  follows: 

For  the  3D  problems  the  tetrahedral  rigid  ele¬ 
ment  is  used.  Each  element  has  6  degrees  of  freedom 
at  its  centroid,  which  are  assumed  to  be  infini¬ 
tesimal  .  For  the  analysis  of  crack  growth,  the  tensile 
normal  stress  or  strain  is  employed  as  the  crite¬ 
rion  parameter. 

III.  NUMERICAL  RESULTS 
&._Canst itutiy&.law  and  plastic  region 

In  order  to  achieve  an  appropriate  constitu¬ 
tive  low  for  the  RBSM,  the  numerical  and  experimen¬ 


tal  tests  of  the  standard  compact  test  specimen  were 
carried  out.  The  calculated. plastic  region  around 
the  crack- tip  with  each  yield  condition,  the  Von 
Mises'  type  or  the  Mohr-Coulomb's  one,  was  indi¬ 
cated  in  Figil.  As  far  as  the  calculated  plastic 
zone  is  concerned,  the  Mohr-Coulomb's' type  shows 
fairly  good  agreement  with  the  result  £3 1  ( 4 ]  ana¬ 
lyzed  by  the  FEM  and  is  considered  more  suitable 
for  this  kind  of  problem. 


Fig.l  Calculated  plastic  region  of  crack  tip 

B.-CriteriQa.Qf-atable  crack-growth 

The  criterion  of  the  stable  crack  growth  in 
ductile  material  was  obtained  by  comparing  analyti¬ 
cal  results  for  the  previous  test  specimen  with 
experimental  ones.  First,  the  load  at  which  the  crack- 
tip  started  propagating  in  the  experiment  (a  big 
inflection  point  on  the  experimental  COD-curve)  is 
denoted  by  Pc,  then  the  maximum  strain  around  the 
crack-tip  in  the  RBSM  analysis  for  the  specimen  at 
the  load  Pc  is  defined  as  the  criterionEc  of  the 
stable  crack  growth  (Fig. 2) . 


Fig. 2  Criterion  of  stable  crack  growth 
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C.  Sirtmlation:of_V_ishaped  Crack  Growth 

In  order- to  verify  the  generality  of  the  con¬ 
stitutive  law  and  the  criterion,  a  V-shaped  ini¬ 
tial  cracked  body  with  the  tensile  loading  was 
analyzed.  The  crack  growth  was  simulated  m.the 
elasto-plastic  analysis  by  cutting  the  springs  of 
which  the  tensile  strain 8n  reached  the  fracture 
criterion  £c  derived  in  the  last  subsection.  (Fig. 3) 


Fig. 3  calculated  crack  growth  of 
V-shaped  initial  crack 

P.,  Application., to  Dynamic-Crack  Propagation 


Mohr-Coulomb ' s  law  may  ^be  more,- suitable,  than  the 
Von-Mises ' .  The  cutting  criterion  for  the  springs 
of  the  RBSM.is  obtained  by  analyzing  the  compact 
test  specimen  itself,  and  the  effectiveness  of  the 
criterion -is  verified  m^the  application  to  an 
arbitrarily  shaped- crack.  In  an  application  to 
dynamic  crack  propagation  of  brittle  material,  the 
simulated  crack  propagation  is  m  reasonable  agree¬ 
ment  with  the  actual  crack  propagated  m  the  resid¬ 
ual  stress-field  of  the  welded  plate  m  a  water  vessel . 


Crack  Initiation 


$©j>**c. 


(Dyntmlo  truly*!*  ot  th*  crack 
procugatioautlng  th*  RBSM-Sh*l( 
*)*m»nt*  and  th*  criterion 
determined  by  the  *n*ly*l*  of  th* 
t*»t  *p*cim*ix) 


Fig. 4  Simulation  of  the  dynamic 
crack  propagation 


A  dynamic  simulation  of  unstable  crack  propa¬ 
gation  in  the  actual  water  vessel  was  carried  out. 
At  first  the  residual  stress  distribution  of  a  welded 
plate  was  calculated  by  using  the  general  purpose 
FEM  program  MARC  and  then  obtained  results  were 
converted  to  the  mesh  subdivision  for  the  RBSM-shell 
model  (5) .  Brittleness  of  the  material  was  so  strongly 
evident  that  the  normal  stressCn  was  employed  for 
the  criterion  of  the  fracture  and  the  critical  stress 
value  Oc  was  determined  in  the  same  manner  as  the 
previous  subsection. 

Fig. 4  shows  the  numerical  results  of  the  crack 
propagation.  An  initial  crack  existed  Along  the  right 
side  of  the  welded  rectangu^  ur  plate  and  the  spring 
at  the  initial  crack  tip  wa3  cut  by  force  in  the 
beginning  of  the  transient  analysis.  The  simulated 
route  of  crack  propagation  corresponded  with  the 
actual  one. 

IV.  CONCLUSION 

Aiming  at  the  practical  application  of  the  RBSM 
the  constitutive  low  for  the  three-dimensional  crack 
problems  is  discussed.  The  calculated  plastic  zone 
of  the  crack-tip  in  our  results  indicate  that  the 
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The  Limit  Analysis  of  Tapered  Structural  Members  and  Frames  by  a  Discrete  Model 
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Abstract-This  research  aimed  at? the  developaent  of  a 
d i sere te  model  for  liait  analysis  of  thin-walled  aeabers 
and  frames.  This  aodel,  having  both  geometrical  and. material 
nonlinearity,  can  deal  with  large  deformation  analysis.  As 
a  result,  it  was  shown  that  the  new  discrete  aodel  was  able 
to  easily  determine  the  development  of  a  plastic  region  in  a 
cross  section  in  a  load  carrying  problem,  and  it  was  able 
to  consistently  trace  the  equilibrium  path  considering  from 
the  stage  of  elastic  deformation  to  plastic  collapse  by 
numerical  example. 


<nu  «<yn.  0) 

for  the  same  state  of  deformation,  where  dll*  is  the 
■virtual  work  as  a  continuum. 

When  a  member  is  in  an  elastic  state,  it  is  assumed  that 
the  k-th  axial  spring  takes  charge  of  the  partial  cross- 
sectional  area  Aik  and' is  attached  to  the  center  of 
gravity  of  the  grid.  At  thisvtiac,  theaxiai  spring  constant 
is  set  according  to 

Kt.-EAi./Si  .  (2) 


1.  INTRODUCTION 

Recently,  many  different  procedures  have  been  widely  used 
for  the  discrete  analysis  of  a  contlnuua,  but  finite 
elcaent  aothod  has-been  accepted  as  the  «ost  powerful 
analysis  method  available. 

On  the  other  hand,  there  is  the  rigid-body  spring  model  as 
the  limit  analysis  proposed-from  the  standpoint  of  a 
dlscontinuum” ” .  The  RBSH  assumes  the  structural  system  to 
comprise  rigid  bodies  and  springs,  and  .enables  easy 
formulatation  the  geometrical  and  material  nonlinearity  and 
determination  of  the  development  of  a  plastic  region. 

This  paper  proposes  a  beam-column  modol  which  consists  of 
the  rigid-body  spring  system.  The  ultimate  strengths  of  the 
tapered  steel  members  and  frames  have  been  calculated  by 
this  model”"”. 


n.  NEW  DISCRETE  MODEL 

In  a  rigid-body  spring  model,  a  member  Is  divided  into  a 
finite  number  of  elements,  and  the  elements  themselves  are 
assumed  to  be  rigid  bodies.  The  neighboring  dements  aro 
connected  with  a  spring  system  for  resisting  the  relative 
motion  on  the  boundary  planes;  In  addition,  mechanical 
characteristics  are  given  to  this  spring  system. 

In  this  report,  the  author  proposes  the  rigid-bar  element 
model,  as  shown  in  Fig. 1.  .This  model  utilizes  a  spring 
system  composed  of  axial  springs  as  a  distributed  spring 
system,  and  the  torsion  spring  of  St.  Venant  and  shear 
springs,  arranged  in  horizontal  and  vertical  directions,  as 
a  concentrated  spring  system.  In  order  to  take  warping 
deformation  into  consideration,  the  assumption  of  rigid 
elements  was  partially  rejected,  and  unit  warping  o>  was 
calculated  as  a  continuum. 


Fig.l  Rigid-bar  element  modol 

The  determination  of  the  spring  constants  should  be 
carried  out  using  the  energy  principle,  and  it  is  suggested 
that  they  be  selected  so  that  the  virtual  work#  IK  by  the 
spring  system  given  in  Fig.l  becomes 


where  E  is  the  modulus  of  elasticity  and  $j  is  the  length  of 
spring  element  i. 

The  torsional  spring  constant  and  the  shear  spring 
constant  are  given  as  follows  respectively. 


K,i-GJ/Si  \ 

KSi-KSi-GA/Si  * 


(3) 


Where  G  is  the  elastic  shear  modulus  and  J  is  the  torsion 
constant. 

for  the  treatment  of  this  model  in  an  inelastic  region, 
the  following  assumptions  aro  Introduced; 

a)  The  yielding  of  materials  is  to  be  determined  only  by 
.axial  stress;  the  effect  of  shearing  stress  is  neglected. 

b)  The  stress-strain  relationship  between  a  hysteresis 
type  body  and  a  perfectly  elasto-plastic  body  Is  used. 

c)  An  linearized  residual  stress  distribution  is  used. 

d)  The  bending  and  bending- torsional  rigidity  must 
conform  to  the  tangent  modulus  theory.  Also,  the  torsional 
rigidity  of  St.  Venant  must  conform  to  the  plastic  flow 
theory. 


S.  LIMIT  ANALYSIS  OF  TAPERED  BEAM-COLUMNS 

When  solving  problems  dealing  with  a  tapered  structural 
member,  generally,  the  member  is  divided  into  several 
elements  and  treated  as  a  collection  of  elements  of  uniform 
cross  sections.  Then  the  member  is  approximated  by  elements 
whose  cross  sections  change  In  the  stepwise  manner. 

RBSM  can  basically  be  treated  in  the  same  manner  as  the 
procedure  explained, above.  As  shown  in  Fig. 2,  since  the 
mechanical  characteristics  of  the  cross  section  of  each 
element  of  the  tapered  member  are  replaced  with  the  spring 
constant  of  each  ‘spring. element’,  the  elements  of  tapered 
members  can  bo  considered  to  be  equivalent  to  the  elements 
of  uniform  cross  section  found  between  the  centers  of 
gravity  of  adjacent  rigid  elements  inclusive  of  the  spring 
systems.  tew 


Fig. 2  Spring  element  of  a  tapered  member 

As  the  method  of  changing  the  cross  section,  a  tapered 
member  whose  web  changes  linearly  in  the  direction  of  the 
length  of  the  member  was  evaluated.  As  a  variable  parameter. 
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the  web  height  ratio  7,  the  ratio  of  the  web  height  at  the 
minimum  cross  section  to  that  at  the  maximum  cross  section, 
was  used. 

As  an  example  of  practical  calculation,  the  displaceaent 
relationship 1  of.  a  simply  supported  bean-column  with  a 
rectangular  cross  section  subjected  to  uonoaxjai  bending. 
Here,  r  =*0.5,  M*  0.2MP  (plastic  moment),  "and ;fesidual 
compressive  stress  distribution  <rr<=  0.4 ay  were  used. 
The  number  of  elements  divided  was- 20.  Under' these 
conditions,  the  results  shown  in  Fig.3  were  obtained. 


Fig. 3  Deformation  behaviour  ot  lapcred  beam-column 

The  dotted  line  in  the  figuer  shows  the  results  for  the 
member  whoso  cross  section  was  the  same  as  the’  cross  section 
of  tho  central  segment  of  the  taperod  member.  Compared  with 
the  tapered  member,  a  high  critical  load  was  obtained  with 
the  uniform  beam-column.  This  can  bo  explained  by  the  fact 
that  nonelastic  behavior  devolops  earlier  In  the  tapered 
member  than  It  does  In  the  uniform  member. 

1Y.  LIMIT  ANALYSIS  OF  FRAMES, 

In  the  case  of  the  rigid  body  spring  model,  first,  the 
model  Is  constructed  by  assuming  plastic  hinges  which  can 
accommodate  the  possible  range  of  plastic  yield  of  the 
structure  and  by  providing  mechanical  characteristics  to  the 
spring  system  at  the  hinges. 

As  a  practical  example.  a  problem  of  the  ultimate  strength 
In  the  direction  of  tho  span  of  the  main  tower  of  a 
suspension  bridge  Is  presented.  Tho  main  tower  receives 
horizontal  reactive  force  due  to  the  horizontal 
displaceaent  via  the  main  cable  at  the  top  of  the  tower;  a 
model  such  as  that  shown  in  Fig. 4  can  be  constructed. 


Pig, 4  Tower  model  of  suspension  bridge 

The  cable  can  be  modeled  using  an  axial  spring  whose 
spring  constant  Is  K»  -Elyy/h’rw  (rw  "0.002).  The 
object  of  the  analysis  is  the  main  tower,  in  which  tho 
relationship  between  the  axial  force  at  the  top  of  the 
bridge,  P,  and  the  forced  displaceaent,  d,  is  given  by  the 
following  equation  considering  the  entire  suspension  bridge. 

P/Py  -42l.4«?/h)*+2.752(J/h>  +0.295  (4) 


The  tower  consists  of  a  monobox- type  member (115  X170  X&ra) 
with  h^2.8«.  the  yield  load  of  Py  -101.74tw  . 

The  critical  load  calculated. by  the  present  model,  the 
experimental  result  and  the  value  obtained  by  the 
difference  method  are  compared  in  Fig. 5.  The"  F -6  curve 
obtained  by  the  model  agrees  well  with  that  obtained  by 
difference  method;  they  are  represented  by  a  single  line. 
The  horizontal  reactive  force  of  the  cable  obtained  using 
the  model  in  terms  of  absolute  values  was  smaller  than  that 
obtained  experimentally.  The  tendencies  of  the  behavior. of 
the  F-  6  relationship  as  determined  by  calculation  and 
theory  were  essentially,  identical.  Also, -the  critical,  axial, 
force.  Pcr/Py. calculated  using  the  model -essentially 
agreed  with' that  of-'  the. experimental’  result. 
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V.  CONCLUSIONS 

Tho  results  of  theso  analyses  are  summer ized  as  follows. 

(1)  When  spring  constants  aro  determined  by  applying  the 
variation  principle,  there  is  a  possibility  of  the  spring 
system  model  becoming  too  stiff  or  too  flexible.  However, 
tho  rigid-bar  element  model  can  uniquely  select  spring 
constants. 

(2) For  ultimate  strength  problems  of  tapered  structural 
members  and  frames,  the  development  of  a  yielded  region  in  a 
cross  section  can  bo  easily  dealt  with;  in  addition,  a 
numerical  solution  having  sufficient  accuracy  is  obtained. 
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Abstract  oiling  on  the  experimental  evidence  of  solids  under  the 
ultimate  state  o{  loading,  KAWAl  proposed  a  family  of  new  discrete 
models  in  1976.  These  models  consist  of  rigid 'bodies  and  two  type  of 
connection  springs,  one  of  which  resist- the  dilatational  deformation, 
while  the  other  shear  deformation.-ln  this  paper,  application  of  these 
models  is  proposed  to  soil  and  rock  mechanics. 

^INTRODUCTION 


ui  =  (ux,  vx  0X)‘ ,  Uj  sx  (tlj  l>3  $})*  ,  6^(6,  $„)* 

Bi  ss  [  ] 

B  [  ft.  7*1  -Ii6r-y2)+”tj(2“*j)1 

~  (  h  W}  J 

h  m  co$(7, z) cos(y, y) ,  mx  =  cos(y.r),  ntj=  co$(p,y) 


To  consider  the  soil  and  rock  materials  as  continua  they  are  gener¬ 
ally  too  nonuniform,  inhomogeneous  and  eby  to  slip  internally  under 
applied  loading.  The  finite  element  method  treat  the  sod  and  rock 
masses  on  the  continuous  materials*  On  the  other  hand,  the  simplified 
method  is  a  practical  but  rather  crude  method  which  has  been  devel¬ 
oped  by  ingenious  use  of  elasticity  and  plasticity  theories  bemuse  the 
stress  field  is  determined  without  consideration  on  the  displacement 
and  strain  hysteresis. 

Considering  such  a  status  of  existing  methods  KAWAl  proposed  a 
family  of  new  discrete  models.  In  these  models  structures  or  solids 
are  idealized  as  a  set  of  rigid  elements  interconnected  by  two  types  of 
spring  system,  one  of  which  resist  the  dilatational,  the  other  shearing 
deformation.  Therefore  sliding  or  separation  of  two  adjacent  elements 
can  be  made  easily. 

In  case  of  the  granular  materials  influence  of  crack  initiation  due  to 
tensile  load  often  can  not  be  neglected.  In  view  of  this,  present  au¬ 
thors  developed  a  new  algorithm  which  may  be  applicable  to  analyze 
the  coupled  failure  of  solids  due  to  slippage,  tensilt  cracking  and  solids 
contact.  In  this  paper  application  of  these  models  to  analysis  of  foun¬ 
dation  structures  will  be  attempted  and  the  general  method  of  discrete 
limit  analysis  will  be  described  with  some  verification  examples, 

2.FORMULATION  OF  TWO  DIMENSIONAL  RDSM 

For  simplicity,  consider  two  dimensional  rigid  triangular  clement  of 
RBSM  as  shown  in  Fig.l*  They  are  assumed  to  be  equilibrium  with 
external  loads  and  reaction  forces  of  the  spring  system  which  Is  dis¬ 
tributed  over  contact  surface  of  two  adjacent  bodies. 


(a)  before  deformation  (b)  after  deformation 
Fig.l  Two  dimensional  rigid  triangular  element 

Rigid  displacement  field  xs  assumed  in  each  element,  whose  nodal 
displacement  are  given  by  the  displacement  (u,v,{3)  of  the  centroid  as 
shown  in  Fig  1.  Therefore,  the  relative  displacement  sector  6  of  the 
arbitrary  point  P  an  be  derived  as  follows; 


On  the  other  hand,  the  following  relation  are  obtained  from  the 
definition  of  the  spring  constants: 


(t  =  D-6  (  2) 

<r= (*■•»-*•>'  •  d=(oT  t„j 

1  -  ilzipe  E 

“  (1  +  >0(1  -  2k)A  1  •’  (l  +  l-)ft 
where  A  a  h\  +  by  is  the  projected  length  of  a  vector  connecting 
centroids  along  the  normal  drawn,  and  r,j  and  a*  is  tangential  and 
normal  stress  respectively. 

Basing  on  the  above  preliminaries,  the  strain  energy  expression  of 
the  in-plane  element  V  can  be  obtained  as  the  following; 

Vail  (6'-D«)iSo  iu|  /  («'  •  D .  H)iSn,  (  3) 

2  Jlit  l  Jlt , 

Applying  Castigban’s  theorem,  the  following  stiffness  equation  can 
be  derived; 


(4) 


where  K  is  a  (6  x  6)  symmetric  matrix  and  P  is  a  nodal  load  vector 


3.CONSTITUTIVE  LAW 


In  the  RBSM,  present  authors  considered  that  reaction  stresses  in¬ 
ducted  are  not  tensor  but  vector,  and  consequently  Coulomb’s  condi¬ 
tion  may  be  most  realistic  constitutor  law  for  such  a  discrete  system 
representing  granular  materials.  In  case  of  the  granular  materials  like 
soils,  it  is  commonly  observed  that  normal  stress  is  relieved  as  soon  as 
it  reaches  <7»  as  shown  in  Fig  2. 


Consequently  the  yield  condition  of  soiMike  materials  can  be  modi- 
(I)  fied  as  shown  in  Fig.3. 

Fox  determination  of  spring  constants  in  shear  failure  plactic  flow 
rule  is  adopted. 
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4.PROPOSED  ALGORITHM 

A  new,  algorithm  is  proposed  by  applying  the  incremental  loading 
procedure  developed  by  YAMADA.  In  Yarrada’s  method,  necessary 
rate  of  the  load  increment  to  yield  the  most  heavily  stressed  element 
can  be  calculated  by  stress  distribution  and  load  increment  at  the 
present  stage  as  shown  in  Fig,4.  From  this  condition  the  required  rate 
of  load  increment  r  can  be  calculated.  Once  the  stress  point  lies  on 
the  failure  carve,  it  may  move  according  to  the  plastic  flow  rule  until 
the  unloading  occurs. 


Fig.4  Rate  of  load  increment 


Similar  calculation  must  be  model  in  case  of  the  tensile  failure  as 
shown  in  Fig.4.  AU  the  possible  rate  ofload  increment  corresponding 
to  failure  patterns  should  be  calculated  in  all  the  elements  and  the 
minimum  rate  ofload  increment  must  bedetermined  at  each  step  as 
the  rate  of  load  increment  corresponding  to  the  next  step. 

Stress  relaxation  is  usually  follows  by  the  tensile  failure.  If  Y-mada’s 
method  is  applied  to  this  stress  relaxation  process  exactly,  endless  cal* 
culation  should  be  repeated  corresponding^  the  tensile  failure  which 
may  occur  continuously. 

The  load  P’**  at  the  (i+I)lh  step  can  be  calculated  by  using  load 
P'  and  rate  of  load  increment  n  at  the  present  step  (i)  as  follows* 


»  X-r 

rtotii  =  S<nKI~r.)))rfc  (  8) 

k*i  i*0 

The  calculation  must  be  repeated  until  rf0f4j  =  l  m  each  stage  of  - 
loading.  Fig  5  shows  outline  of  the  flow  diagram  of  the  proposed  algo¬ 
rithm., 


F>g.5  Flow  diagram  of  the  present  algorithm 


P'«=(l-r,)P’  (  5) 

Therefore,  in  case  of  shearing  failure,  residual  load  P"  at  the  n,* 
step  can  be  obtained  by  using  initial  load  P  as  follows: 

P'  =  ff((l-r,))P  (ro  =  0)  (  6) 

••0 

This  is  the  same  result  with  Yamada’s  method. 

On  the  other  hand,  it  stress  relaxation  will  caused  by  crack  initiation, 
relieved  forces  are  taken  into  account  as  follows. 

P*=n((>-r.))P  +  f;(fl((I-r.)]P'-)  (  7) 

i«0  k«l 

where  F*  is  the  relieved  force  at  step. 

Here  rtc<Ai  implies  the  cumulative  rate  of  load  increment  and  it  can 
be  defined  as  follows 


5.NUMERICAL  EXAMPLE 

Fig  8  shows  tho  numerical  model  fer  a  anker  block  and  material  con¬ 
stants  used.  In  the  present  analysis  effect  of  tho  gravitational  load  was 
neglected.  The  horizontal  load  at  the  anker  block  was  applied  in  step 
by  step  mannnner  taking  the  incremental  as  lOt,  lot,  5t,  5t  and  St. 
Ffg7.  shows  the  slip  line  pattern  of  thr  solution  at  the  step  5.  In  this 
figure  it  can  be  seen  that  not  only  slip  lines  but  also  tensile  cracking 
miy  spread  on  the  front  region  of  a  given  block  and  may  be  consid¬ 
erably  different  from  that  of  the  previous  solution,  the  displacement 
mode  corresponding  to  this  step  are  shown  in  Fig.8.  From  this  figure 
separation  of  the  soil  on  the  rear  wall  of  block  can  be  seen, 

6.CONCLUSION 

A  new  algorithm  was  developed  by  which  coupled  failure  due  to  shear 
and  tensile  loads  can  be  treated.  Although  a  numerical  exsample  is 
very  simple,  it  is  believed  that  the  present  method  may  be  useful  to 
failure  analysis  of  various  soil  and  rock  engineering  problems. 


Fig.6  Numerical  model  and  material  constants 


Fig.7  Slip  line  pattern 
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Fig  8  Displacement  mode 
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Abstract-A  numerical'-  procedure  for  tbe  slope 
stability  analysis  which  combi os  the  slice  method 
in  R3SM  is  shown  in  this  paper’. 

Fea'atre  .'n  “he  present  method  is  as  follows: 

(1) Plastic  work  is' evaluated  only  along  arc. 

(2) Tensile  failure  is  also  taken  into  account. 

(3)  Mesh  division  is  unnecessary. 

(4)  It  is  designed  to  compute  with  data  of  slice 

method.  ~ 

1. INTRODUCTION 

Slico  methods  are  often  applied  when  slope 
stability-  problem  is  discussed.  The  Fellenius 
method  is  most  popular  among  slice  method? ,  but 
in  this- method  the  force  between  each  piece  of 
slice  is  "not  considered.  Therefore  the-  methods 
•such  as  the  Bishop,  the  Janbu.  the  Specuer-wore 
proposed  and  they  are  ^considered  the  effects  of 
statically  indeterminate  force,  Plastic 
condition,  however,  is  not  taken  into  account  in 
these  slice  methods,  because'  they  arc  'based  on 
the  limit  equilibrium  method. 

Finite  element  method  is  also  used-  to 
analyze  slope  stability  problems,  but  there  ar'i 
some  problems  that  definit  slipe  line  may  not  be 
obtained  clearly  and  making  input  data  is 
laborious  work  comparing  with  slice  methods. 

On  the  other  hand,  RBSM  (Rigid  Bodies  - 
Spring  Model)  has  been  proposed  as  a  physical 
model  especially  suitable  for  limit  analysis  of 
solids  and  by  which  mutual  slip  movement  of  two 
adjacent  element  can  be  simulated.  It*s 
usefulness  has  been  duly  verified  by  solving  many 
collape  probrems  where  slippage  is  considered 
problems.  The  solution  is  influenced  by  the  mesh 
division  and  the  upper  bound  solution  is  obtained 
because  of  this  specific  character. 

A  n-uerical  procedure  for  the  slope 
stability -analysis  which  combins  the  slice  method 
in  RBSM  is  shown  in  this  paper.  Feature  in  the 
present  method  is  as  follows: 

(1)  Plastic  work  is  evaluated  only  along  a 
circular  arc. 

(2)  Tensile  failure  is  also  taken  into  account 

(3)  Mesh  division  is  unnecessary. 

(4)  It  is  designed  to  compute  with  data  of  slice 
method. 

2. ALGORITHM  OF  PROPOSED  METHOD 

To  simplify  tla  explanation  of  tho  proposed 
method,  tho  caso  which  slipe  lines  aro  modeled  by 
circular  arc  is  taken  as  an  example. 

Tho  procedure  of  analysis  is  as  follows: 

(1)  As  shown  in  Fig.l,  tho  region  which  is 
defined  by  shape  of  slope  and  slipe  line  is 
dovidod  into  slice.  This  process  is  same  as 
tho  ordinary  slico  method. 

(2)  Integral  points!  0)  are  assumed  botween 
circle  points (•). 

(3)  As  shown  in  Fig. 2,  the  degrees  of  freedom 
are  rigid  body  displacements  (u,v,  0)  at  the 
center  of  gravity  of  each  slico  element.  Then 
stiffness  matrix  of  each  slico  element  is 
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obtained. 

(4)  Driving  forces  are  calculated  from  the 
weight  of  each  slice  element. 

(5)  As  shown  in  Fig.2.  surface  stresses  (o.z)  at 
each  integral  .point  are  computed  by  the 
discrete  limit  analysis. 

(6)  The  safty  factor  is  computed  by  the  surface 
stresses. 

The"  safty  factor  can  be*  given  by  tbe 
following  equation: 

f  s  -  £  £ c  •*’  °  u*  ^  1 > 

where:  c  is  cohesion 

£  is  internal  frictioral  angle 
a  is  normal  stress 
r  is  shear  stress 
t  is  length  of  slip  lines 


(X,  Y) 


3. TECHNIQUES  OF  NON-LI NEAR. ANALYSIS 

Two  techniques  for  solving  non-1 incar 
analysis  are  discussed  in  this  paper.  One  is  the 
stress  transfer  method  in  which  the  tensile 
failure  can  be  taken  into  consideration  easily. 
Another  is  the  Yemada's  method  in  which  the 
yielding  failure  and  unloading  are  judged 
exactly.  In  the  stress  transfer  method  tho 
corrc  tive  increment  stress  is  caluculated  only 
by  thi  normal  stress  as  shown  in  Fig. 3.  Therefore 
the  stress  paths  of  both  will  bo  different. 
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4.  NUMERICAL  EXAMPLES 

4.1  COMPARISON  OF  SAFTY  FACTOR  WITH  THE  ORDINARY 
METHODS 

A  test  model  Is  shown  In  Ffg.4.  In  this 
case,  the  center  and  the  radius  of  a  circular- arc 
are  fixed.  The  safty  factors  compared  with  the 
ordinary  slice  methods  are  shown  in  Tab.l.  The 
results  obtained  from  this  table  arc  summarized 
as  follows: 

(1)  The  safty  factor  which  is  computed  by  elastic 
analysis  is  smaller  than  the  Fellenius  method. 

(2)  The  safty  factor  which  is  -  computed  by  the 
stress  transfer  method'  is  almost  .the  same  as 
the  Yamada*sv method. 

(3)  The  saft.  factors  which  is  computed  by  the 
stress  transter  tstbod  andvthe  Yamada'.s  method 
arc  situated  'Jjetweco  the  Fellenius  and  the 
Bishop. 

(4)  In  the -case  of  the  stress  transfer-method,  The 

saft7  factors  which  is  computed  by  plastic 
within  a  tensile .strength  differs  little  from 
plastic.  w 
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4.2  STATICALLY  INDETERMINATE  FORCES 


Statically  indeterminate  forces  are  shown  in 
Fig. 5.  In  elastic  analysis,  horizontal  forces  E 
become  large  tension  near  the  top  of  slope,  but 
In  plastic  analysis  the  tensile  forces  arc 
decreased  owing  to  occurrence  of  slip  lines. 
Besides  In  clastic  analysis  vertical  forces  T 
which  indicate  shear  forces  arcv  greatly  changed 
from  toe  to  top  of  a  slope,  but  in  plastic 
analysis  the  range  of  change  becomes  small.  Then 
the  local  safty  factors  computed'  by  plastic 
analysis  are  lower  than  those  by  elastic  analysis 
except  the  region  of  slippage.  In  the  region  of 
slippage,  local  safty  factors  are  1.0  in  plastic 
analysis.  The  stress  transfer  method  and  the 
Yamada's  method  have  similar  distribution  of 
statically  indeterminate  forces. 

4.3  THE  STRESS  TRANSFER  METHOD  HITHIN  A  TANSILE 
STRENGTH 


Fig. 6  shows  the  results  of  two  methods.  One 
is  tho  method  which  is  taken  no  account  with  a 
tensile  strength  with  a  broken  lino.  Anothor  is 
tho  method  which  is  taken  into  account  of  a 
tensile  strongth  with  a  solid  line.  Taking 
account  of  a  tensile  strength,  tho  tensilo  region 
which  is  occurred  at  a  top  of  slope  is 
disappeared.  Then  tho  length  of  slipo  line 
becomes  longer.  Both  horizontal  forces  E  and 
vertical  forces  T  are  replaced  towords  the 
compressive  side. 


4:i.EIGI0S  OF, TENSILE  FAILURE  USING  SIR ESS 
TRANSFER  METHOD 

Eoek's  study  shows  that  in  the  case  of  slope 
having  a  face -angle  of-  30  degrees  In  a  drained 
soil  with  a  friction  angle  of  20  degrees,  the 
circle  center  of  the  critical  failure  is  located 
at  (0.2H.1.85H)  and  the  critical  tensile- crack- is 
at  a  distance  0.2H  behind  the  top ’of  the  slope. 
The  result  of  H*10m  using  the  stress  transfer 
method  within  a  tensile  strength  ‘is  shows  in 
Fig. 7.  As  shown  in  this  figure,  the  tensile 
failure  .Is  located  at  ?.  distance  0.68*(0.07H)  and 
slipe  lines  are  occurred  from-  the  bottom  of 
tensile  failure.  Because  of  the  size  of  the 
breadth  for  each  slice  element,  the  location  of 
tensile  failure  is  reasonable.  ' 


to:*. « tx 


5. CONCLUSION 

A  numerical  procedure  for  the  slope 
stability  analysis  which  combins  tho  slice  method 
in  RBSM  was  proposed  and  applied  It  to  some 
numerical  examples. 

The  principal  results  and  conclusion  of  tho 
present  study  aro: 

(1)  The  numerical  data  for  the  ordinary  slice 
methods  aro  available  to  this  method. 

(2)  Tho  safty  factor  can  bo  obtained  fairly 
easily  and  the  value  is  reasonable,  compared 
with  tho  ordinary  slice  methods  such  as  the 
Bishop,  tho  Janbu  and  so  on. 

(3)  The  results  of  the  stress  transfer  method 

and  the  Yamada's  method  have  a  little 

difference. 

(4)  In  the  case  of  Including  tensile 

failure,  tho  stress  transfer  method  within  a 
tensile  strongth  is  useful. 
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Abstract  -  This  paper  deals  with  an  explicit, finite  dif¬ 
ference  scheme  with  rigid  body  sprint  model  for  soae  la- 
pact  problcas  cn  clastic  plane  beams  and  reinforced 
concrete  beams.  The  beams  are  divided  ^Into  a  finite 
number  of  small  rig Id  bodies,  which  are  connected;  with 
springs  distributed  the  contact  area  of  neighboring 
bodies.  Equations  of  aotlon  cn  each  rigid  body  arc  ex¬ 
pressed  In  a  finite  difference-fora  on  tlae  and  then 
these  equations  are  solved  both  numerically,  in  an 
explicit  fora.  Present  numerical  results  show  good 
agreeaent  with  the  exact  solution  for  an  elastic  plane 
beam  and  experimental  results  for  a  RC  beam. 

I.  RIGID  BODY  SPRING  MODEL 


cd  to  element  i .  and  e*  and  If  are  mass  and  moment  of 
Inertia  of  elements  f .  respectively. 

Velocities  and  displacements  at  tine  tt&c  are  obtain  by 
a  direct  time  Integration  using  forward  finite  differ¬ 
ence  .expressions  as  follows: 

"i.Wflt  ■  Pi.t  fw.t  U  w  Vi.c *££  *  *i.t  *vi,t  Lt  *  (5) 

(  '  6f,t  ^ 

ui.ttU  m  **.t  Ht  to*  vi,ttte  •  yj.t  *'vi,uU  (6) 

Bi.t*tt  -  ©i.t  #Utt££  ^ 


Considering  tro  triangular  rigid  plate  elements  i*j 
which  are  connected  by  two'  different  types  of  springs  Kn 
and  Ks  at  two  evaluation  points  on  contact  area  as  shown 
In  Flg.X(a.b),  stiffness  of  the  springs  Is  expressed 
from  the  plane  stress  condition  as  follows: 


r. r.  lU  E 


(1) 


there  E  and  v  are  Young's  modulus  and  Poisson’s  ratio, 
and  other  symbols  should  refer  to  FJg.l.  Centro! daj 
displacements  of  each  clement  are  denoted  by  ( u± .  V|,  \) 
and  (ur  vj*  Qj)  respectively.  Relative  displacements  on 
the  evaluation  points  are  given  by  the  following  equa- 
tlons(see  Fig.l(c)): 


§1  *  ~(§vLJ'+  b\j)  •  As  -  -(Ajf  +.$sj) 


(2) 


III.  ELASTIC  PLANE  BEAM 

First  example  concerns  a  problem  of  clastic  stress  wave 
propagations  In  a  two  dimensional  plane  elastic  beam 
subjected  to  an  Impact  step  load.  A  rectangular  beam 
whose  two  edges  are  simply  supported  Is  considered,  and 
element  mesh  division  and  boundary  clement  condition 
used  "here  are  shown  ln-Flg.2.  Figure  3  shows,  stress 
history  of  cy  at  the  mid  point  of  beam  (x*0,y«h/2)  as 
compared  to  the  exact  solutions  by  the  two  dimensional 
elasto  dynamic  theory.  The  numerical  solution  exhibits 
dispersion  and  spurious  oscillations  behind  wave  front 
However,  wave  arrival  time  and  amplitudes  are  correct 
and  the  center  of  oscillations  approaches  the  exact 
solution.  In  the  figure,  the  abscissa  Indicates  a  ron- 
dlmenslonal  time,  T- ctt/h.  where  C|-  vfrlpfJ-v*;)  .  P-  den¬ 
sity. 


where 

*ni*J  •  m,/  cos  a j'j  *  vi, j  sin  aj,j 

aI.j  -(*p-*gl,J>  sin  «J 
*sl.j  ‘  "VW  sin  a i,J  Ivf'j  ax  a i,j 

sin  Oi'j  t(xp-Xgi,j)  axaj'jWi.j 

where  xtf,j  »  ygi,j  and  xfi  ,  yp  arc  coordinates  of  the  cen¬ 
troid  of  plates  and  the  evaluation  point,  respectively. 
Here  rotational- displacement  Qi,j  are  assumed  to  be  very 
small.  Spring  forces  Interacted  on  the  two  elements  are 
given  by  . 

NiJ'Kn&n  •  %•***» 


Center  Line 


II.  EXPLICIT  FINITE  DIFFERENCE  SCHENK 

At  time  t,  acceleration  of  each  element  Is  subjected  to 
the  equation  of  motion, 

•  vi.fWifal  ,  e'/.c  •  &i/li  (4) 
where  DCf,  D*.  fflfj  arc  sunmatlon  of  all  the  forces  appli 


IV.  REINFORCED  CONCRETE  BEAM 


Second  example  concerns  an  impact  fracture  problem  of 


1897 


Center  Line 


Fig.5  Element  mesh  division;  dotted  part  -shots  the 
element  Including  reinforced  bar 

relnfor.  ~d  concrete  beam.  Xcdel  considered  here  Is  the 
saae  as  one  of  the  test  speciaens  In  the  experiments  of 
reference (21.  The  Inpact  leading  system  used  in  the 
experiment  was  operated  by  use  of  compressed  Sa  -gas 
through  a  striking  haaaer  consisting  of  a  hard  steel 
cylinder  with  diameter  of  9.8cm.  impact  velocity  of 
10.7n/s  and  weight  of  70kgf.  Figure  4  shows  the  test 
specimen  and  Flg.5:shows  the  element  mesh  division  used 
for  numerical  analysis.  Constitutive  relation  of  con¬ 
crete  used  In  this  analysis  includes  the  effects  of 
cracking  and  shear-slippage  as  given  In  Flg.S.  And  the 
main  and  shear  reinforcing  steel  bar?  are  elastic-per- 
fectly  plastic  (yield  stress  Is  3000kgf/cm3).  Figure  7 
shows  the  two  fracture  modes  obtained,  namely  Fig.7(a) 
is  a  bending  fracture  mode  under  a  comparatively  slow 
monotonously  Increasing  load(10tf/ms).  and  Flg.7(b)  is  a 
shear  and  negative-ending  fracture  mode  under*an  Impact 
load  with  the  imposed  hanmer  velocity,  V»10.7m/s.  The 
later  mode  is  similar  to  the  fracture  modes  observed  in 
the  experiments  as  shown  in  FIg.8.  Ff-irc  9  shows  the 
comparison  of  strain  histories  of  the  top  and  bottom 
reinforcing  bars.  Fair  agreement  is  seen  between  the 
numerical  results  and  the  experimental  results  at  a 
point  away  from  the  span  center.  But  It  seems  that  con: 
sldcrably  difference  between  them  is  yielded  at  the  span 
center  because  of  the  effect  of  three  dimensional  local 
failure  near  the  loading  point. 

V.  CONCLUSION 

The  proposed  method  showed  good  applicability  to  Impact 
fracture  problems  including  crack  or  shear  failure  of 
reinforced  concrete  beams  as  well  as  stress  wave  propa¬ 
gation  problems  of  elastic  plane  beams. 
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FIg.7  Fracture  modes. (a)Slow-lncreaslng  load(P»10tf/ms) 
(b)Impact -load(V>10.7m/s) 


(a)Static  landing  (xaxioua  load  5.4t£) 


(b)Iapact  load  (V*10.7o/s) 


Fig.8  Cracking  pattern  and  failure  mode  for  test  beams 


tioc(as) 


Fig. 9  Comparison  of  strain  response  of  re-bar 
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APPLICATIONS  OF  3D  RIGID  BODY  SPRING  MODEL  TO  STEEL  AND  CONCRETE  COMPOSITE  SLABS 
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Abstract  -  Steel  and  concrete- composite  slabs,  described 
herein,  are  two  layered  thick  plates,  consisting  of  con¬ 
crete  and  thin  steel  plate  withfmechanical  shear  con¬ 
nectors.  Their  behavior  up  to  failure  Is  simulated  using 
3-dimenslonal  Rigid  Body  Spring  Model  considering  ma¬ 
terial  nonlinearity.  Verification  study  can  show  the 
validity  of  the  model,  even  when  the  coarse  subdivision 
is  used  based  only  on  major' failure  modes  to  control  the 
slabs  "at'thc  ultimate  state  of  loading. 

I.  INTRODUCTION 

The  concept  of  Rigid  Body  Spring  Model:  RBSMUJ  is 
to  idealise  structures  aacroscopically  to  the  asseablage 
of  rigid  bodies  together  with  springs,  based  on  the 
failure  oechanlsm  observed.  It  has  an  advantage  of  com¬ 
prehensive  expression  to  the  discontinuous  phenomenon 
due  to  separation  and  slip  to  play  a  vital  role  at 
structural  collapse. 

Steel  and  concrete  composite  slabs (2,31' have  excel¬ 
lent  performance  on  load  carrying  capacity  and  execution 
workability-in  comparison  with  ordinary  reinforced  con¬ 
crete  slabs.  Thus  they  have  been  applied  to  plate  mem¬ 
bers  such  as  bridge  decks,  building  floors  and  the  an¬ 
other  In  civil  engineering  field. 

At  the  ultimate  state  of  loading,  the  slabs  exhibit 
some  kinds  of  complicated  and  solid  failure  modes.  More¬ 
over.  the  modes  change  dependent  upon  shear  connector 
arrangement,  applied  Toad -condition  and  so  on.  lfe  have, 
therefore,  carried  out  3-dimenslonal  nonlinear  analysis 
of  the  slabs  using  RBSM.  It  Is  significant  to  simulate 
their  behavior  up  to  failure,  because  it  could  be  re¬ 
flected  on  establishment  of  their  rational  design  codes 
when  the  model  can  predict  the  behavior  sufficiently. 

II-  MODELING  OF  CONSTITUTIVE  RELATION 

In  RBSM  formulation,  material  properties  arc  Intro¬ 
duced  to  normal  and  tangential  springs  on  the  interfaces 
of  neighbor  rigid  body  surfaces.  According  to  the  ma¬ 
terial  properties,  the  couple  of  spring  stiffness  of  the 
model  are  determined  and  can  associate  relative  dis¬ 
placements  of  neighbor  rigid  body  with  interfacial 
forces.  The  nonlinear  characteristics  of  the  materials 
made  tho  composite  slabs  up  arc  modeled  as  follows: 

A.  Concrete  Material 

Figure  1(a)  shows  the  relationship  between  normal 
stress  and  normal  strain  of  concrete  taking  account' both 
of  cracking  in  tension  and  crushing  in  compression.  The 


Fig.l  Concrete  material  model 

(a)cf-c  relationship,  (b)Failurc  surface 


shear  slip  of  concrete  is  subjected  to  the  analogous 
failure  surface[4j  with  Mohr-Coulomb's  as  shown  in  Fig. 
1(b);  Furthermore,  within  the  failure  surface,  the  elas¬ 
tic  relation  In  shear  is  applied. 

B.  Steel  Material 

Steel  plates*arc  assumed  to'be  elastic  and  perfect¬ 
ly  plastic.  Interaction  between  normal  stress  and  shear 
one  for  the  failure  surface  of  concrete  is  Ignored. 

C.  Shear  Connector 

The  shearing-force'-  slip  relationship -on -Interface 
through  an  embodied  stud  connector  in  concrete  is  ex¬ 
pressed  as  a  function  of  Its  strength.  Q„15J  as  shown  in 
Fig. 2.  In  addition,  its  longitudinal  action  Is  regarded 
as. an  elastic  bar  element. 

The  effect  of  the  stud  Is,  hence,  evaluated  by  two 
modelings.  One  is  to  allocate  a  couple  of  springs  with 
the  above  property  to  every  point  of  stud  and  no 
interaction  of  concrete  and  steel  plate  except  stud* 
points.  The  other  Is  to  spread  the  stud  strength  over 
appropriate  area  surrounding  studs,  and  the  relation 
between  shearing  stress  and  slip  on  the  concrete  -  steel 
plate  interface  is  given  by  an  clastic  perfectly  plastic 
curve.  The  former  and  the  latter  arc  called  discrete  and 
smeared  model,  respectively. 


Flg.2  Mechanical  behavior  of  stud  conncctoriS] 

III.  VERIFICATION  STUDY 

A._  Test  Slabs 

All  of  the  slabs  we  carried  out  loading  tests  for 
verification  w^re  square  slabs  with  sides  1600mm  long, 
and  were  simply  supported  with  1375am  span  length.  Con¬ 
crete  of  upper  layer  was  i20mm  high  and  steel  plates  of 
lower  one  was  6mm  thick.  Concrete  was  396kgf/cma  In  com 
pressive  strength:  fc  and  28kgf/caa  in  tensile  one:  ff, 
and  steel  was  3574kgf/cma  in  yielding  point:  f.„.  As 
shear  connector,  headed  studs  with  80aa  height. and  13aa 
diameter  were  welded  on  steel  plates  in  the  following 
two  ways;  In  slab  #1,  they  were  arranged  out  of  the 
supported  edges  at  Intervals  of  2S0aa,  while  in  slab  #2, 
over  the  whole  at  those  of  I25aa.  Square  patch  load  with 
sides  125aa(Load  #1)  or  375ma(Load  #2)  long  was  applied 
at  central  portion  on  the  top  surface  of  the  slabs. 

The  observed  failure, modes  of  the  slabs  could  be 
classified  Into  two  types;  One  was  bond  slip  failure  due 
to  distinct  slip  associated  with  cracking  on  all  edge 
sections,  shown  in  Flg.3(a)  for  the  slab  having  coarser 
stud  arrangement.  The  other  was  punching  shear  failure 
pushing  out  of  concrete  locally  at  the  loading  portion 
in  Fig. 3(b)  for  that  having  lntlmater  one.  Additionally, 
in  both  the  modes,  steel  plate  had  not  yielded. 
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Flg.3  Test  slabs  with  cracking  of  concrete  observed 

(a) Slab'*l;  Bond  slip  failure  under  Load  t 2 

(b) Slab  $2:  Punching  shear  failure  under  Load  #1 

B. _ 3-dlmcnslonal  RBSM  Idealization 

To  give  full  play  to  the  ability  of  RBSM,  we 
Intended,  In  particular,  to  subdivide  3-dimcnslonally 
the  slab  as  coarse  as  possible  considering  only  the 
major  failure  modes  observed:  Namely.  It  was  to  form  1) 
a  corned  shape  division  expanding  from  the  sides  of 
loading  area  corresponding  to  the  punching  shear  crack, 
2)  a  polygonal  one  near  the  supported  edges  to  express 
the  accompanying  negative  bending  crack  and  3)  one  on 
steel  plate  -  concrete  Interface  to  express  slip  and 
separation,  as  shown  In  Flg.4  with  referring  to  FIg.3. 
The  slabs  In  1/4  space  dotted  In  Flg.4  were  analyzed 
owing  to  symmetry. 

A  direct  Iteration  scheme  was  employed  modifying 
the  secant  modulus  of  stiffness  at  each  step  of  computa¬ 
tion.  according  to  a  current  state  on  the  nonlinear 
material  models.  The  reference  points  to  the  current 
state  coincided  with  the  sampling  points  for  numerical 
integration  to  evaluate  the  stiffness  of  the  model. 

C.  Results 

The  load  -  deflection  curves  obtained  numerically 
using  the  discrete  model (Model  #J)  or  the  smeared  model 
(Model  #2)  for  stud  connectors  arc  shown  In  Fig.S  with 
that  observed  In  the  test.  Model  #1  predicted  the  fail¬ 
ure  loads  sufficiently, for  slab  #1  having  coarser  stud 
arrangement.  On  contrary.  Model  #2  gave  good  results  for 
slab  #2  having  Intlmatcr  one.  Thus  It  was  essential  for 
the  successive  simulation  to  evaluate  the  effect  of  the 
stud  adequately  dependent  on  the  arrangement  on  the 
modeling. 

"Figure  6  shows  the  failure  modes  obtained,  those 
gave  the  failure  loads  In  good  agreement  with  the  ob¬ 
served  ones.  The  numerical  results  also  expressed  the 
satisfactory  failure  modes.  One  can  find  the  occurrence 
of  significant  slip  on  the  steel  -  concrete  Interface  in 
FIg.6(a)  characterizing  the  bond  slip  failure.  Further¬ 
more,  the  situation  pushing  out  the  corned  region  down¬ 
ward  at  loading  area  corresponding  to  the  punching  shear 
one  can  bo  found  in  Fig.6(b).  In  both  the  modes,  steel 
member  had  not  yielded  similarly  to  the  test  results. 


■ 

Flg.4  3D  RBSM  idealization  which  has  98  rigid  bodies 
and  588  degrees  of  freedom  in  1/4  space  dotted 


Fig.S  Load  -  Deflection  curves.  (a)Slab  #l,  (b)Slab  #2; 
Numerical  and  experimental  results 


Fig.S  Failure  modes  (1/4  space);  Numerical  results 
(a)Bond  slip  failure,  (b)Punchlng  shear  failure 


Thus  we  can  conclude  that  the  model  can  predict  satis¬ 
factorily  the  mechanical  behavior  of  the  slabs  even  when 
the  coarse  subdivision  Is  used  based  only  on  major  fail¬ 
ure  modes,  provided  with  the  adequate  modeling  of  the 
stud  connectors  dependent  upon  Its  arrangement. 


IV.  CONCLUSION 

Using  3-dimensional  Rigid  Body  Spring  Model,  we 
have  simulated  the  behavior  of  the  steel  „nd  concrete 
composite  slabs  up  to  failure  and  examined  (he  numerical 
results  obtained  in  comparison  with  those  observed  In 
the  loading  tests  both  on  the  failure  loads  and  modes. 
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Abstract:The  numerical:  analysis -is, carried  out 
to  many  kinds  of  slope  stabilities  uslns  the 
Rigid  Body  and  Spring  Model  (  hereafter  called 
RBSM)  proposed  by  Prof .Kawai(1977).  This  paper 
reports  on  the  application  of ’RBSM" analysis  to 
the  retaining  wall  model  test  and  the  field 
embankment  reinforced  with  polymer  grids  that 
RBSM  is.  an  effective  analytical  model  for 
earth  reinforcing  mechanisms. 

I  .INTRODUCTION 


Fig.1  Sheets  arrangement  in  the  retaining *wall 
model  test  (  the  wail  is  at  left  side  ) 


In  discrete  limit  analysis  of  reinforced 
soils,  the  interface  between  relforclng 
material  and  soil  is  modeled  by  the  beam 
element  and  the^plane  element,  each  of  them. is 
defined  by  RBSM.  Stress,  (t  .  o  )  of-  RBSM  are 
transmuted  by  two  springs  (  the  shearing 
spring  and  the  normal  spring)  distributed  over 
the  contact  surface  of  two  adjacent  rigid 
elements.  The  discrete  surface  such  as  the 
lnterfacecan  ne  easily  assessed  by  properties 
Of  RBSM. 

II  .RETAINING  WALL  MODEL  TEST 

Fig.1  -illustrates  the  retaining  wall  model 
test  reinforced  with  vinyl  sheets.  In  which 
the  bottom  of  the  wall  at  left  side  is  hinged. 
Fig. 2  shows  the  analytical  result  of  slip 
lines  inside  the  back-fill  caused  by.  moving 
of  the  wall.  In  the  present  analysis,  it  is 
assumed  that  yielding  of  the  Interface  is 
defined  by  Mohr-Coulomo’s  criterion  and 
plasltc  strain  is  computed  by  the  procedure 
using  associated  flow  rule.  Therefore,  if  the 
Interface  has  become  to  be  yielding. the  strain 
of  the  adjacent  sheet  will  not  be  able  to 
Increase,  Test  data  and  computed  results  of 
strain  of  vinyl  sheets  show-  that  the  occur¬ 
rence  of  the  slip  in  the  upper  sheet  (  the 
measured  line  number  71  corresponding  to  the 
computed  result  markedly  •  in  Fig. 3  ). 
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Fig. 2  Slip  line  by  the  present  analysis 


Fig. 3  Occurrence  of  the  slip  in  the  upper 
sheet  (Solid  lines  indicate  test  results) 
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Ill  .REINFORCED  EMBANKMENT' WITH' SURCHARGE: 

A  8.5m  high  and  70m  Iona  reinforced  embank¬ 
ment  with  the  8m  hlah  surcharee  fill  was  built 
on  the  stabilized  foundation.  The  embankment 
reinforced  with  polymer  grids. has  a  slope  of 
1:0.3  and  the  upper  fill  has  a  slope  of  1:0.8. 
Fill  materials  are  mainly  aravelly  soils 
including  the  fine  fraction.  Therefore.it  took 
for  a  Iona  constructing, -period  avoldlna  the 
settlement  after  completion. 

Fig. A  shows  the  distribution  of  strain  of 
polymer  grids  at  three  constructina  staees.  In 
this  figure,  strain  marked  by  cicles  are 
unreliable  because  one  guaae  of  pairs  had 
become  unstable.  Fannin  et  al. (1988)  presented 
that  polymer  grids  laid  In  the  field  embank¬ 
ment  were  influenced  by  the  long-term  load. 
Kutara  et  al .  (1988)  showed  that -the  long-term 
strain  of  polymer  grids  increased  from  about 
0.5%  to  about  2%  by  their  pull-out  test. 
Therefore,  it  is  considered  that  increasing 
of  strain  for  the  suspended  term,  without 
rainfall,  caused  by  the  long-term  deformation 
of  polymer  grids  themselves.  So.  the  straln-by 
the  self-weight  load  at  completion  shoud  be 
enual  to  the  value  given  by  deducting  the  long 
-term  strain  increment  from  the  strain 
measured.  Fig. 5  shows  that  the  analytical 
results  agree  well  with  the  modified  strain 
except  the  toe  of  slope. 


Fig. A  Distribution  of  strain  of  polymer  grids 


:Fig.5  Comparison  of  the  modified  strain  of 
polymer  grids  with  computed' results 

IV  .CONCLUSION 

Computed  results  show  explicitly  the  non¬ 
linearity  of  reinforced  soils  caused  by  the 
self-weight  load  and  the  reduction  of  earth 
pressure  by  reinforcing,  "owever.  deformation 
of  reinforced  embankments  in  field  is  more 
complicated.  Especially,  rainfall  effect 
should  be  considered  in  Japan  and  other  heavy 
rainfall  countries.  In  this  work,  the  rainfall 
effect  was  only  calculated  by  increasing  the 
unit,  weight  of  soils  as  well  as  current  design 
methods. 
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Abstract-  It  has  been  already  confirmed  that  the  beam  element 
of  KBSM  proposed  by  KAWAI  is  useful  fox  limit  analysis  of  fiamea 
structure.  This  piperpiesents  its  application  to  the  soil-stxactuie  in- 
texaction  problem  of  framed  structure.  In  this  model  effect  of  soil 
reaction  distributed  along  individual  beam  component  is  integrated 
and  it  is  replaced  by  three  different  springs  lumped  at  the  center  of 
a  given  beam  As  a  result,  analysis  of  such  non-lmear  problems  can 
be  simplified  to  great  extent.  The  usefulness  of  this  method  will  be 
discussed  with  some  examples. 

^INTRODUCTION 

Several  mechanical  models  of  foundation  are  proposed  and  they  are 
expressed  as  combination  of  spring  which  obeys  Hooke’s  law,  dashpot 
which  obeys  Newton’s  law  and  slider  which  represents  plasticity  It 
is  complicated  to  obtain  stiffness  matrix  including  the  effect  of  visco¬ 
elastic,  visco-plastic  foundation  explicitly  by  using  the  conventional 
beam  elements.  But  a  beam  element  of  RBSM  assumes  to  be  rigid, 
so  the  effect  of  foundation  pressure  distributed  along  individual  beam 
component  is  integrated  and  it  is  replaced  by  three  different  springs 
lamped  at  the  center  of  a  given  beam.  Considering  this  advantage,  the 
characteristics  of  visco-elastic,  visco-plastic  foundation  can  be  solved 
easily  [ij 

2.  ANALYSIS  OF  BEAM  ON  A  VISCO-ELASTIC 
FOUNDATION 

The  visco-elastic  medium  is  time  dependent  and  its  linear  visco¬ 
elastic  behaviour  can  be  represented  by  combining  the  linearly  clas¬ 
tic  spring  and  the  viscous  dashpot  filled  with  a  liquid  which  obeys 
Newton’s  law  of  viscosity.  In  this  paper  a  visco-elastic  foundation  is 
considered  as  shown  in  Fig.l.  This  is  Standard  solid  model  which 
Frendenthal  and  Loisch  used  in  an  analysis  of  beam  on  a  linear  visco¬ 
elastic  foundation.(2] 


Fig.l  Standard  solid  model 


2XRgyrit.gf.bttm  n  juasaabsik  foundation 

Beam  whoes  section  properties  are  uniform  is  analyzed  in  order  to 
examine  the  usefulness  of  this  model  and  a  concentrated  load  is  applied 
at  the  center  of  the  beam  md  boundary  condition  is  free  at  both  edges. 
Section  properties  used  are  decided  to  become  ( EI/K)fL 4  s  I  O'3 
Where  I  is  momentof  inertia,  E  is  modulus  of  elasticity  and  L  is  length 


of  span.  Time  step  interval  used  for  calculation  is  A  t/r  =  0  5  Where  r 
is  r]/ki.  With  regard  to  the  modebng,  the  length  of  half  span  isdivided 
into  S,  10,  IS  and  20  elements  in  order  to  examine  the  relation  between 
nurber  of  beam  elements  and  convergency  of  solution.  The  deflection, 
beading  moment  and  bearing. pressure  were  compared  with  those  of 
analytical  solution  carried  by  Sonoda  et  al,  from  tfr  -  0  to  tjr  =  oo 
as  shown  in  Fig.2.  In  this  case,  number  of  elements  were  10  Analytical 
solution  is  given  by  sobd  lines  while  result*  of  the  present  analysis  given 
by  the  mark  O-  The  deflection  solved  by  this  model  is  a  slightly  larger 
than  analytical  solution,  but  in  general  good  agreement  is  observed. 
Both  bending  moment  and  bearing  pressure  are  also  in  good  agreement 
along  the  whole  beam.  The  relation  between  the  error  of  deflection  at 
the  midpoint  and  number  of  division  at  the  optional  time  level  from 
t/r  =  0  to  t/r  =  oo  is  shown  m  Fig.  3.  The  error  of  deflection  decrease 
with  increase  mesh  division,  and  same  with  the  elapse  of  time  (3) 


Fig.2  Beam  on  the  visco-elastic  foundation 


20  iwffili'  vt  iftttk 


Fig  3  Convergence  test  of  a  beam  element 
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Z.  LIMIT  ANALYSIS  OF.  FRAMED. STRUCTURE 
3.1  Model  of  visco-plastic  foundation 

The  elasto-plastic  media  in  shown -in  Fig'4  is  not  considered  time 
dependent  and  it  is  assumed  to  become  plastic  instantaneously  when 
it  reaches  yielding  condition.  The  visco-plastic  medium  is  considered 
time  dependent  but  it  igonies  yielding  of  the  foundation.  The  visco- 
plastic  medium  is  considered  as  combination  of  elasto-plastic  medium 
and  yisco-clastic  one.  In  this  paper  the  visco-plastic  model  of  Bingham 
type  shown  in  Fig.5  is  used  and  this  model  express  linear  creep  under 
yielding  condition. 


pVrTTP  pT^-]Q|f 


Fig.4  Elasto-plastic  model  Fig  5  Visco-plastic  model 


Limit  analysis  model  of  framed  structure  made  ofuniform  frame 
element  is  shown  Fig  6.  Also  result  of  elasto-plastic  analysis  is  shown 
in  Fig,7,  Number  in  the  figure  are  order  of  collapse.  At  first  the 
surface  of  the  foundation  is  yield  and  plastic  hinge  appears  at  pile  head, 
And  then,  yielding  foundation  advances  downward,  Finally  plastic 
hinge  appears  under  ground  pile  and  the  structure  is  collapsed.  In  this 
case,  collapse  load  is  51  tf.  Next,  the  results  by  means  of  visco-plastic 
method  are  described.  Fig.8  shows  the  relation  between  displacement 
of  pile  head  and  time  for  three  different  loading  velocity.  Collapse 
occurred  at  55  tf.  This  result  is  considered  satisfactory  because  the 
calculated  collapse  load  by  the  elasto-plastic  analysis  was  51  tf.  For 
piles,  the  location  and  order  of  plastic  hinge  formation  were  the  same 
as  in  case  of  the  elasto-plastic  analysis  (4) 


B  -  600*» 
t  =  I2u 
A  =  22l.7c«5 
I  =  95800c*4 
Jty  =  76.56t** 

-E  =  2.lXI0*tf/** 

cohesive  soil 
c  *  5tf/»* 
kh  =  l  kgf/c*5 


Fig  6  Analytical  moan 


Fig.7  Collapse  mode  by  using  elasto-plastic  model 


Plastic  hinge 


■  P  *  50tf 
» - o ■ “  P  ~  55tf 


Fig.8  Time  variation  of  displacement  at  the  top  of  pile 
by  using  visco- plastic  model 


4.  CONCLUSION 

The  framed  structure  including  the  effect  of  the  foundation  is  ana¬ 
lyzed  by  using  discrete  models  proposed  by  K  AVVAI.  The  foundation  is 
expressed  as  combination  of  springs,  dashpot  and  sliders  Analysis  of 
the  framed  structure  including  the  effect  of  visco-elastic,  visco-plastic 
foundation  has  been  considered  intractable  by  using  the  existing  finite 
element  method.  It  is  shown  that  the  proposed  method  is  very  promis¬ 
ing  in  analysis  of  such  difficult  soil-structure  interaction  problems. 
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Abstract-A  net  discrete  elegent 'of  Voronoi  polygon  is 
presented  in' this  paper.  This,*odel/is  produced  from  Voronoi 
Regions  using  random  points  after  studies f  of?  the  crack 
pattern  of  the  ^fractured  or  collapsed  metallic  materials, 
soil  or  rock  mass.^These  probjemYcan  be  solved  only  by  using 
Kaval  ’s  RBSM.'because  the'  shape  of  the  elements  may  be 
polygonal,  the  total  number  of  sides  may  be  4-15. 

From  the  results  of  analysis,  It  can  be  concluded  that  this 
nev  models  give  us  the  sets  of  suitable  crack  lines  and 
their. patterns  those  characteristics  is  of  a  Fractal  Degree 
<D>.3>  ' 

I.  INTRODUCTION 

Frequently  It  Is  explained  the  collapse  of  the  rock  mass  is 
discussed  plotting  the  stress-strain  curve.  In  situ,  then  te 
judge  the  safety  on  rock. vails,  for  example,  usually  te 
might  observe  some  cracking  charactaristics  of  the  slope, 
pi  Her,  roof  or  ground  surface  as  the  target.  On  Uiat  time, 
te  cen  not  get  the  stress-strain  curve  plot  at  the  sites.  It 
is  believed  that  the  strength  can  be  predicted  from  the 
deformation  pattern  of  the  material  mass.  Insitu  services, 
te  evaluate  the  cause  and  effect  about  safety  tith 
ackrot lodgments  and.knothot  from  engineering  standpoint  of 
vict. 

It.  COLLAPSE  PATTERN  OBSERVED  IN  THE  FIELD  STUDY 

The  lineaments  as  the  cracked  lines  are  tidely  observed. 
Thcir-Iength(L)  related  to  frequency (N>  as  function 

N  (l1)  «  L‘D  (Olconstant)  (I) 

This  function  is  adoptive  on  many  facts  fi^m  macro  to  micro 
and  km  to  #i  in  length..This  is  a, net  experimental  rule. 

A.  Geological  lineaments  of  Satellite-Photos:  Ihen  these 
lines  control  frequency  relationship  in  length,  to  tan  get 
the  very  simple  rule.  It’s  in  fractals.  The  line  length  is 
about  500-Nx 1000m  (N>1). 

B.  Ceophisictl  hypothesis  tell  us  another  Information  that 
is  coreerning  a  Iimitted  mass  block.  One  is  the  tide 
dimension  in  about  1001a  flat,  another  is  about  the  Iimitted 
depth  in  30-50km,  called  Mohorovicic  discon tirwity  surface. 

C.  After  crushing,  the  distribution  size  of  the  rock  about 
the  accumulate  teight  ratio  over  sieve  has  a  linearity  in 
RRB(Rosin-RamUer-Bcnett)-Plqt.l*a)  The  relation  is  given 
is 

foliot. 

loglog  (100/Wt)  oc  log(d)  <2) 

in  size{d)  and  accumulated  teight  percents(llt)  over  sieve. 


D.  Crack  lmes  in  rock  tall  are  appearently  arbitrary.  But 
their  frequency  of  length  are  in logalismic  normal 
distribution.  Sum  of  frequency  on  length  present  fractal 
relation  about  the  density  of  distribution  on  square  net. 

E.  Some  micro-cracks  are  remained  in  minerals  of  rocks 
including'some  gas  or  liquid  thich  vere  created  in 
geological,  old, period.  They  are  theYopen  cracks  or  adhesive 
cracks  and  their  size  are  20#  and  I5#«  in  maximum. 
Frequency  of  traced  length  of  them  have  fractal  relationship 
in  length. 

HI.  VORONOI  REGION 

A.  'Definition 

He  can  get  a  set  of  the  triangle  net  in  a  area  connecting 
arbitrary  points  on  a  flat  plane.  A  edge  of  Voronoi  polygon 
Is  perpendicular  at  the  center  of  the  edge  of  a  triangle  to 
it.(Fig.l)  A  Voronoi  point  is  the  cross  point  of  this 
perpendicular  lines.  An  area  is  devided  into  Voronoi 
regions  by  this  lines.  The  Voronoi  points  define  the 
circumcenters  of  Delaunay’s  triangle  in  coordinates.  The 
count  of  initial  points  is  same  as  the  count  of  nev  polygons 
to  produce.  Tho  Voronoi -regioms, are  polygonal.  Not  if  te 
have  arbitrary  points,  it  is  easy  to. convert  the  net. of 
polygons  on  them, exactly.  Also  arbitrary  points  can  be 
obtained  from  the  experimental  datas. 

B.  Charactaristics 

Voronoi  polygons  have  the  many  unique  characteristics. 

1)  Count  of  tops  on  a  polygon  is  six  in  anticipation. 

2)  This  polygon  is  convex  shape  completely. 

3)  Directions  of  boundary  edges  of  polygons  vhich.are 
generated  crack  points  by  random  numbers  in  uniform, 
are  in  uniform. 

4)  Frequency  of,  length  of  edges  in  polygons  have  a  Fractal 
Coefficient. (Fig.3) 

5)  Frequency  of  size  in  polygons  has  the  fractals  too. 
Charactaricltlcs  4>,5>  are  very  interesting  and  important. 

N.  MAKING  THE  MODEL 

Hot  to  make  the  polygonal  net  on  our  limited  plane  ? 

After  considering  about  experimental  rules,  te  had  developed 
several  techniques  include  geometrical  autogencrating  tools. 
Usually  it’s  generated  voronoi -polygons  by  program 
’POLYG*.  If  necessary,  it  can  get  to  overlay  some  stright 
lines  as  the  pre-cit  slits  or  special  cracks  on  the  model. 

V.  SOLVER 


MTcforol 
Polil) 

Fix.l  Defioltloo  o I  tofoool  Sc«l^a 


FU.2  ?< |ld  Body  $*•!(«  Model 
*«tk  PqIhokI  iU;<s 

1905 


Fig.4  To  laprovo  Accuracy 


I.  Cilcaliud  etomitsU  p*rt>  b.  Crack  patl*n»  In  arbitrary  sole 

Fi(.$  Ci leu! tied  patterrs 


Fig .7  Silhouette  of  top  ridge  line  on  model 
In  exaggerate  scale  (likes  as  landscope) 


A.  Consideration.  The  geometrical  dualrelatlori  between 
Delaunay’s  triangle  and  Voronoi  region  is  more  suitable  to 
KAIAI’s  R8SN.(Pig.2)  On  the  Voronoi  polygons,  a  boundary  ed 
ge  cross  in  perpendicular  to  a  edge  of  Delaunay’s  triangle. 
On  RBSM,  if  the  optional  point  to  Involve  force  set  the  cent 
er  of  polygon,  ?e  can  get, the  lost  agreeable  condition  to 
analize.  This  technique  can  not  be  used  In  case  of  FEM. 

B.  improve  accuracy.  Referring  to  Fig. 4. 


Khen  (Ayi»Ayj)  are  the  distances  between  (G|,G2)  and  edge 
line, 

ih  --  ‘  -§& 

♦  -ife-  <♦> 

Ay*  »  Ayi  £  0 

I hen  cvaluting  position  of  strain  energy  balance  on  tho 
boundary  edge  of  voronoi  regions  set  the  summit  of 
Delaunay’s  triangle,  perfectly  Ay*  =  0.  Then, 

-gJL  .Jt XftyliLu.  -gtt-  (5) 

So  the  accuracy  improve  unconditionally  in  spite  of 
geometrical  arrange  of  the  constructed  net.  It  is  some 
Halted  cases  of  condition  that  is  "Ay*  =  0  ”,  on  which 
eleaents  are  in  regular  triangle,  right-angle  triangle. 

The  rectanglar  aust  bo  separate  as  like  Union-Jack  flag, 
square  and  regular  hexagon  etc.  These  limited  net  aodols  are 
very  difficult  to  generate  exactly.  Voronoi  polygonal  net 
are  very  easy  and  exact  to  construct  and  the  best  model  to 
get  tho  best  accuracy  completely. 

ANALYSIS 

A.  Create  crack  branch  and  their  fractals(Fig.5) 

Va  always  conduct  aaterial  testing  to  survey  the  strength 
and  charactiristics  of  materials  used.  On  such  cases,  we 
observe  crack  lines  initiated  on  the  pieces  as  like  branches 
of  tree. 


'After  shcarinng  calculation,  it  happend  u  mount  of  plastic 
hinge  and  broken  boundary  lines  which  have  a fractal  degree 
with  their  length  and  frequency.  The  pattern  of  these  lines 
show  likes  treo  shape,  which  has  a  few  trunk,  bough,  twig 
and  leaves  much.  Simulated  material  is  concrete  aortal. 
Finite  elements  to  calculate  are  natural  random  arrayed 
polygons.  Just  force  crcato  his  crack  ways  with  hlsself. 

8 •  Collapso  of  cylindrical  test  Piecc(FiglB) 
lo  frequently  studied  shearing  strength.1  This  simulation 
was  made  under  plain  strain  condition  at  a  section.  The 
loads  set  on  two  pin  points  by  the  top  and  bottom.  Loading 
balance  is  not  so  exactly,  so, the  deformation  of  elements 
in  model  are  unbalance.  After  tho  result,  broken  elements 
set  their  position  in  natural  on  arbltorary  scaling. 

C.  Collapse  model  of  Earth  Crust(Fig.7) 

Under  the  horizontal  pressure,  a  rock  mass  model  of 
rectangular  body  deform  with  hinge  and  crack  lines.  Upper 
edge  line  shows  a  modal  pattern.  This  deformation  images  a 
landscope  in  perspectives.  It  looks  far  mountain  ridge. 

Vfl.  CONCLUSION 

This  paper  presents  a  new  finite  element  model  cf  polyginal 
shape.  They  can  solvo  only  on  RBSN.  Generally,  it  can’t 
solve  this  polygonal  net  problems  by  FB4. 

It  is  described  that  the  relationship  of  dual  relation 
between  Delaunay  and  Voronoi  Regions  is  suitable  for  KAIAl’s 
RBSN.  lhe  lenglh  of  each  edges  beyond  Voronoi  Polygons  have 
tho  fractal  degree.  So,  after  loaded  analysis,  collapsed 
hinge  lines  and  broken  mass  show  fractals  about 
charactaristics  and  shape  of  model  on  outlines. 

Simulated  crack  pattern  is  presented  which  is  analogous  to 
the  patterns  of  lineament  as  like  as  photo-image. 
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Abstract  -  We  present  a  new  linear  nonconforming  finite  clement 
method  for  Reissncr-Mindlin  plates.  For  the  element  we  have 
proved  optimal  order  error  estimates  which  are  uniformly  valid 
with  respect  to  the  plate  thickness.  This  means  that  the  element 
will  never  "lock”.  We  give  numerical  examples  which  show  that 
this  is  the  case. 

L  THE  ELEMENT 

The  weak  form  of  the  Reissncr-Mindlin  plate  model  is  the  fol¬ 
lowing  (cf.  (4)):  Find  the  deflection  to  €  Ho(ft)  and  *hc  rotation 
0  €  (tfd(ft))*  such  that 

£(/3.io)  <  E(M  v(M  e  M(n))>  * 

with  the  energy  defined  as 

E(M  =  |  a(M)  +  /ntv>  -  Vvl1  dx  - 

where  G  is  the  shear  modulus  and  n  is  the  shear  correction 
factor.  The  thickness  of  the  plate  is  denoted  by  i.  The  bilinear 
form  a  is  given  by 

40,<>)  =  )2(1  -„»)  i^K1-1')  !  div/3  divvS]  dx, 

with  E  and  v  denoting  the  Young  modulus  and  Poisson  ratio, 
respectively.  For  simplicity  we  present  the  case  of  a  clamped 
plate. 

The  locking  of  finite  element  methods  occurs  for  "thin*  plates 
and  hence  it  is  customary  to  consider  a  sequence  of  problems 
obtained  when  assuming  that  the  transversal  load  is  of  the  form 
/ *> t’j  (i) 

with  g  fixed.  This  ensures  that  the  plate  model  has  a  finite  limit 
solution  when  t  -*  0.  We  will  make  this  assumption  below. 

In  this  note  we  will  present  a  new  finite  element  method  intro* 
duced  in  (3)  and  independently  in, {2]  by  Duran,  Ghioldi  and 
Wolanski.  The  method  can  be  viewed  as  a  modification  of  a 
recent  method  by  Arnold  and  FYtlk  (1}.  In  the  method  the  de¬ 
flection  is  approximated  with  linear  nonconforming  elements;  we 
let  Ck  be  a  triangulation  of  the  domain  and  define 

WK  ={  u  6  L*(rt)  |  v[K  €  Pi(K),  K  €  CK  and  v  is 
continuous  at  midpoints  of  clement  edges  and 
vanishes  at  midpoints  of  boundary  edges  }. 

For  the  rotation  we  use  the  standard  space  of  continuous  piece- 
wise  linear  functions 


Bk  =  { 0  6  vnm*  I  /v  e  (P. wi*.  K  e  c„ }. 

The  discretization  is  then  defined  as:  find  and  6  Bk 

such  that 

Ek(^k)<Ek(^v)  V(^)€fl*xin, 
with  the  following  modified  energy  expression 

£*(<>.  v)  =  jo(v>,V’) 

Here  hj ;  denotes  the  diameter  of  the  element  K  6  C\.  The 
operator  R\  is  defined  as  the  /^-projection  onto  the  piecewise 
constant  vectors,  i.e.  we  have 

//*•  K6C“ 

The  parameter  o  is  positive  and  fixed.  In  practice  the  inclusion 
of  the  reduction  operator  simply  means  that  the  local  stiffness 
m  »trix  is  calculated  with  the  one  point  integration  formula. 

For  our  method  wc  have  proved  the  following  error  estimate  (3], 

Theorem.  Suppose  that  R  is  convex  and  that  the  load  /  is  in 
L2{U)  and  satisfies  (Ij.  Then  there  is  a  positive  constant  C, 
independent  of  the  plate  thickness  i,  such  that 

J!u>  -  u>a(1o  + 1|0  -  Pkb  <  Ch* (}j{|o. 

For  the  details  of  the  error  analysis  and  fo:  a  discussion  of  the 
advantage  of  the  method  compared  with  the  method  of  Arnold 
and  Falk  we  refer  to  |3), 

II.  NUMERICAL  EXAMPLES 

Wo  will  give  results  of  computations  for  a  clamped  and  simply 
supported  square  plate.  We  consider  both  a  uniform  and  con¬ 
centrated  center  point  load.  The  symmetry  of  the  solution  is 
utilized  and  only  one  quarter  of  the  plate  is  discretized  The 
type  of  meshes  used  is  seen  from  Figure  1. 
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Figure  i  'The  finite  element  mesh  with  h  =  1/2. 


The  side  length  of  the  quarter  plate  is  chosen  equal  to  unity 
and  we  let  u  =  0.3,  We  have  chosen  t  =  0  01  and  compare  our 
results  with  the  exact  classical  Kirchhoff  solution.  We  use  the 
"hard”  simply  supported  boundary  conditions.  In  principle  the 
parameter  a  can  be  chosen  arbitrary  as  long  as  it  is  positive;  but 
is  clear  that  the  solution  will  depend  on  the  choice.  The  best 
results  we  have  obtained  for  a  in  the  range  0.05  —  0.3.  In  the 
calculations  presented  below  we  have  chosen  a  «  0.1.  Wc  denote 
by  Mx  the  exact  normal  moment  and  by  M)  the  approximation 
calculated  from  the  discrete  rotation  /?*. 

Our  numerical  results  clearly  show  that  the  method  performs 
very  well.  We  have  also  indicated  the  convergence  rates,  i.c.  the 
powers  of  the  mesh  length  h  by  which  the  -errors  decrease,-  and 
we  sec  that  the  calculations  confirm  the  error  estimates  proven 
mathematically. 

Some  further  numerical  results  and  comparisons  with  some  other 
linear  elements  are  presented  in  (5). 


Table  4.  The  normalized  center  deflection  iCA(ccntcr)/tr(centcr) 
for  the  concentrated  center  load. 


-  .1/5 

.Clamped  ’ 

Simply  supported 

2 

1,0755 

1.0744 

4 

1.0418 

1.0305 

8 

1.0193 

10119 

Table  5.  The  error  flw 

the  concentrated  center  load. 

i/h 

Clamped 

Simply  supported 

2 

0.0710 

0.0403 

4 

0.0188 

0.01M 

8 

0.0057 

0.0033 

Conv.  rate 

1.82 

1.81 

Table  6.  The  error  |jA/*  -  A/f  |1o/||Af*|]o  for 
the  concentrated  center  load. 


Clamped  Simply  supported 


2 

0.5851 

0.4410 

4 

0.2093 

0.2161 

8 

0.1424 

0.1015 

Conv.  rate 

1.02 

1.0G 
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Uh 

Clamped 
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4 

1.0121 

1.0077 

8 

1,0043 
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1/5 

Clamped 

Simply  supported 

2 

0.4379 

0.2002 

4 

0.2285 

0.1001 

8 

0.1149 

0.0501 

Conv.  rate 

0.97 

1.00 
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1.  Introduction 

The  fxmslaikm  of  a  rmmbcr  off  pedal  eESsreUial  eqxalioas 
involves  a  *Hdi  »aw  oct  of  physical 

cobc^*&c&.  Fee  euxplc,  plate  aid  sheS  medd*  iwhe 
V,  the  tHdaew,  heat  tanfe  tbx^t  iwotopec  aatcwli 
lardves  'F,  the  ratio  of  cwdadnitxs  in  dSfctat  iFrrrtin— , 
aad  elasticity  problems  involve  V,  tie  PctMoa  ratio-  Ixx&imj 
is  the  ten  grea  to  the  deterioration  a  numerical  «d— 
wiida  occur  when  the  parameter  »  doae  to  a  fiuntiag  wise . 
(t  —  0,1  —  O,*'  — •  (15).  for  instance,  it  is  vdl-faun  that 
tie  sse  of  tie  i-tesoa  FEM  with  piecewise  Baear  dements 
for  seariy  incompressible  maternal*  (ie.  fee  s'  dose  to  05)  is 
ineffective,  predidj  dne  to  locking  (see  flj  for  other  example* 
cfbdricg). 

A  xr£»t  method  is  cae  which  guarantee*  a  certain  rate 
of  convergence  uniformly  with  respect  to  a  parameter.  Vari¬ 
ces  robust  schemes  hare  been  suggested  for  different  problems 
lardiing  locHrg  -  for  example,  mixed  methods,  higher  order 
i-Tcrsica  methods  aad  the  p- version.  la  [2j,  we  hare  developed 
a  general  mathematical  theory  for  locking  aad  robustness  aad 
their  quantitative  a— esnaent,  which  can  be  hdpful  in  the  anal¬ 
ysis  of  such  methods-  Is  [3j,  this  theory  has  been  applied  to  a 
specific  example,  that  of  Poisson  ratio  loddng,  and  the  locking 
and  robustness  of  various  scheme*  has  bees  analysed. 

Onr  goal  in  this  paper  is  to  present  some  of  the  theory  from 
[2],  which  we  do  in  terms  of  the  specific  problem  of  Poisson  ratio 
loddng.  It  sbccld  be  kept  in  mind  that  most  of  the  definitions 
aad  theorems  we  state  in  Section  2  are  applicable  to  general 
parameter-dependent  problems  as  well  (see  (2]).  In  Section  3, 
we  present  some  theorems  from  [3J  for  the  robestcess  of  various 
schemes  when  the  Poisson  ratio  is  dose  to  0.5. 

2.  Locking  and  robustness 

We  consider  the  elasticity  equations 

(2.1)  —  AO,,  —  (1  —  2t')’,grad  dirtv  =  0  in  ft 

(2-2)  £(«»(»*)  +  ~  2s/)',divS*)ni  =  gt  on  I*, 

i*i 

where  s  *=  1,2,  e  is  the  nsnal  strain  tensor,  and  g  satisfies 
a  compatibility  condition  Here,  v,  the  Poisson  ra‘io,  lies  in 

5  =  (o,o  5).  ut  v  m  (fl'en))’  «ad 

B*(vtv)  =  /  /  (  53  -  2v)“,diTttdm>)  dt 

J  ijm  1 

with  the  energy  norm  (juj^  =  {/?*(«,  ttjJ1**. 

In  the  limiting  case,  as  u  -♦0.5,  equations  (2.1)-(2.2)  lead 
to  a  Stokes*  problem  and  the  solution  satisfies  the  incompress¬ 
ibility  constraint 


(23) 

In  the  seqad,  we  take  B  =  {h***(Q))*  aad  dense 5-|*by 

Ako,  define  B  9  =  {«  €  H  \  fwj*  <  K}  and  H*  =  {«  €  H  \ 
I*Jr  <  K).  (Here,  if  may  represent  different  nlets  bet  h 
will  always  be  bounded  by  K*,  r.  constant  independent  of  s'.) 

For  cads  A'  €  A',  AT  unbounded,  let  V*  C  V  be  a  finite 
densest  smbapnee  of  dnnenrion  A-  Chen  aay  “exact  solution” 
»T  €  we  may  sow  define  the  “finite  dement 
tion”  €  Vx  by 

(IS)  BJfl^BJ^F)  Yv  €  V-* 

The  set  s  =  {V'*v}  therefore  describes  an  extension  oXforitfim, 
Le.  a  rule  for  increasing  A'  (and  hence  the  accuracy).  Let  £, 
be  an  error  fa  notional,  EJ(p)  =  (rjr  or  We  assume 

that 

C,F*(A)<  sup  inf  £rs-r5r<ftF*(A') 

*<v 

and  lor  v  6  [0,5],  5  <  0-5, 

(2JI)  CtFJlK)<  t=P  B^,-^)<C,Fc(N) 

where  CitCi  are  positire  constants  independent  of  A,  v.  Here, 
F*(A)  — ♦  0  a*  A  — *  co  and  is  independent  of  y.  Then  we  hare 
the  following  definitions. 

Definition  2.1  Let  Iimx-«#/(A}  =  oo.  The  extension  al¬ 
gorithm  -F  shows  loddng  of  order  /(A)  for  the  family  of 
problems  (2.5),  s'  6  (0,0.5),  with  respect  to  the  solution  acts 
{H„}  and  error  measures  {£,}  iff 

0  <  ^Jia^iop^inpj  I=p^ fi-K— »*)) (Fe(N)f(N))  ’)  «= 

(5.7)  ’  <  oo 

For  the  case  that  M  it  bounded  (respect  ly,  infinite),  we  say 
that  the  order  of  locking  is  at  most  (respectively,  at  least) 
/(A).  If  (2.7)  holds  with  /(A)  5  1,  then  we  say  that  T  is  free 
from  loddng. 

Definition  2.2  Let  p(A)  —  0  asA  -»  oo.  The  extension 
algorithm  F  is  robust  with  uniform  order  g[  A)  for  the  family 
of  problems  (2.5),  v  6  (0,0.5),  with  respect  to  the  solution  sets 
{//„}  and  error  measures  {£.}  iff 

jlirn^  srp(sap  (jaP#  «?))  (s(A))  =  M  <  oo 

We  see  from  the  above  definitions  that  if  /(A)  is  such  that 
/(A)Fd(A)=jKA)-0  as  A -oo 

then  T  shows  locking  of  order  /(A)  iff  it  is  robust  with  maxi¬ 
mum  uniform  order  g(A). 
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Let  St  denote  ii c  Hnrii  of  lie  *fU  assctag  of  eJe- 
racx.1*  a  €  J7  satisfying  (2^).  Define/?/  =  25*0x7*-  Ties  lie 
fctSciigcee&ixbs  been  pn»a  fee  tie  problem  (2. 1)^25) 

“P3- 

Condition  (a)  For  any  g-5*f  tiers  »*s*  6  £/  («# 
depending  03  O  «ch  till 

(i*) 

with  C  mdepesdesl  of  v  »sd  v 

Toe  fcZk»*ing  theorem  thaws  tie  sxibac  of  tie  set  S*. 

Theorem  2.1  Let  BJfn)  =  Jr  Jr  <w  let  Cm&unt 

(5x)  hold.  Then  F  shows  locking  of  order  /(//)  with  respect  to 
Brig 

s(K)=  rap  gi  |s-»Jr»<7fi(JI0/(if). 

•e*,* 

and  u  /rce  from  lodany  iff  the  afore  holds  with  f{K)  =  1. 
Monoxer,  T  shows  locking  of  order  /(.V)  ta  lie  V  norm  iff  it 
thews  locking  of  order  f{H)  in  the  energy  norm. 

Tie  advantage  o{  Theorem  2.1  u  that  one  reduce*  the  ques¬ 
tion  cf  locking  to  a  question  of  approxisaKEt j  done,  iaroinrg 
^A').  Ia  fact,  fcr  ocx  problem,  tie  following  equivalent  char- 
•etexizrtjcc  of  zs  ray  usefcl. 

Theorem  2.2  Let  g[N)  he  cs  defined  in  Theorem  2.1.  Then 
g[K)  is  efdxclent  to 


st*T  =  rap  M  U-xSw 

MLB’  «*' 

6*  =  {<&  e  W‘*’(n)  I  <  if)  «iw*  =  {*  | 
vxxev1}. 


robustness  isO(hr~l).  Forp>  3,  the  order  of  locking  is 
OfjT1)  end  tie  order  of  robustness  is  0(h?~3). 

Tie  short  shows  that  locking  cannot  be  avoided  fog  p  <  3 
foe  tic  ircnca  Also,  obj  rectangular  desati  leads  to 
locking  for  all  p. 

In  contrast,  it  tm  shown  in  [5]  that  tie  p-reeshn  (t*:=g 
ilai^l-RtJed  tout's)  leads  to  unifom a  robustness  of  order 
that  is  optimal  up  to  aa  arbitrary  c  >  0.  Ia  [3],  we  show  that 
using  bar  formulation,  it  is  possible  to  prore  this  optimal  order 
without  the  t,  both  for  triangles  aad  rectangles,  so  tiat  locking 
is  completely  rf?ar»ld,  For  the  p-vtrssca,  however,  the  sk 
of  carriH&ear  dements  is  indispensable  ia  most  cases,  la  this 
connection,  ve  hare  the  following  theorem. 

Theorem  32  Let  F  be  the  p-terno*  for  the  problem  (2^)  us- 
hf  ondaecr  elements  5*  (triangles  ond  fsainUterrds)  which 
cs*  he  mopped  onto  the  standard  triangle  or  square  using  ratio- 
nsl  mappings  Bi.  Then  F  is  free  from  locking  and  is  uniformly 
robust  with  order  (p  —  where  s  >0  depends  wpon  the 

mappings  Bi  but  is  independent  off . 

When  the  mappings  are  affine,  ve  may  take  s  =  0.  For 
additional  related  resalts,  see  [3] 


Figure  3.1  Uniform  meshes  (a) Rectangular  (b)Triaagalar 


The  abort  theorem  is  used,  for  instance,  ia  the  proof  of 
Theorem  3.1  in  the  next  section  (*ee[3j). 

S.  Robustness  results 
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We  now  consider  problem  (2.1),(2.2)  orer  0  =  (-1,1)’.  Let 
{T\}  aad  {Tf}  be  the  uniform  rectangular  and  uniform  tri- 
argular  meshes  tbova  in  Figure  3.1(a)  and  (b)  respectively. 
For  aay  set  S  CR1,  denote  by  'P^[S){7>)(S)'j  the  set  of  poly* 
nomials  of  total  degree  (degree  in  each  variable)  <  p  aad  let 
'P}(S)  =  V}(S)Q{z’y,xy>}.  Define 

'&  =  {«ec°( n),  tt| ,€^(S)  ysct?} 

for  i  =  1,2,3  where  T}  =  7?.  Then  ve  hare  ([3J,[4]) 

Theorem  3.1  Let  F  he  an  h*r ersion  extension  algorithm  for 
the  problem  (2.5)  using  subspaces  VK  =  (Vj*)*  trilA  p  fixed. 
Then  with  27,  j|  -  E*  «  above  (k  large  enough),  the  following  is 
true  for  locking  in  the  energy  norm 

Vfj,  For  p  <3,  the  order  of  locking  is  0(h~l)  and  the  order 
of  robustness  is  £)(&*"*).  For  p  >  4,  there  is  no  locking 
and  the  order  of  robustness  is  0(h *). 

Vfx  For  p>\,  the  order  of  locking  is  0(h“l)  and  the  order  of 
robustness  is  0{h*~l) 

Vf, h  Borp  <  2,  the  order  oflochng  is  0[h~l)  and  the  order  of 
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Asymptotic,  Homogenization  and  Galerkm  methods  m 
Three-Dimensional  Beam  theory 
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Abstract.  -  la  this  work  we  consider  a  three-dimensional 
linearized  clastic ty  beam  moed  end  stedy  different  approxima¬ 
tions  of  the  ihrec-dizxnsioaal  soktioa  tning  asymptotic,  home- 
gem ration  and  GaJerkm  methods.  We  show  that  the  choice  of 
the  basis  functions  of  the  GaJerkin  approximation,  commutes 
with  the  homogenization  process  bet  the  same  does  not  bold 
for  the  final  approximation  of  the  three-dimensional  solution. 

L  INTRODUCTION 

The  summation  convention  on  repeated  indices  will  be  used. 
Latin  indices  (i  J Jk, —  )  shall  take  values  in  {1,2,3}  and  Greek 
indices  (<*,£,%...)  shall  take  values  in  {1,2}. 

Let  (e,)  be  the  canonical  basis  of  fR*  and  let  u  be  a  domain 
in  the  plane  spanned  by  the  vectors  ca.  Let  j  u  j  s  1  and  let 
7  =  cb  denote  its  boundary.  Given  L  G  R,  L  >  0,  define : 

ft=«xJ0,L[,  ro=yx{0,i},  r,=7x]0,Z(. 

Consider- F  =  [0, 1}  x  [0, 1|  and  let  T  be  an  open  subset  of 
Y,  with  a  regular  boundary  5T,  such  that  T  C  ini  Y.  Define 
Y*  =  Y—T  and  let  x  be  its  characteristic  function,  periodically 
extended  to  all  the  plane  generated  by  (e«).  Let  0  be  the  area 
of  Y*  ,  that  is,  0  =}  Y*  [.Given  6  G  IR,£  >  0,  consider x*  given 
by  x*(x)  =  X(f),  for  all  x  G  Cl2.  X*  defines  a  subset  of  Cl2  with 
a  periodic  structure  haring  SY*  as  basic  cdl.  Suppose  £  is  a 
small  parameter.  If  represents  the  characteristic  function 
of  w,  x**  =  X*X*  defines  a  subset  «*•*,  of  u,  obtained  by 
perforating  with  boles  ST,  with  a  periodicity  SY.  To  avoid  a 
nonregular  boundary,  only  holes  whose  closure  does  not  touch 
7  are  to  be  considered,  j.c. ,  whenever  the  boundary  ox  a  hole 
intersects  7  wc  will  consider  x*A  =  1  in  all  the  respective  cell. 
This  adjusted  function  \*  *  defines  a  set  that  will  be  represented 
by  ut,l1  with  boundary  7^.  Define 

siKl  =  w*4  xjo, if,  rj-1  =  u>tA  x  {o,  i},  r*  '  =  74-1  xjo, i(. 

Given  c  G  R,  C  >  0,  a  dimensionclcss  parameter  that  may 
be  as  small  as  wc  please,  consider  the  homothety  je(x i,rj)  — 
(exi,era)  =  (x{,rj),  for  all  (xi,rj)  6 IR2  and  denote 

u‘=j‘(tj),  7'  =i‘(7),  u1''  = 
dT4-,=j'(0T‘y),  y‘-'=j' 
n'=u*xio,£(,  rs=u>‘x{o,i},  n  =  7'xlo,£(, 
tfs =u‘*X]0,n,  r1/  =u‘-'  x  {a,  l),  rJ''=7‘-'x]o,L|. 

Let  n  =  (n,)  (respectively  nf  =  (nj))  denote  the  outward 
unit  normal  vector  to  50  or  to  OSl*'1  (respectively  50*  or  dO***) 
and  define  the  following  differential  operators,  depending  on  c, 

%  =  ifc'h  =  =  f£. 

II.  THE  PROBLEM  UNDERSTUDY 

Wc  consider  a  beam  occupying  volume  0*-*  made  of  an 
homogeneous  isotropic  linearly  elastic  material  with  Lame’s 
constants  A  and  p,  both  independent  of  6  and  t.  Let  j\  G 
L*(Sl**)  and  g*  G  I2(r^’*)  be  the  itk  components  of  the  volume 


L.  Trabocbo 
CMAF 

At.  Prof.  Gama  Pinto  2, 

1690  Lisboa  Codex,  PortugaL 

density  of  applied  body  forces  and  surface  density  of  applied 
surface  tractions,  respectively.  With  no  loss  of.  generality  we 
consider  that  the  system  of  applied  forces  does  not  depend 
on  the  parameter  6.  We  also  admit  that  ^(i*)  =  0  if  1*  £ 
dT**x]0tL[.  Consider  the  space  of  admissible  displacements 


Y(Cl‘')  =  {r  =  (r.)  6  :  r  =  o  on  rfr, 

equipped  with  the  usual  Hl-nozm  and  the  following  three- 
dimensional  elasticity  problem  in  SY* ; 

{U**  6  V{Sl^)t 

In*  «  [2pc*(it#'r)  -f  At  rec(u,-‘)  /dje^'  )dSlf*  = 

=  fa*.,  f  ■  *<&•'  +  /rj.-  5*  -  tuff**,  Vr  € 

where,  for  each  r  G  ef(rj  represents  the  hneanzed 

elasticity  strain  tensor  and  JJ  the  second  order  identity  tensor. 
It  b  a  classical  result  that  this  problem  has  a  unique  solution. 

We  are  interested  in  the  behaviour  of  uijc%  when  6  and  t 
tend  to  zero. 

IIL  THE  CALERKIN  APPROACH 
An  equivalent  formulation  of  ibe  elastiaty  problem  posed 
now  in  a  fixed  twith  respect  to  e)  domain  SI*’1  is  considered. 
This  b  achieved  using  the  same  type  of  transformation  as 
in  the  asymptotic  expansion  method  for  beams.  Then,  the 
transformed  displacement  field  u\e )  is  approximated  by 
the  projection  of  u\e)  onto  the  approximation  space  Vu(Slf,i) 
of  l'(fl,,l))  The  following  result  holds  . 

Theorem  1.  Let  /,(*)  =  0  if  *  €  ft**1 ;  <7o(*)  =  0  if  x  G  r#-1 
and  jrfx)  =  0  rfx  G  dTSA  xjO,L[.  Let  Gj  =  {<?j  .  x  gJ0,L[-* 
Sj(x)  =  djh,  h  G  //o(]0,L[)},  the o,  if  the  force  component 
9i  €  Gjn//J<V_1(10,L[)  foriV  >  1,  there  exist  constants  C  and 
K,  independent  of  r.  6  and  N,  such  that : 

|[u‘(e)  -  «i,(t)Im<ou>  S  CXKK-'c™-\ 
flu*’'  -  4'0,/.(n-  •)  <  . 

The  approximation  space  V>v(D*'*)  is  given  by ; 

l'#M)={»  =  (‘’.)€¥',); 

N 

*.  =  £  *>„“(*„*»)■£(*>).<£  e  6 

Jfc*  0 

o>  =  «l,  6  Hi(10,i[),<3“  e 

1-0 

V0  <  k  <  Nf  (no  sum  on  6)  } 
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zsd  the  basis  functions  are  defined,  by  recurrence,  as  tbc 
solution  to  the  following  boundary  value  problems  : 

-rfd.tft-,»+lil~n‘)s.d~M  =0.  V» 6 

( L-..  p(3.<?“ + P.)a.r<£>«  - 

I  -L,.,[(A+2p)Q<*-,>‘+Aa„P“lr<L«  =6ufrrdy. 


Vre^V-*)- 


IV.  THE  HOMOGENIZATION'  RESULT 
We  proceed  in  order  to  study  the  behaviour  of  approxi¬ 
mations  ix^.(e),  as  S  becomes  very  small.  The  following  result 
bolds. 

Theorem  2.  Let  F1*  and  Qkt  be  the  basis  functions  defined 
in  Theorem  1.  For  each  k  there  cost  extensions  P  and  Ql*  of 
P*‘  and  Qkt,  respectively,  that  arc  bounded,  in  [H*(w)]2  and 
Hl(u),  respectively,  independently  of  6.  Any  other  extensions 

PM  and  QiS,  boimdcd  independent fy  of  6,  satisfy  the  following 
convergences 

p“  r,  rrnUy  in  [H' (u'jf. 

Qlf  -*  Q9,  weakly  in  Hl(u)t 

where  P*"  and  Qk*  are  uniquely  defined  by  ihe  following 
recurrence  formulas ; 

.F~lm  s  0,  Q-"=  0,  (t  =  -1); 
p-  =  P1' +*!*-,  Ql9=7t+nQ'\  (b  >  0); 

and  where  P**  is  thesolution  of  the  following  plane  deformation 
elasticity  problem  into  : 

-o,s% = pc^Pl,-,)- + aa<?!‘-"')+ 

+A(es«j+<;^,,)a„Q(1',)',  in  «, 

=  +  on  7, 

.  J„  ?« <*■»  =  =0. 

xith  5‘j  given  by 

—  Soll<,c<r(P  >'  She,  =  +  Safr,3r,e,' 

<«,= 

for  =  (x£')  defined,  up  to  a  constant  rector,  as  the  solution 
of  the  following  problem,  in  the  unit  cell  F* : 

*<'  5'-  periodic  , 

/1..(2pe««(x<')+Ae„(j;<')^j]roi(¥l)<il"  = 

=-A-lVc(v)+ 

the  elasticity  tensor  (5*^)  being  positive  definite,  satisfying 
She,  -  Si°e,  =  S’e^f  A"  =  Kj)  is  a  2  X  2  symmetric 
and  positive  definite  matrix,  whose  coefficients  arc  defined  by 
aJi  =  Qtef+fy.  0„Xi)  =  Jy.  V(X'.+’/t.Mxt+Vt),  and  nhm 
Xfi  is  the  unique  solution  of  llic  following  problem  defined  in 
the  cell  Y*  : 


fjB».  y-penodic  .  fy.X)  =  0, 

^  -Axp  =  0,  ia  I'\ 
l  3nXj  =  -”)<  “>  ar. 

Moreover,  (3**  is  the  solution  of  the  following  membrane  or 
(anisotropic)  torsion  type  problem  in  linearized  elasticity,  for  a 
two-dimensional  body  occupying  vohimeu  : 

-p  AV(.A'V<J*")  = 

'  =  p  a;* (a-T*-) + xiec„(^‘) + =h;,ec,(^‘)}+ 
+5irr«';,we(,'”‘'KA+2i')0<3“"’‘-  “  “• 

(A*  Vy“)  ■  n  =  -(A*P‘*)  -  n + A&,,  on  7, 
XQ‘-dL-=0; 


*/>*•  =  o}'  +  l‘-x,,  KJ^*=o}'+4‘"x,1 
*0‘*  =  -  a{*x,  -  aJ'xj  +  c*\ 

where  to*  is  the  homogenized  solution  of  Saint  Vensnt ’s  torsion 
function  and  where  constants  a**,  \A9  and  e4*  depend  only  on 
the  geometry  of  the  cross  section.  Q 

Theorem  3 .  Let  f  Z  [L2^1))3  and  g‘  €  The 

solution  u,»c  €  V*A  of  the  three-dimensional  elasticity  problem, 
where  the  volumic  forces  are  the  restrictions  of  f*  to  fl*"*  and 
where  the  surface  forces  coincide  in  rj  with  gf,  admits  an 
extension  in  {/^(iF)]1,  bounded  independently  of  6.  If  u*'*  is 
any  extension  of  u**;  bounded  in  independently  of 

6,  then,  for  each  fixed  C  and  as  6  —  0,  ti**e  converges  to  u9>f, 
weakly  in  [H1^))3,  where  u9‘*  is  the  unique  solution  of  the 
following  problem  : 

v,9*  6  V  =  {»  =  (»,)€  W'Jfstso  on  ro) 

=  e  tfv.dSl'  +  /n  5,f u.dTt ,  Vc  €  V*, 

o\f  =  2p[Qc*J(u9't)  +  9*  »,«!;(«••*)]+ 

+A(ee;,(n*-‘)  + 

where  the  coefficients  qi/il*  safisf>- ; 

X  A*  . 

rtoc,  =  ah(r  ihu  =  + 

=  IrSJo  -  9oJJS  “  =  ~ 

and  all  tbc  others  vanish.  O 

Theorem  4.  Under  the  same  /hypothesis  as  before  and  using 
the  previous  notations,  there  cxiit  constants  Z7  and  Tv,  C*  and 
If,  independent  of  c,  S  and  jV,  sudi  that 

||»*(«)  -  ««(<)!!/;.,«)  <  VnT!n-'cw-\ 

liu-(i)  -  u'v(«)|| «!(«)  <  C' 

!!«■•'  -  u-/H„.(n.,  <  C‘U(l<‘f-'tw-'. 
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in  any  adaptation  of  a  numerical  technique 
for  treating  a  singular  situation,  it  is  advantageous 
to  have  the  knowledge  of  the  foma  of  the  singularity. 
In  this  paper  we  extend  the  quadratic  sapping 
technique  to  the  treatment  of  singularities  in  three 
dimensional  problems .  Element  types  are  developed  for 
a  three  dimensional  situation  and  comparison  of 
results  is  sade  for  a  plate  with  circumferential 
crack.  Kith  the  quadratic  capping  technique  we  can 
perform  fracture  mechanics  computations  with  rather 
simple  finite  element  meshes.  The  element  is  shown  to 
be  able  to  reproduce  a  singular  .'train  field  in  threa 
dimensions  for  a  curved  singularity  line. 

X.  INTRODUCTION 

The  analysis  of  the  finite  element  cethod  usually 
relies  on  the  assumption  that  the  solution  of  the 
given  problem  is  regular  enough.  However,  the 
implementation  of  the  method  Is  very  often  done  oe 
problems  with  polygonal  domains  which  prevent  the 
solution  free  being  smooth  in -some  points  or  lines. 
According  Grisyard  (1)  the  presence  of  .comers  lead  to 
the  singular  behaviour  of  the  solution  only  near  the 
corners.  This  singular  behaviour  occurs  even  when  the 
data  of  the  problem  are  very  smooth.  It  strongly 
affects  the  accuracy  of  the  finite  element  cethod 
troughout  the  whole  domain.  A  considerable  body  of 
analysis  now  exists  showing  that  singularities  can 
occur  at  such  boundary  p  tints  and  lines, with  the 
effect  that  the  regularity  of  the  solution  is  reduced 
from  what  is  expected*  for  such  problems  when  the 
regions  have  smooth  boundaries.  However,  many  problems 
in  potential  theory  and  linear  elasticity  occur  lr 
regions  which  contain  sharp  comers  and  edges. 

In  any  adaptation  of  a  numerical  technique  for 
treating  a  singular  situation,  it  is  advantageous  to 
have  the  knowledge  of  the  form  of  the  singularity.  The 
fora  of  the  singularity  is  determined  by  the 
coabination  of  geometry  and  boundary  conditions.  If  we 
know  the  fora  of  a  singularity  in  general  non  convex 
domain  in  two  distensions,  wo  can  use  the  technique  of 
Aalto  12)  for  locating  the  nodal  points  of  standard 
isoparametric  quadrilateral  elements,  properly  around 
the  singularity.  The  use  of  this  quadratic  mapped 
element  for  treating  singularities  has  been  shown  by 
Michavila,  Gavete,  Dlez  and  Whiteman  (3,4,5).  Also,  it 
has  been  demonstrated  by  Gavete,  Michavila  and  Dies 
(6,7)  that  the  Serendipity  and  Lagrange  quadratic 
mapped  elements  can  be  applied  in  linear  elastic 
fracture,  because  their  strains  and  displacements  in 
the  vicinity  of  the  crack  tip,  are  appropiate  to  the 
form  of  the  singularity. 
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In  this  paper  we  extend  the  quadratic  mapping 
technique  to  the 'three  dimensional  problems.  Element 
types  are  developed  for  a  three  dimensional  situation 
and  comparison  of  the  results  is  made  for  a  plate  with 
circumferential  crack.  With  the  quadratic  mapping 
technique  we  can  perform  fracture  mechanics 
computations  with  rather  simple  finite  element  meshes. 

The  element  is  shown  to  be  able  to  reproduce  a 
singular  strain  field  in  three  dimensions  for  a 
circumferential  singularity  line.  Other  forms  of 
singularity  lines  could  be  possible. 

XI  A  NEW.  APPROACH  TO  THE  TREATMENT  OF  3-D 
CRACK  PROBLEMS. 

Our  starting  point  is  the  work  of  Aalto,  Gavete, 
Michavila  and  Diez  (2-7),  and  in  this  paper,  w© 
calculate  a  new  mapping,  giving  a  special  singular 
finite  element,  which  is  used  to  approach  a  3-D 
circumferential  crack  problem.  The  strain  form 
approach  for  this  new  singular  element  is  appropiate 
to  the  singularities  involved,  as  it  is  demonstrated 
in  this  paper.  Tho  mapping  is  as  follows: 


x  .  1  <XV>*[('»i)  -  l(lV)J<l<oi«I) 

(1) 

y=§An 

[(!♦»)  -  *  p.2-  II2)]  sen 


being  >.M.ve  (0,2) 


r 
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This  capping  is  applied  to  the  standard  20-nodes  brick 
finite  element  giving  the  decent  of  the  fig.  1  in 
physical  space.  The  singularity  line  (crack  line)  is 
placed  along  1-2-3. 

-The  approximations  of  the  strains  «jj.  «22  an<J«l2  in 
planes  ortogonal  to  the  crack  lice  are 

r^3U(Xp.v)_3U  +3U  3*_ 

3x  31  3 x  3^1  3 x  3v  3x 


p  -3vftjuv)  _  av  ax.  ,  av  au  ,  av  av 

21  3y  3X  3y  3ji  3y  3y  3y 


Similar  results  *e  obtain  for  (jtoO.  v  -  constant)  and 
for  (X*  K  p.  v  -  constant) ,  radial  lines  emanating 
froa  the  singularity  in  a  plane  ortogonal  to  the  crack 
line  (1  -  2  -  3  of  fig.  1),  where  K  >  0  is  a  constant. 
Also  we  obtain  similar  results  -for  E22  and  <v  - 
constant).  Thus  for  small  r  the  r“*/2  term  dominates 
in  all  directions  eaanating  froa  the  line  of 
singularity  in  planes  ortogonal  to  this  line,  giving 
the  approximations  to  the  gradients  the  correct 
singular  fora  as  required  by  the  true 
solution.  Similar  results  to  those  obtained  in  the 
case  of  mapping  the  20-nodes  brick  element,  can  be 
obtained  by  mapping  the  27-nodes  quadratic 
isoparametric  La  grange  brick  element  in  a  similar 
way. 


Fll.i  fau^y) .  3v(V..v)l 

2  l  3x  J 

For  the  capping  (1)  we  have 

K-  |^J  ■  [('♦»  -  JP.’-n’)]  (*J+n*)  0) 

The  forms  of  U  (X|i<  v)  ,  V(X.p.  v)  are 
U  (X,  |i,v)  m  gi  +  gj  X  -f  G3  p  ♦  ouv  +  ajX,  +  cup2+a7V2 
+  cu  tyf  a»  Xv  ♦  aicpv  +OiiXV+a^4i,+ai^i2v  v2  + 

+  01  jX*v  4auXvJ+  anXpvtaiiXV  v  +ai»Xji2 v  +a2oXjiv2  (4) 


HI.  AN  EXAMPLE  CONTAINING  A  SINGULARITY  LINE 

The  3-Dimension  crack  models  represent  a  very 
important  problem  nowadays  in  mechanical  engineering, 
considering  that'  pipe"  and  pressure  vessel  cracks  can 
be  detected,  before  the  broken  pressure  arrives,  width 
and  length  intervene  very  directly  in  this  models,  and 
even  the  crack  shape' is  important. 

As  an  example  we  have  studied  the  problem  of  a .plate 
with  a  circumferential  crack  which  is  under  an  uniform 
stress  field. 

Figuro  2  shows  plate,  subject  to  an  uniform  stress 
field  and  having  a  circular  center  crack.  Data  are 
a/t-  0.4,  L/t-  2.5  and  a/w -  0.2. 


Fig.  2 


and  hence -for  v  -  cto,  i.e.  in  a  plane  ortogonal  to 
the  crack  line 


Eu*Ai  +  £i»  Ai  +  £x  n) 

(i  A 


whore,  A^  and  A|  (1-1,2)  are  constants. 


7n  order  to  uso  the  new  napping  defined  in  (1)  for 
the  case  of  circumferential  crack  lino,  fig. 3  model, 
has  been  carried  out.  After  the  displacements  have 
been  obtained,  wo  use  the  Manu's  formula  (8),  to 
calculate  the  SXF. 

As  a  differenco  with  the  quarter  point  elements, 
here  we  don't  have  negative  pivots  during  the 
factorization  which  is  carried  out  in  the  equations 
system  solution  or  other  numerical  difficulties  as 
reported  by  Peano  and  Pasinl  in  [9) .  The  results 
obtained  for  S.I.F.  versus  the  orientation  (Wangle  of 
figure  1)  are  shown  in  figure  4. 
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Fig.  4 
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The  performed  errors  when  compared  with  references 
[101  end  [11 J  can  reach  up  to  51  and  even  lower,  which 
•represents  an  acceptable  error  level,  if  ve  consider 
that  reference  aodels  of  Miyazaki,  Watanabe  and  Yagawa 
[10J  and  Yagawa,  Zchiniya  and  Ando  [111  have  more 
finite  elements.  In  general  ,  these  errors  depend  bn 
the  numerical  integration  order,  on  the  diameter  and 
on  the  elements  aspect  ratio  of  the  model. 

Also  a  very  good  agreement  has  been  obtained  with 
the  ^accurate  results  of  Schnack  and  Karaosaanoglu 
[121 . 

XV.  CONCLUSIONS 

He  have  compared  ours  curves  of  the  SIF  values 
versus  the  angle  0,  with  those  obtained  by  Miyazaki, 
Watanabe,  Xchimlya,  Ando  and  Yagawa  (10—11)  ,  when  they 
solve  a. similar  example  with  different  strategies, 
their  results  are  very  much  the  same  than  ours 
(integration  orders  3  and  4),  but  our  grid  is  much 
less  reflnate.  This  method  has  the  advantage  of  its 
simplicity,  it  .is  possible  to  solve  problems  with  much 
loss •  ref inate  grids.  We  have  demonstrated  that  a  new 
singular  element  can  approach  the  behaviour  of  curved 
cracks  in  3-D,  calculating  with  high  precission  the 
stress  intensity  factor.  Others  forms  of  singularity 
lines  could  be  possible.  It  would  be  convenient  also 
to  study  the  Influence  of  the  geometric  shape  of  the 
crack. 
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Abstract 

It  is  well  known  that  singularities  are  present  in  solutions  of  ellip¬ 
tic  boundary  value  problems  in  domains  with  conical  boundary 
points.  The  solution  consists  of  singular  terms,  which  appear  in  a 
neighbourhood  of  a  conical  point,  and  a  more  regular  term.  The 
coefficients  of  the  singular.terms,  the  so-called  stress  intensity 
factors,  are  especially  of  interest  for  applications  We  describe  a 
method,  how  some  of  them  may  be  calculated,  if  the  right  hand 
sides  are  from  standard  Sobolev  spaces.  In  some  cases  the  co 
efficients  are  unstable  and  a  stabilization  procedure  is  necessary. 
We  handle  as  examples  boundary  value  problems  for  the  Laplace 
equation  in  two  and  three  dimensional  domains. 


I.  THE  ASYMPTOTIC  EXPANSION 

Let  fl  be  an  open  subset  of  R“,  whose  n- 1  dimensional  boundary 
an  is  smooth  with  exception  of  one  conical  point  0.  We  consider 
an  elliptic  boundary  value  problem  with  constant  coefficients 

A(D„)u(x)  =  £  AM*)  =  /(*)  in  n  W 
_ H-r- _ 

u(*)  =  £  V,DXr)=S,(r)  on  =  l,...,m. 

W  »"»i 

The  right  hand  sides  are  from  the  Sobolev  spaces  and 

the  trace  spaces  If  n  =  2,  the  boundary  con¬ 

ditions  are  also  considered  on  pieces  of  the  boundary.  In  this 
case  we  assume,  that  the  functions  <?;(*)  satisfy  compabihty  con¬ 
ditions.  Since  the  right  hand  sides  can  be  splitted  into  Taylor 
polynomials  and  functions  from  certain  weighted  Sobolev  spaces, 
we  get  e  g.  the  following  asymptotic  expansion  of  a  weak  solution 
«  from  ^"(ft) : 

(t+lm-,.1  \ 

53  r,|"+  53  r°'u«. )  + w*  (2) 

IkO  /&«*€!  / 

where 

M, 

v'=53  (lnr)iwi.“)'  (3) 

1*0 

««,  =  53  <?«..<  5^(lnr),’e,-,(a„a),  (4) 

ibO  ;«0 

/  =  (_2  +  m,  +  Jt+2m)  ,  and  to  €  W‘””'-’(n)  for  p  yf  2 . 

We  denote  by  (r,w)  the  spherical  coordinates,  s  =  s(n,p)  is 
an  integer.  Mi  >  0  is  the  multiplicity  of  the  ’’eigenvalue”  / 


of  a  generalized  eigenvalue  problem,  which  we  get  from  (1)  in¬ 
troducing  (r,w)  and  using  the  Mellin  transform  with  respect  to 
r.  Aft  —  0  means,  that  /  is  not  an  eigenvalue.  In  the  second 
term  the  complex  numbers  o*  are  eigenvalues  of  the  multiplicity 
Mo*  and  M‘0„  =  Ma„  —  I . 

The  first  term  of. (2).  comes  from  the  polynomial  parts  of  the 
right  hand  sides  [l],  the  second  term  is  known  from  the  theory  in 
freighted  Sobolev  spaces  [1],  [2)  - 

II.  THE  PXANE  CASE 

Due  to  the  compatibility  conditions  we  restrict  to  the  problem 
Au  =  0  in  ft  (5) 

Q 

Bi(,)u  =  *!,)  onrf,  =  = 

Assume  that  the  domain  ft  coincides  in  a  neighbourhood  of  a 
corner  point  H  l',+i  -0,-0  with  the  infinite  cone  K  - 
{(r,w),  0  <  r  <  oo,u?o  =  0<w<wd  =  u>q}  with  the  sides  r— . 
Introducing  polar  coordinates  we  write  the  differential  operators 

A{DX )  =  r”J"I(rDr,w,Do>) 

BflD.)  =  r-"f,W,±(r,flr,u,Dj)|ri  . 

We  say,  the  complex  number  a  is  an  eigenvalue  of  the  operator 
.4(A)  =  {/.(A,u,D„),  A/^A.w.D.,)}  if  there  is  a  nontrivial  so¬ 
lution  eo(A,u)  of 

A(A)c(A.u)v.Q  . 

We  now  split  the  right  hand  sides  of  (5) : 

= waxf  1 

where  G^(0,wf)  =  J 
s  =  0  for  2  <  p  and  s  —  1  for  1  <  p  <  2. 

The  functions  vt  in  the  expansion  (2)  are  solutions  of 

/lrrt>  =  0  in  K 
Bprlv  -  on 

They  can  be  calculated  ’’easily”,  starting  from  the  general  solution 
of  the  ordinary  differential  equation  L{ A,w.  Dw)e(f,w)  =  0  and 
using  the  ansatz  (3). 

Thus  we  get  for  the  Dirichlet  problem  for  the  Laplace  equation 

(Gj(0,0)  cosIw+i^^sinfw|  for  fu*  ^  vz 
G'i(O.O)  cosftu-f  ~^(Ip  rsinfw+wcos/w)] 

for  =r  vx 

(6) 

It  is  evident  that  one  coefficient  in  the  first  row  of  u,  is  unbounded 
(unstable),  if  wo  is  from  a  neighbourhood  of  j-.  This  behavior 
influences  also  the  coefficients  Ca„  of  u0„  (in  our  example  is  = 
£i,  ua„  =  Corsinor^*>).  Following  an  idea  of  V.G.Maz’ya  (3)  we 
get  a  stable  asymptotics,  organizing  the  sums  of  (2)  as  follows 
(here  for  our  example):  In  a  neighbourhood  of  a  critical  angle 
we  write 
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r*(7/(0,0)  ^  sioL^)  si° ^ * r^c*» *m  a>fj3 

^  ,A  .. .  ,  .  |V sink; -r-*  sin o^u/l 

=  Gi(0,0)(l  —  cosZmd)  | - — - j 


+r- 


,  <7/(0, 0)(1  —  coslup) 

sin/^j 


(7) 


The  new  coefficients  c*  =  <7/(0, 0}(1  —  cos/o^)  and  d0w  =  Cq,  + 
G^oafti-^otL*))  ajc  bounded,  if  wq  is  from  a  neighbourhood  of 
and  theSrst  term  of  (7)  converges  for  u*  ^  to  the  term  of  the 
second  row  of  (6). 

A  general  stabilization  procedure  is  given  in  [3]  and  (4). 


III.  THE  THREE  DIMENSIONAL  CASE 
We  consider  the  problem 

Au  =  0  in  ft 
BjV  =  g,  on  dft|0, 

where  the  domair  ft  coincides  in  a1  neighbourhood  of  0  with  a 
circle  cone  K  ='  =  (r,<j)  :0<r<oo,  0<y><  2r, 


If  there  is  for  a  given  d0  an  index  ko  with  cosd0)  =  0 

(see  Figure),  then  /  is  an  eigenvalue  and  terms  with  In  r  as  in  (3) 
occur. 

The  asymptotic  expansion  is  unstable  in  this  case  and  we  can 
apply  a  stabilization  procedure  analogously  to  the  plane  case. 
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0,<  d  <  do}-  Analogously  to  the  plane  case  we  define  «4(A)  and 
consider  a  decomposition  of  g,: 


'GjA 0,  y>,  d0)  +  9)  (r,  y>,  d0), 


where 


and 


for  p  >  3 
for  §  <  p  <  3 
for  1  <  p  <  | 


G,A 

It  is  meaningful  to  assume  that  ff,(r,y>,do)  ^iZ/Ky^x.do)  and 
to  consider  a  Fourier  expansion  of  Gjjfi 


G^(0,g>}  d0)  =  A[(l, do)  cos  hip  +  B’K(l, d0) sin  hip 

MO 

The  functions  vt  of  (2),  which  satisfy  the  equations 


L(/,w,  Dw)vi  =  0  in  K 
M}(I,u,Dw)vi  =  Gjj-mjCO,?,  d0) 


can  be  calculated,  writing  vt  as  Fourier  series  with  respect  to  ip. 
Thus  we  get  for  the  Neumann  problem  for  the  Laplace  equation 
(the  index  j  is  cancelled) 


v/  = 


/-/!*(/-  l,do) 

&Up.'Wo) 


(cos  0)  COS  hip 


if  £jPt~ '(cos do)  ^  0  for  all  h.  Pf  k(co$d)  are  Legendre  functions 
of  the  first  kind. 
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Abstract.  The  computation  cf  the  singular  behavior  of  an  elastic  field  near 
a  three  -  dimensional  vertex  subject  to  displacement  boundary  conditions 
leads  to  a'one  -  dimensional  integral  equation  on  a  piecewise  circular  curve 
y  in  R1  depending  hoIomorphicaUy  on  a  complex  parameter.  The  cone- 
spending  spectral  points  and  packets  cf  generalized  eigenfunctions  charac¬ 
terize  the  desired  singular  behavior*  of '  the  stress  field  We  derive  a 
decomposition  in  regular  part,  edge  and  vertex  singularities. 

The  spectral  problem  is  solved  by  a  spline  -  Gakrldn  method,- We  present 
numerical  results  for  varic  is  geometries,  characterizing  the  leading  singular 
term  of  the  desired  stress  field. 


0.  Introduction 


Elastic  fields  governed  by  the  Lame  equations  will  have  singular  bchavicr 
at  edges  and  vertices  of  the  boundary.  These  singularities  pollu’e  the  accu¬ 
racy  and  convergence  of  finite  difference,  finite  element  and  boundary  ele¬ 
ment  schemes.  For  their  improvement  by  grading  meshes  or  augmenting  the 
dements  with  special  singular  functions  one  needs  explicit  knowledge  of  the 
form  of  the  singularities  near  the  edges  and  vertices  (see  [4j  and  (2J).  This 
knowledge  is  also  very  important  for  predicting  crack  propagation. 

We  consider  the  first  fundamental  problem  of  elasticity.  Boundary  integral 
formulation  and  a  regular  transformation  of  the  vicinity  of  a  vertex  into  a 
cone  yields  a  fractional  convolution  as  the  local  form  of  the  equation.  Thus, 
Mellin  transformation  leads  to  a  holomorphically  parameter  dependent  one 
-  dimensional  integral  equation  over  a  non  smooth  curve  with  the  Melhn 
transformed  stress  as  unknown.  The  singular  expansion  terms  are  deter¬ 
mined  by  the  nonlinear  eigenvalues  and  their  corresponding  generalized 
eigenvectors  of  the  parameter  dependent  integral  equation.  For  the  numer¬ 
ical  treatment  of  the  spectral  problem  we  use  a  spline  -  Galerkin  boundary 
integral  equation  method.  We  show  numerical  results  for  circular  cones  and 
the  well  known  Fichcra  vertex. 


1.  The  singular  expansion 


Let  flcR*  be  a  bounded  polyhedral  domain  with  boundary  P.  P  shall  be 
given  by  the  union  of  piecewise  plane  boundary  pieces  P/t  P  ■■  U  0*  We 

further  assume  that  P  may  be  described  in  the  neighborhood  of  each  vertex 
by  a  corresponding  conical  surface  y  •  Rf,  y  cR\  where  y  is  a  piecewise  cir¬ 
cular  one  -  dimensional  curve  on  the  unit  sphere. 

The  fist  fundamental  problem  of  elasticity  is  given  by 

HAu  +  (2  +  ji)graddivu  *■  0 


A  denotes  the  Laplacian  and  2> fi,  /i>0  are  the  Lame  constants. 
We  assume  the  displacement  w  has  finite  energy.  ^  is  a  given  displacement 
on  the  boundary  with  at  Jeast  4>  t  //{(O ,  For  exteriorprobleras  we  assume 
suitable  radiation  conditions.  The  solution  of  (1.1)  is  determined  by  the 
given  displacement  and  the  unknown  stress  on  the  boundary  T  and  can  be 
represented  by  Betti's  representation  formula  J6]  The  So.nigJiana  identity 

-|-(.-i)Z(*)  +  i 0  2> 


yields  a  boundary  integral  equation  for  the  unknown  stress  7.  In  short  we 
write 


Tbc  fundamental  solution  of  the  Lame  equat.on  is  given  by 

_i±2L  c*  ~  y)(x 


By;)’ 


and 


■*  +  In  j  / 

+  2#l)  |  Jx  _  y 


X+Zn 


Wa)  »  (7',ko>a)])7' 

denotes  the  corresponding  boundary  traction  r(x)  is  a  ngid  body  motion. 
The  factor  e  =  I  for  smooth  boundary  points  must  be  modified  at  edges  and 
vertices  J5J  Wc  remark  that  the  behavior  of  the  solution  u  of  (1 1)  is  con¬ 
nected  via  Betti's  representation  formula  with  the  behavior  of  the  stress  I  in 
(12). 


Let  now  x  be  a  O  function  with  sufficiently  small  support  V  and  y  ■  1 
near  the  vertex  x,  If  we  set  »  =  yt,  which  is  zero  outside  ft  ,  m  spite  of 
our  assumption  on  P  equation  (1.2)  can  be  wntten  as 

f  ffti. t i> «<»!)<«, -f-  - *«)•-/<' o - r«).  (i.3) 

R  7 

with  the  remainder  term  s(r{)  =  ^(1  -  y)7  Thus,  Mellm  transformation 
defined  by 


l(a) :  -  f  rh~'  dr, 

V 

applied  to  the  fractional  convolution  (1 J)  yields  a  holomoiphic  parameter 
dependent  operator  family 


X<(«)t(.,a-i)  -A(.,a) 

f  (R>f.  <»«)“('!.  »  *<*.«)« 

7 

The  asymptotic  expansion  of  t  is  completely  determined  by  (1,4)  |7J. 
L  K /  m<* 


(1.4) 


v«> + z  z 

W  T  x(r)  + 


(1.5) 

dp*  v/>/i(£) 

“7 


where  r  is  the  distance  to  the  vertex  jc*.  L  is  the  number  of  singular  functions 
due  to  the  smooth  remainder.  at  is  an  eigenvalue  of  4  («).  The  dimension 
of  the  geometric  eigenspace  of  <J(a,)  is  denoted  by  K  ^  The  maximal  length 
of  the  chains  of  generalized  eigenfunctions  (7m,  ... ,  ca*^)  to  an  eigenvalue 
9/  and  an  eigenfunction  7m  is  mat.  The  number  c^is  a  complex  constant  and 
-rjt  is  a  smooth  remainder. 

The  expression  in  brackets  belong  to  the  edge  singulanties  which  axe  given 
by  corresponding  special  two  dimensional  problems  (see  [8)j.  These  edge 
singularities  can  analytically  evaluated  and  arc  not  ol  our  interest  here.  The 
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main  problem  is  to  obtain  the  vertex  singular  exponent  ah  which  can  only  Figure  2.  shows  the  interaction  of  the  the  smallest  imaginary  eigenvalue  and 
be  done  numerically.  the  Lame  constants  for  the  well  known  fichera  vertex  |3J 


2.  .Numerical  approximation- of  the  eigenvalue 
problem 

To  approximate  J(a)e  »  0  we  use  graded  meshes  near  the  corners  of  y 
|1J,  TTic  grading  exponents  are  chosen  suitable  to  the  singularities  of  the 
generalized  eigenfunctions,  which  themselves  belong  to  the  edge  singularities. 
Our  Galerkin  procedure  for  the  approximation  of  the  non  linear  eigenvalue 
problem  reads  as  follows  |8J* 

Compute  Tahiti  a.",  such  that  the  linear  system 
2  NR 

Yj  9  Ki  (a)  *i*  A> o  -  0  ■  I-  U  -»  2MP 

/-i 

has  non  trivial  solutions  (A. Omr). 

where  m  and  n,  ore  piecewise  constant  B  -  splines  on  y. 

The  value  a*  is  the  approximated  eigenvalue  of  4  («)-  We  have  proofed  the 
following  asymptotic  convergence. 

with  constant  c  independent  of  N,  Nt<  k  denotes  the  Rtesz « Index  of  the 

eigenvalue  *».' 


Notice,  that  the  alpha  axes  in  both  figures  denotes  the  imaginary  part  of  a 


3.  Numerical  results 

First  we  consider  the  case  when  our  domain  O  is  a  circular  cone  with  angle 
20: 

Kan 0  co%<f\ 

sin0  sin<£  I  .  rtR,  OS0S©, 

«,,<?  J 

In  the  following  Figure  1.  we  show  the  dependence  of  the  eigenvalues  a " 
with  real  part  equal  zero  and  smallest  absolute  value  on  the  angle  0, 
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Abstract 

The  solution  of  an  elliptic  boundary  value  problem  as  the 
Lam4  system  contains  singularities  at*  reentrant  edges  and 
comers.  The  stress  Intensity  factor  .is  given  by  the  coefficient 
of  the  known  edge  singularity.  Tt  can  be  calculated  efficiently 
by  a  mixed  numerical  method  with  special  boundary 
elements  around  the  crack  front  containing  the -asymptotic 
cr«:k  solution  and  with  finite  elements  in  the  residual 
domain.  The  unknown  eigenvalue  of  .  the  comer  singularity  in 
case  of  a  surf**  crack  can  be  determined  numerically  from 
the  stress  field  around  the  crack  tip  computed  by  a  FE 
method 

jatesfasfea 

Sngulanbes  In  the  solution  of  elliptic  boundary  value 
problems  are  important  for  theoretical  considerations  in 
mathematics  as  well  as  for  the  calculations  in  engineering 
applications.  In  linear  elastic  fracture  mechanics  (LEFM)  crack 
problems  are  described  by  the  LamA  system  tor  the 
displacement  field  u  For  three  dimensional  domains  this 
ellphc  boundary  value  problem  contains  singularities  at 
reentrant  edge*  and  comers.  In  the  interior  of  a  structure 
cracks  are  forming  reentrant  edge*  with  an  opening  angle  2x, 
At  the  intersection  of  a  crack  with  a  free  surface  a  reentrant 
comer  exists.  At  the  Intersection  point  a  comer  angularity  is 
superpcaed  the  edge  singularity  of  the  intersecting  crack  front 

Edge  angularity. 

Fox  cracks  in  the  interior  of  a  structure  the  asymptotic 
solution'  is  well  known.  The  stress  field  near  the  crack  tip 
behaves  like  1/VT.  Important  parameters  to  calculate  for 
fracture  mechanics  are  the  stress  intensity  factors  given  by 
the  coefficients  of  the  1/VT  singularity.  The  numerical 
calculation  of  the  stress  Intensity  factors  in  this  case  can  be 
done  very  efficiently  by,  a  mixed  method  using  trial  functions 
fulfilling  the  equilibrium  and  Neumann  boundary  condition 
ill.  [2]  13]  realized  by  the  eigenfunctions  of  the  solution  for 
the  semi-infinite 'crack  [4). 

The  considered  domain  is  devided  into  a  region  around  the 
crack  front  Qj  with  elastic  fields  fulfilling  the  equilibrium  and 
the  Neumann  condition  exactly  and  a  residual  domain  0^ 
where  a  finite  element  formulation  is  used.  Applying  the 
principal  of  virtual  work  a  generalized  equilibrium  <x,  lition 
is  obtained. 

Jo  «*do  -  /fu*  dr  ♦  J"t  u*  ar  -  /tv  dr  a  o 


where  0  and  s  denote  the  stress  and  strain  tensor,  t  and  u 
the  traction  and  displacement  field  and  V  the  given  traction 
as  Neumann  condition.  The  test  functions  are  marked  by  a 
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star.  The  surface  TN  represents  that  part  of  the  boundary  with 
the  Neumann  condition.  In  addition  there1' results  a  generalized 
continuity  condition: 

/  (uo  -  V  *” dr  3  0 

r, 

where  denotes  the  displacement  field  fulfilling-  the 
equilibrium  and  Neumann  condition  and  Uj.  an  additional 
displacement  field-  on  the  boundary  of  the  crack  region, 
which  is  compatible  to  the  adjacent  finite  elements. 

For  the  discretization  of  the  crack  region  trial  functions 
u^=Lfi  and  tsK0  are  formulated  on  the  surface •  of  macro 
elements  with  free  parameters  p.  The  'matrices  L  and  R 
contain  functions  fulfilling  the  equilibrium  and  Neumann 
condition  realized  by  the  eigenfunctions  of  the  solution. for  the 
semi-infinite  crack.  Additionally  the  displacement  field  u^Nd 
is  formulated  with  the  interpolation  matrix  M  and 'the  nodal 
displacements  d.  As  result  a  stiffness  relation-  with  the 
following  stiffness  matrix  for  a  macro  dement  in  the  crack 
region  is  obtained: 

1^  ■=  Tt  IT1  I 
with 

T=/sTKdr  «nd  HsJVldr 

r.  r. 

which  is  symmetric  and  positiv  definite.  Therefore  these 
macro  elements  can  be  easily  combined  with  other  finite 
elements.  This  method  has  additionally  the  advantage  of  a 
boundary  element  method  because  for  the  macro  elements 
only  surface  integrals  has  to  be  calculated. 

As  a  test  example  an  elliptical  crack  under  tension  was 
calculated  with  a  ratio  of  the  main  axis  of  3.  Because  of 
symmetry  only  1/8  of  the  donvin  has  to  be  considered. .  The 
region  around  the  crack  front  was  discretized  by  6  pain  of 
hexahedral  macro  elements.  For  the  residual  domain  about 
300  tetrahedral  elements  with  quadratic  displacements  were 
used.  The  numerical  result  for  the  stress  intensity  factor  along 
the  crack  front  (Fig.  1)  agrees  well  with  the  analytical  value* 
for  an  infinite  body.  The  maximum  deviation  is  aout  6%. 
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Fig.  2  Discretization  hr  a  surface  crack 


The  infteaectfon  pcsnl  of  a  cade  front  wife  a  5*e  mkt 
form*  a  reentrant  corner  wife  fte  cade  freed  «  an  fnror-lrg 
reentrant  edge.  Ai  Ws  pcinf  «i  urJcxmm  corner  £ijirty  Sc 
sipespoaed  to  the  known  edge  xngtijriey.  fa  Car  ca«  Use 
domain  can  be  deecribed-  by  *  jdyhekn  in  It*.  The 
following  deonporito  in  a  pdyhedrai  cone  wife  fte 
spherical  cocsdinate*  her  been  proved  by  M.  Deage 

151  and  von  IYteodceff  fox 

« = «0  ♦  xwS  v**  l,iM 

Jd 

[LOO  £'  »  ■! 

xtoES1  -  to^*W0r*i)  [ 

4-*  <rO  J 

The  comer  singularities  are  given  by  the  eigenvalue*  \ 
while  the  sura  over  j  describe*  the  edge  angularities.  In 
special  ernes  there  appear  logarithmic  term*.  The  function*  x 
are  cut-ofl  functions  and  vQ  represenk  the  regular  part  Fcr 
the  surface  crack  this  leads  near  the  free  surface  to  the 
(allowing  expression: 

"  5  "o  ’  E  V*1  V-f'  *  V*11  *cfe})!(v>) 

Sc  Sc 

with  c{z) — *0  fa  z— *0.  where  the  z-axis  is  given  by  the 
crack  front  and  p  denote*  the  distance  to  it  The  displacement 
field  u  can  be  decomposed  Into  a  coiner  singularity  given 
by  Xk  and  an  edge  singularity  of  Vp-type.  where  the 
coefficient  can  be  interpreted  as  a  stress  intensity  facto.  This 
stress  intensity  factor  depends  on  \y  and  tends  to  zero  to 
z — *0  when  the  smallest  Xj>-~ .  Otherwise,  it  increases  to 
infinity  approaching  the  free  surface. 

The  unknown  eigenvalue  Xk  d  the  comer  singularity  can  be 
determined  numerically  frem  the  stress  field  around  the  crack 
tip  computed  by  FEM.  If  logarithmic  singularities  are  absent 
the  asymptotic  solution  to  the  stresses  has  the  following  form: 

°u  * ,x  hp 

Plotting  the  stresses  versus  the  distance  to  constant  polar 
angle*  6  and  p  in  a  double  logarithmic  scale  one  gets  a 
straight  line  with  the  unknown  eigenvalue  as  the  slope.  In 
Fig.  2  the  discretization  o!  1/S  of  a  structure  with  a  crack 
front  coming  perpendicular  to  a  free  surface  is  shown.  The 
discretization  into  2300  tetrahedrons  results  in  about  11000 
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yJricfcd  a  very  effesUvt  pcssslbilUy  of  tise 
utilization  cf  lees  optical  sy;. thesis  for 
practical  less  optimization  arid  cteslp*  15J  - 
IP2.  In  this  paper  v?  construct  the  axial 
peustial  distribution  as  a  fifth  order 
vSpiine  to  improve  etr  ray  tracing  model  1101 
criminally  based  on  J»  ccbic  spline 
technique. 


Abstract  -  A  computerized  ray  tracing  model 
for  systematic  investigation  of  axially 
symmetric  electrostatic  lenses,  frees  tie 
point  of  view  of  the  spherical  and  chrcaatic 
aberrations  is  proposed.  The  calculation  ef 
the  first  erdsr  optical  preperties  cf  the 
lenses  and  their  spherical  and  chrccatic 
aberrations  is  based  cn  the  direct  numerical 
solution  cf  the  trajectory  equation  fer 
par  axial,  nonpar  axial  and  chromatic  rays. 
The  axial  potential  distribution  cf  the 
focusing  field  Is  construct**:  as,  a  fifth 
order  spline.  The  tvo-interval  fifth  order 
spline  lenses  have  been  studied  in  some 
detail  and  the  experiences  cf  the 
application  cf  the  codel  are  presented  in 
the  paper. 

X.  INTRODUCTION 

Axially  symmetric  electrostatic  lenses  are 
very  important  in  the  instruments  cf  the 
different  electron  and  ion  bean  technics 
serving  the  purposes  cf  surface  science, 
analytical  chemistry,  electron  and  ion  bean 
lithography  cr  icn  implantation.'  for 
instance.  Consequently,  the  developments  and 
the  utilizations  cf  new  computational 
methods  and  computer  applications  for 
designing  price  electrostatic  lenses  with 
rotational  symmetry  are  essential.  Since  the 
spherical  and  c  hr  oca  tic  aberrations  are  the 
main  limitations  cf  the  optical  quality  of 
the  electrostatic  lenses,  we  select  the 
spherical  and  chromatic  aberration 
coefficients  In  infinite  magnification  mode, 
referred  to  the  object  space  and  related  to 
the  object  side  focal  length  as  figures  of 
merit  to  describe  the  lens  performance. 
Currently,  beside  the  advanced  computerized 
techniques  of  lens  analysis  Cl)  -  C3).  a 
very  effective  approach  to  design  axially 
symmetric  electrostatic  lenses  which  provide 
usable  optical  properties  in  first  order  and 
also  the  best  figures  of  merit  is 
electrostatic  field  optimization  and 
synthesis  cf  lenses  (41.  C5).  The  essence  of 
icn  optical  optimization  and  synthesis  is 
that  any  imaging  electrostatic  field,  its 
optical  properties.  aberrations  and  the 
applied  figures  of  merit,  together  with  the 
electrode  configurations  of  the  lens  making 
possible  the  imaging  field  are  totally 
determined  by  the  axial  potential 
distribution  of  the  field.  Consequently, 
instead  of  analyzing  a  large  number  of 
different  electrode  and  pole  piece 
configurations  varying  the  geometrical 
parameters  and  the  electrode  voltages  to 
find  lenses  with  higher  performance,  it  is 
advantageous  to  investigate  axial  potential 
functions  to  select  the  optimum  axial 
distribution.  Obviously.  In  the  approach  of 
synthesis  it  is  essential  how  the  axial 
potential  distribution  Is  defined  and  built 
up.  The  application  of  cubic  splines  has 


XX.  EAT  TRACING  HCCGL. 

In  case  cf  electrostatic  lenses.  the 
meridional  mot  lens  cf  charged  particles  can 
be  considered  to  calculate  the  first  erder 
cptlcal  properties  and  the  spherical  and 
chromatic  aberrations  without  violating  the 
generality  of  the  problem.  For  meridional 
motions,  the  following  traversal  differntial 
equation  flOJ  determines  the  actual  particle 
trajectories: 

1  ♦  CQ/n  Xu  —  «  )/cz 
r~  m  O  r* 

12  *  CQsa  >Cu  -  u  J/'e'jCu  -  n) 


where  Q  is  the  particle's  charge.  ~o  is  the 

particle's  rest  mass,  u  ■  uCr.j)  Is  the 
potential  of  the  focusing  field,  r  »  rim)  is 
the  particle  trajectory,  ti^  Is  the  potential 

where  the  velocity  cf  the  particle  -is  zero, 
and  c  Is  the  speed  of  light  in  vacuum.  The 
cylindrical  coordinate  system  Is  used  and 
the  primes  represent  differentiation  with 
respect  to  z  which  Is  the  Independent 
variable  along  the  optical  cr  symmetry  axis 
of  the  xens.  The  potential  can  be  expressed 
as  a  power  series  In  r  In  the  fora. of  ill) 

uCr.zJ  -  IKzi  -  CrViltrCzi 

.  Cr‘/« >U — CzJ  -  C 2> 

where  V  ■  (Xz)  Is  the  axial  potential 
distribution  of  the  field.  If  the  axial 
potential  U  built  up  as  a  cubic  spline  the 
third  term  in  Eq.  C2>  will  be  a  delta 
function.  This  fact  can  yield  some 
inaccuracy  for  the  third  order  spherical 
aberration  Cthe  error  of  the  calculation  can 
be  estimated  by  using  Integral  expressions). 
When  a  fifth  order  spline  model  Is  utilized 
the  forth  order  terms  are  continuous, 
therefore  Inaccuracies  can  occur  only  in 
higher  order  approximation.  The  differential 
equqtlons  for  paraxial,  nonparaxlal.  and 
chromatic  rays  can  be  given  by  only 
substituting  the  power  series  of  uCr.z)  in 
the  appropriate  order  into  Eq.  Cl). 

III.  RESULTS 

A  systematic  analysis  of  the  uni  potential 
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2s!  retarsiisg  splice  leMt: 

hive  beta  carried  cet  by  cslc?  the  =od*l 
cctliari  in  Sec.  IX-  In  case  of  oipstentiil 
lenses,  the  spbtrial  flcsre  of  serit  C^//^ 
as  the  fcactlen  cf  the  axial  pcter.tlil  at 
the  middle  pciat  ef  the  lens  is  shewn 

by  solid  erve  in  rig.  1.  is  related  to 

the  wlu*  cf  the  axial  potential  at  the 
cbjezt  side  lens  endpoint.  The  dashed  curve 
shews  the  spherical  figure  ^  merit 
calculated  by  utilising  the  ray  tracing 
scdel  based  en  cubic  splines  (123. 


Fig.  1.  C  //  as  the*  function  of  the  axial 

so  o 

potential  at  the  lens  nidpeint  t/^.  The 

solid  and  the  dashed  lines  are  for  the  fifth 
and  the  third  order  spline  sodels. 
respectively. 

In  a  special  case  of  retarding  lenses,  when 
an  approximately  nine  tines  change  in  the 
kinetic  energy  of  the  particles  is 
considered,  the  spherical  figure  of  serit  is 
shown  in  Fig.  2. 


Fig.  2.  C#0/'/c  as  the  function  of  the  axial 
potential  at  the  lens  midpoint.  U  .  The 

mid 

solid  and  the  dashed  lines  are  for  the  fifth 
and  the  third  order  spline  models, 
respectively. 


The  results  sees  to  indicate-  that  the 
epheric*!  figure  of  serit  has  a  similar 
trend  in  cases  of  the  fifth  order  and  the 
cubic  splines.  However,  the  corresponding 
particular  values  are  significantly 
Afferent.  In  ease  of  the  retarding  lenses 
the  sdrilsua  of  the  curve  of  the  spherical 
figure  of  serit  determined  by  using  the 
fifth  order  spline  sodel  is  such  sere 
expressive  than  vhich  is  calculated  by  using 
cubic  splines. 
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ABSTRACT 


cface==crri=l  triads  as  teaks  sjs^s.  Uestcfce  pc*£to5 

sy~=s  *be=  cayagSr  effects  tsgJcctsdL  1=  da  r=?= 
s-spfcr  ccasii=tx»r  rcbccsriyp*  arc  adepts  fcf  rcssaoce  aad 

Cdds  fcaricg  pastor.  csss  Ga*  rto  3=d  trrpemtos  os 
pzizccas.  Waft  toss  ctorrts.  c  is  pcssSfs  to  rrpresssi  astsal 
psc=^sjr^xji==r^=eicl»5===3-  Tbc  ccecrci  t^hc  zed  to 
p^e  srs  codtlfcd  2J  cqrirtoa  criScss .  Tbs  or  '£ct  garamstos  namely. 
CeBcctcctexcc  c  sajHcattdfttcccRMV  rttsaari 
byasm^feitotoraiecteaps.  The  tetorrspcssen  terms  cf  pressure 
nKa  &  take  ctomber  is  csSriL.  Acts!  field  cxpcacrass  ere  cco- 
&=ri  o  ease:  to  djuatos  k^ckk.  A  good  corixa  to  lea 
cteared  bz*tca  to  ojgriscssl  nria  23d  to  simserd  exxfeL 


1.0  INTRODUCTION 

Dyussac  response  «  vny  important  in  as  teoke  aa=rxo  ard  to  3  toeet 
tear—  c=  to  srppbg  £to=ce-  Utemme  safriy  cf  to  Triple  ifcpesis 
open  to  jrppcg  distoee.  Tiers  are  riso  cstoa  cstoqr 
10  be  compiled  »<h.  by  to  teaiirg  systems.  There  is  hardly  a=y  »■=* k 
reported  on  artosawive  air  brake  sjstons  response  studies.  These 
respects  ck2raetsrisucs  can  be  best  ebtaaed  through  cstoasxrri 
models. 


Of  to  maay  methods,  to  power  tend  gmph  it  chosen  because  of  A 
saperiorey  ta  rcprrseatmg  physical  acuca  totatog  to  dynarme  response 
by  means  cl  compact  sjsabcto  coulee  aod  case  cl  codification.  Fetor, 
to  brake  q«a  actsauso  is  ctS3D»Kd  by  ijess'c  twjy  flows. 
Hence  bosdjrapi  repre jersauca  it  vcryrcfcvasi  n  this  cosies.  Wh3c  it 
is  well  cKobfitod  to*”  to  c*l  hydraulic  systossjhc  css  cf  bond  graph  for 
to  pncosaoc  sysra  is  still  odar  dcvcfcpmeat  So  an  acemjt  is  nod: 
toe  10  dcvxop  a  bood  graph  fesnboa  clsi»  brake  system. 

Air  tcsJx  systems  css  compressed  air  10  apply  to  brakes.  Air  it  coo* 
pressed  by  a  compressor  driven  by  to  ergbe  and  stared  in  a  reservoir. 
When  to  driver  depresses  to  foot  pedal,  air  from  to  reservoir  flows  id 
to.  brake  chambers  at  a  pressure  (fetenumed  by  driver's  pedal  effort  and 
displaces  the  diaphragms  in  the  chambers.  Tbs  noitmcct  of  tbs 
dopbagms  is  transmitted  to  to  brake  shoes,  free;  tom  against  to 
dress  and  thus  braking  tbs  wheels. 

The  dynamic  response  cf  a  cocrplete  brake  system  consists  cf  a  quasi-static 
component  and  a  uansiect  component.  The  transfer:  behaviour  is  as* 
soebted  with  rapidly  changing  system  variables .  soch  as  brake  bns  pressure 
following  a  rapid  pedal  force  spot.  The  qsasi-suic  behaviour  is  as¬ 
sessed  with  slowly  changing  variables,  soth  as  to  change  b  cocffkien: 
in  friction  between  lining  and  rettr  <fcc  id  change  w  wtoel  speed  dcring 
to  (fcceleration  of  to  vehicle.  In  this  only  to  transrera  behaviour  is 
modeled.  The  aetoal  supping  distance  is  affected  by  tone  deb>s  required 
to  appl/lrakes  and  to  build  tip  to  brake  force  .Generally  the  attaa&n 
time,  ts  small  compared  10  to  build  up  time  and  hence  to  boi^up  lime 
is  alone  modelled. 

2.0  MODELLING  OF  THE  FLOW  PROCESS 
The  flow  through  to  brake  pipes  and  valves  can  be  modelled  either  as  a 
continuous  model  or  as  a  lumped  model.  Daring  to  IniuaJ  design  of 
b)outs.manycoo figurations  require  quick  evaluation .  So  a  lamped  model 
approach  is  favoured  as  continuous  models  are  highly  one  ceo«uming.  To 
arrive  at  a  suitable  lumped  system  .Pot  w  bond  approach  has  been  adoped. 


Heycgsn^cptlisagriitcnmsrit}  ^ufesttopsopsar^ 


KARNOPP [1J  hjjiowibtixactcprt^lbJfbvpjcraitB 
sccesszj  id  use  to  seipoeca  ceto^py  Ch  )  as  to  rffen  variri*  aai  to 
csss  fia-#  raj;  (c)  as  to  Os*  vrsifc  to  prrperfy  acax=a  for  r=d  do*. 
CH£NGIHpo£fi^  to  above  cqxtoaaad  developed  a  sacfcribgcti 

ckrgats  fo;  pucrcccbe:  sjSZ=3.  The  Ctchripy  (h)  h  oprrsced  tn  Krz=s  cf 
pttssge  (p)aad  to  temperance  (I)  aaj  toy  art  used  as  tficn  rsiahfcs. 
This  ^jrexh  wrii  suobfc  cc&kziecs  has  fcena  adcpced  fix  to  troir 
qnotohtica  AsaspSSedbyoacfaarirtrakesyecotsshowaa 
Bg.  V>hga  to  Tohe  a  eperred.  rr  fowa  from  to  rtsmxxi  lo  acgaScr. 
The  vetesg  cf  to  szsssxc  is  early  ccessnc  Resscnce  is  offered  by  to 
run.  asd  also  by  to  pipe.  Wfceo  to  ito  is  frity  opened,  to  rsisan 
effered  by  to  vahe  sod  to  p^e  cm  be  ccesatocd  esuex 
The  steps  iaveJttd  are  : 

-To  find  sutarie  cctod  to  specify  vrire  as  as  eq=vafca  ottos 
-To  find  eqnxvrien*  crifke  for  pipes 

-To  &vefcp  pewtx  fte*  model  cf  to  sjsum  aai  seaericrily  iaejrsu  to 
cqearioes  derhod  from  to  model 

2.1  TIIESVSTEM  BOND  GRAPH 
The  lumped  model  rcpccseseatma  cf  to  simpETied  rir  treks  system  Is 
shown  in  Fig.  2.  The  brake  appScatma  wKs  is  represemed  as  aa  orifice 
whose  ccrduetasee  V  and  critical  pressure  raw  V  is  esrimaaed  fcocgli 
a  transect  technique  [3J-  The  pipes  am  also  represemed  by  equhriea 
orifices  whose  c  aad  b  values  art  cfctatod  nsacg  an  empirical  rebucosh;p 
ICj.  Tbs  pepc  vofcocs  arc  lamped  with  acfcnior  volcmc  as  capacitance. 
The  sysan  equaooas  derived  from  bond  graph  arc  listed  below  with  to 
paramctfr  Vabcs. 


rvocaooci  . 

cl-cooduetasce  cf  brake  appficabco  valve  In  do  fsbsx 

Crcooducuncc  of  ppcs  corxcctcg  to  (reel  brake  chamber  in  to  /s-bor 

CyCoodDaaacc  of  pipes  ccnaccung  to  rear  broke  chamber  in  dm  /sTur 

p#-rcjervoir  pressure  in  bar 

pi-pressure  at  pipe  junction 

pf  pressure  in  front  brake  chamber  in  bar 

pj-prassurc  in  rear  brake  chamber  in  bar 

uvmass  in  kg 

di«-RUSS  flow  rate  from  reservoir  to  pf*  jsnerion  in  kg/s 
ihrmass  flow  raic  from  juxticn  to  (ton  brake  chamber  in  kg/s 
nyreass  flow  rate  from  junction  to  rear  bake  chamber  in  kg fs 
ai-nct  mass  flow  rate  fiom  rcservwr  to  pipe  junction  in  kg/s 
THcmperatcre  Ln  k 
Cfrcapacfcance 

R  -gas  constant  »2S7  Nm/kg  i 
Vrvolsme  in  tn* 

l  -time  in  seconds  -  . 

p,  -  Density  of  airkg/.m5;  20C*  !23kg/n 
•n  -polyiropc  constant 
To-Tcrnperaicre  20  C 


[quilions  .  ,-w. — 

s.c.p.r^r 


(illtlm  bt ) 
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Zgzc tpA  4^iw  is  c3)  <ss*a 

s,  «  =*-n,  *fir  t-5) 

Pi-PiCc j  ./-jij-  ~  •  -£?4j  csj 


Ti.rfCo)(3^r)<-**  C5) 

lcc=l  votes 
rtesrc  *a  jscjs  *=irs 
?^C)-7Jfc=r 
p:(0)  *  0  fc=r 
p^C)  -  0  ter 
pXO  -  0  ter 

7  (0)  =»  300  K 
PARAMETER  VALUES 
ci-1425  (*i2e  cb~^ 

*  3.19(»i2cdisSarjs:;) 
b]z0 
c,«5.SS57 
fc»x02739 
c^3.010 
bj4X3217 
ol.4  0 
V..0L725-10  a* 

Vy*  1159-10  n> 

Vi- a<23-lff,sa> 

These  cqssesss  srs  esserkdly  sccpdcd  to  cicb  Sc  sysca  response. 


Tic  S-ccrsccl  e=pe=sc  set  cccpsgd  win  Sc  cpssgsi  e=£s 
chsdfco3i<ij±fc)ca.p^.3»i4) 

22  RESULTS  AXD  DISCISSION 

t^vciccscsperc^mLlsctrsssj.  ilcecsfcs»rs±)J5cf 
Sc  sjspa:  pas:  g  03  srssfs  sd  fie  arrjciS:;  gated 
tots  b  =i»  0L3.srcaA.  Tic  cyrlr'M  s  besr  *3  £e  fca  trie 
cLir-Ser.  To  \=^zrz  Sc  fossae  =©£d.  Sc  Go»  »  Sc  s=f  pcrcca 
%LiS  a  js«==ed  by ;Sr  pescsc  cc  Sc  jsrdca  =qr  be  cxrScSci  «S 
scec  f=cexe  a  srrrcrt  fir  Sc  scree  tcdk  Gc»  Ss  eco/  C&s 

fbs  £ac  Se  facjcas. 

3i)  CONCLUSIONS 

Tie  pcssSsE^cftsajpcaer  bead  pcp&ocieba  Sc  leaped  ssaSdcf 
ccrragSd^cia^siftg^sya^gpc^arSgcfcgcgcqaaaaaaa 
Sc  -acy  o  ysfy  Sc  ;=ra»  cc  Tic  6sad  kSs 

css y=c  «c3  »sS  opryragcrd  fei  TLa  »cd:  csdd  faa  a  fas*  fcg 
ccs=S=S  Sc  codd  so  sdsdc  Sc  oa==$  *c!*e  focaaes  sad  sl»  so 
Scfc^X3sytoscasa^y«sQjgas^^ss2^cfbciaqrC£sg«aS 
ancvcftsp^ys^Serdjsscda.  t*S  ~z  crzxzl  a  Se  K»ss3xk 
fcrcie  sysacs- 
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A  CONTRIBUTION  TO  THE  NUMERICAL 
STUDY  OF  THE  VAPOR  FLOW 
CHARACTERISTICS  OF  SLENDER 
CYLINDRICAL  HEAT  PIPES 
by 

dzsem  S.  AtiiB; 
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Abstract:  Use  Xarser-  Stocks  cqratios  and  the  continu¬ 
ity  equations  which  are  applicable  to  Use  laminar  vapor  fl otr  a 
slender  cyEsdnczl  heat  pipes  are  solved  numerically.  An  impHdt 
finite  difference  method  is  used  tc  sohe  Use  ep’-tKxn.'lhc  pres¬ 
sure  gradient  terns  is  eliminated,  which  reduces  the  number  of 
nsknowns  by  one.  The  effect  of  pressure  drops  is  then  studied 
for  different  Reynolds  numbers. 


du  .  Idjtr) 
dz  ‘  r  dr 

Subject  to  tie  boundary  conditions, 
ced  wall  conditions 


i)  u(0,r)  =  «(l,r)  =  0, 


(2-2) 


1  INTRODUCTION 


Through  a  porous  wall,  the  vapor  flow  dynamics  in  the  evapo¬ 
rator  and  condenser  of  a  beat  pipe  is  very  much  similar  to  pipe 
flow  with  injection  and  suction.  The  principle  of  thermal  fluid 
governs  the  performance  of  the  heat  pipe.  The  reason  bring  that 
the  heat  pipe  is  a  heat  transfer  device.  Good  attention  was  given 
to  the  fluid  mechanical  aspects  of  heat  pipes.  See  for  example 
the  review  given  by  Tien[2]. 

To  obtain  steady  state  operation  of  a  heat  pipe,  the  sum  of 
the  axial  pressure  difference  due  to  body  forces,  the  pressure 
along  the  flow  lice  due  to  surface' tension  and  both' the  liquid 
flow  and  the  vapor  flow  gradients  must  all  add  up  to  zero.  It 
should  be  noted  that  rdativly  speaking,  the  fluid  dynamics  of 
sapor  flow  is  not  so  easy  even  if  the  geometry  and  boundary 
specification  of  the  problem  are  simple. 

Based  on  the  radial  flow  of  the  vapor  at  the  wall  of  a  heat 
pipe,  a  radial  Reynolds  number  Re  is  introduced.  It  is  a  well 
known  fact  that  if  the  absolute  value  of  fie  is  much  less  than 
one,  viscous  effects  dominates,  while  if  lie  is  much  bigger  than 
one,  momentum  forces  dominate. 

Theoretical  investigations  have  been  done  to  steady  incom¬ 
pressible  laminar  flow.  \Vhiter{4)  have  found  similarity  solutions 
but  Masuzawa,  Tanahashi  and  Ando(5],  Gupta[6]  and  Hombcck, 
Rouleau  and  0slerle[7]  found  entry  region  solutions.  To  sum¬ 
marise  the  results  for  different  Re.  There  will  be  dual  solutions 
at  the  walls  for  Re  less  than  2  3  and  for  values  of  Re  between 
9.1  and  20  6  .  Multiple  solutions  are  obtained  when  Re  is  20.6 
and  one  solution  occurs  when  Re  is  2.3  .  The  flow  approaches 
poiseuille  form  when  Re  approaches  zero. 

2  BASIC  EQUATIONS  AND 
ASSUMPTIONS 


We  will  assume  steady,  incompressible,  axisymmctric  and  lami¬ 
nar  flow.  The  pressure  distribution  depends  cn  axial  coordinates. 
Also,  we  will  assume  constant  radial  velocity  through  the  pipe 
wall  and  the  radial  velocity  component  compared  to  the  average 
axial  velocity  is  small. 

The  governing  equations  arc. 

Axial  momentum  equation 


du  du  -1  dp  fd7u  1  du . 

,,s+'8;=7s+'l9?+^)' 


(2.1) 


Continuity  equation 


axial  conditions 


ii)  r(0,r)  =  r(l,r)  =  0, 


i:z)  r(r,0)  =  0, 

■»)  ^(i>°)=°. 


no  slip  condition 


and 


where 


»)  «(*.ro)  =  0, 

p)  =  c  (constant) 

{.c>0  Injection. 

c.=  0  Adiabatic  zone, 

c  <0  Suction. 


(2-3) 


This  approximation  is  made  for  positions  of  x  which  are  more 
than  one  radius  from  the  end.  This  b  since  the  pressure  gradient 
term  may  not  be  negligible.  See.  Busse[l).  Equations  (2.1)  and 
(2.2)  are  two  equations  in  three  unknowns,  namely  u,o  and  p. 
To  solve  numerically,  one  must  eliminate  the  pressure  term.  To 
do  so,  multiply  equation  (2.1)  by  r  and  integrate  with  respect  to 
r  from r  =  Otor  =  r0to obtain 

1  dp  2  3  p  ,  ,  2 vdu  lf.  . 

pdx  r%dzJo  To  dr 

which  when  substituted  in  (2.1)  leads  to 

du  du  2  d  (*•  7,  2*>dti  .(Pu  l  du  .  . 

u— +V-J-  =  -r—  /  ru7dr - —  p  +t(— +-— ).  (2.5) 

dz  dr  rldzJo  rodr  ' dr 1  rdr 

The  details  of  the  above  procedure  will  be  presented  else¬ 
where. 

Using  the  dimensionless  parameters 

u  _  vr0  _  r  ,  _  xu  .  . 

u  =  — ,v= — ,r  =  —  and  x  =  — ,  (2  6) 

u4  v  ro  r$u4 

equations  (2.2), (23)  and  (2.5)  can  be  written  in  the  dimension 
less  form 

subject  to  the  boundary  conditions, 

i)  fi(0,r)  =  C(0,f)  =  0, 
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Abstract  -  In  this  paper  an  optimization 
procedure,  based  on  the  simulated  annealing 
strategy,  is  proposed  in  order  to  assure  a 
suitable  solution  to  the  problen  of  built-in 
systems  with  assigned  invariant  parameters. 
The  proposed  approach  is  very  attractive  due 
to  its  capability  to  avoid  local  minima. 

I  -  INTRODUCTION' 

The  possibility  of  assigning,  in  open  loop 
scheme,  for  SISO  systems,  both  the  eigenvalues 
and  other  invariant  quantities,  characterizing 
some  structural  properties,  such  as  the 
stability,  controllability  and  observability, 
has  been  previously  studied  by  the  authors 
{1],  The  sane  problem  has  been  also 
investigated  for  HIKO  systems  (2);  in  this 
second  case  sore  parameters  can  be  freely 
chosen  making  the  problem  less  hard.  In  any 
case  the  existence  of  the  problem  solution 
cannot  be  proved. 

It  was  proved  [1]  that,  for  SISO  systems,  if 
an  asymptotically  stable  system,  having 
distinct  real  eigenvalues  A2, ...,  Aft  and 
singular  values  <rf,  <r2, ...,  <rn  is  searched, 
than  it  is  necessary  to  solve  a  nonlinear 
algebraic  equation  system  respect  to  the 
system  residues.  In  this  paper  a  numerical 
approach,  founded  on  a  nonlinear  optimization 
algorithm,  i.  e.  the  simulated  annealing 
algorithm,  is  proposed  in  order  to  find  a 
suitable  solution  of  the  introduced  nonlinear 
algebraic  equation  system  and  therefore  to 
"solve"  the  problen  of  built-in  systems  with 
assigned  invariants  quantities.  The  goodness 
of  the  proposed  approach  is  tested  by 
considering  some  examples. 

II  -  PRELIMINARIES 


characterize  the  system  signature  matrix, 
described  as  it  follows: 

S  -  diag  (s1#s2, ..,si,..,sJ1) ,  (2) 

where  s.»?l,  since  s^  is  the  generical  sign  of 
the  eigenvalue. 

Proposition  f 11 : 

Given  an  asymptotically  stable  system  with 
distinct  and  teal  eigenvalues,  the  following 
relations  hold: 

det(sk<rk-I  -  R-Q)  «  0  (k  »l,..,n)  (3) 

where  R  “  diag(r1,r2,..,r(j)  is  a  diagonal 
matrix  containing  the  unknown  system  residues 
r  ,  1  is  the  identity  matrix  while  each  term 

of  the  Q  matrix  is  given  by  the  following 
expression: 

q(J  -  -<V(VV>  <4) 

being  the  generic  eigenvalue  of  the  system. 

The  algebraic  equation  systen  (3)  relates  the 
system  residues  with  its  structural 
parameters;  then,  if  at  least  a  set  values  Rf 
can  be  found,  the  proposed  problem  can  be 
solved  and  it  is  possible  to  assign  both 
system  structural  parameters,  e.g.  eigenvalues 
and  singular  values. 

An  analogous  formalization  of  the  problen  can 
be  introduced  in  the  case  of  placement  of 
eigenvalues  and  characteristic  values  by  using 
the  Cross-Riccati  equation  studied  in  (4).  The 
equations  (3)  lead  to  a  nonlinear  algebraic 
equation  systen  of  the  form: 


In  this  section  some 
preliminaries  are  presented. 


Theorem  f 31 ; 


mathematical 


F«v-,S.  ■?, 


<yl,y2,..,n) 


Let  us  consider  an  asymptotically  stable  SISO 

system  S  (A  c  Rwn,  B,  CTe  if*1)  in  minimal 
form,  for  such  a  system  it  is  possible  to 
introduce  the  Cross-Granian  equation  as  it 
follows: 

Wco-A  +  A*W«  »  — B'C  (1) 

The  eigenvalues  (i-l,..,n)  of  the  matrix 

We*  are  real  invariants  of  the  system,  whose 
nodula  are  equal  to  the  singular  values  <rl 

(i-l,..,n).  Moreover  the  sign  of  the  W«« 
eigenvalues  are  other  invariants'  that 


where  s.,s.  ..,s  lt  represent  all  the 

combinations  without  repetition  of  n-j+1 
integers.  The  solution  of  this  systen  depends 
on  the  matrix  signature. 

Ill  -  OPTIMIZATION  STRATEGY 

Referring  to  equation  (3),  our  goal  can  be 
stated  as  it  follows: 

given  st,  A,  (i-l,..,n),  let  us  determine 
the  set  of  system  residues  R,»(rif  r2,..,rn). 
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belonging  to  an  assigned  domain,  such  that  the 
R-Q  matrix  has  eigenvalues  e%  as  close  as 

possible  to  the  desired  invariants  p  *s -0  • 
Therefore  the  cost-function,  that  must  be 
minimized,  has  been  stated  as  it  follows: 

n  2 

E(R,}  -  Y.  (  a*  -  *'<?*  )  <5) 

k*i 


where: 

0k  is  the  k-th  eigenvalue  of  the  R-Q 
matrix  corresponding  to-  each  considered 


R-(rt,r2,..,ra)  set; 

sk  and  <rk  are  respectively  the  k-th 
imposed  signature  and  singular  value. 


The  cost-function  absolute  minimum  is  known  to 
be  ’zero  only  in  the  case  that  the  original 
problem  adaits  a  solution;  when  it  does  not 
exist  we  hope  to  find  the  systea  that 
guarantees  the  global  minimum  of  the 
cost-function  (5).  To  minimize  EfRj)  the 

S inula ted  Annealing  optiaization  algoritha  (5] 
has  been  adopted.  The  use  of  the  "Annealing 
Algoritha"  is  due  to  avoid  local  niniaa  that 
represent  a  heavy  drawback  in  our  search;  in 
fact,  differently  froa  the  classical 
optiaization  algor ithas,  the  proposed  one 
accepts  not  only  those  solutions  that  produce 
negative  changes  AE  in  the  cost-function,  but 
are  taken  into  account  also  the  ones  giving 
positive  changes  that  are  accepted  with  a 
probability  P  given  by: 


P  »  exp(-AE/T) , 


where  T  is  a  control  parameter  called 
"effective  temperature" .  Froa  this  latter 
consideration  it  follows  the  algoritha  ability 
to  go,  trough  the  local  ainiaa  looking  for  a 
better  result. 

Let  observe  that  the  stated  cost-function 
gives  the  square  of  the  euclidean  distance 
between  the  desired  set  of  singular  values  and 
the  found  one.  For  each  choice  o(  the  systea 
eigenvalues,  signature  and  singular  value 
sets,  no  a  priori  condition  ensures  us  about 
the  solution  existence,  however  the  aininua  of 
the  cost-function  found  by  the  optiaization 
algoritha  gives  us  the  systea  that  best 
approaches  the  desired  one. 


IV  -  EXAMPLES 


IV. 1  -  Exanplo  1 


Let  us  try  to  synthesize  a  system  with  the 
following  poles,  singular  values  and 
signatures: 

X—  [  1000  550  120  100  50  10  ) 

O-(308.5  69.9  8.6  0.2  0.17  0.02) 

3-(  -11-11-11) 

By  applying  the  simulated  annealing  in  order 
to  minimize  the  cost-function  (5) ,  the 

following  residues  R*  have  been  found: 


R*  =  [6501.6  -17875.4  -5228.7 
4611.6  -36652.3  2624.2) 

Moreover  the  equation  (5)  assumes  the  final 

value  £>i»al0’*.  Such  a  value  implies  that  the 
stated. problem  allows  a  solution. 


IV. 2  -  Example  2 


In  this  second  example  we  search  for  a  systea 
with  the  following  eigenvalues,  singular 
values  and  signatures: 


A  -  -  [  9  7  5  3  2  1) 
am  [  90  80  70  60  50  40  ) 

S  -  (-11-11-11). 

Due  to  the  fact  that  the  cost-function  value 
found  in  this-  case  is  E»in»70S9.9,  it  is 
possible  to  assert  that  a  solution,  to  the 
stated  problem,  does  not  exist.  However  the 
simulated  annealing  furnishes  the  system 
closest  to  the  searched  one. 

The  optimization  algorithm  gives  the  system 
characterized  by  the  assigned  eigenvalues  and 
with  the  following  residues: 

R#  -  (  -166021.0  299194.4  -178760.344 

50005.7  -11435.3  426.9  ). 


VI  -  CONCLUSIONS 


In  this  paper  it  has  been  proposed  a  numerical 
solution  of  the  problen  of  assigning  system 
structural  properties  as  eigenvalues  and 
singular  values.  The  employed  optimization 
algorithm  is  based  on  the  simulated  annealing 
strategy. 

The  examples  reported  show  the  effectiveness 
of  the  proposed  approach. 

The  procedure  could  be  introduced  for 
assigning  other  sets  of  invarants  such  as,  for 
example,  the  eigenvalues  and  the 
characteristic  values. 
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ESTIMATION  OF  PIECEWISE  CONSTANT  ROTATIONAL  MOTION  IN  THE  PLANE 
by  Coeit  Olmsted 

Alaska  SAB.  Facility,  Geophysical  Institute,  University  of  Alaska  Fairbanks 


Abstract.  Sea  jee  motion  applications  raise  the 
problem  of  fitting  a  piecewise  constant  rotational  model 
to  a  set  of  displacement  vectors.  The  problem  is  twofold. 
Estimating  a  rotation  from  data  and  partitioning  the 
data  into  rotationally  constant  classes. 

To  solve  this  image  classification  problem  we  use  a 
reciprocal  polar  transformation  and  a  linear  clustering 
algorithm. 

Introduction.  Remote  sensing  of  sea  ice  describes  its  mo¬ 
rion  by  a  two  dimensional  displacement  field  {Kwok  et  al.  1900). 
Inspection  (see  Figure  1 J  supports  the  assumption  of  discrete  el¬ 
ements,  each  c£  which  rotates  rigidly  in  the  plane  about  a  fixed 
center  with  a  constant  angular  \elodty  (Olmsted  1991]-  The 
model  is  rotationally  piecewise  constant  with  curvilinear  discon¬ 
tinuities.  The  motion  of  each  element  can  be  specified  by  three 
parameters,  the  coordinates  of  the  center  mid  the  angular  speed. 


Figure  1.  Displacement  field  for  iec  motion  in  the 
Beaufort  Sea  between  SEASAT  revolutions  1439  and 
1482.  Arrows  on  some  vectors  show  the  direction.  The 
remaining  displacements  follow  similar  streamlines.  The 
tails  arc  on  a  regular  5  km  grid. 

The  Problem.  Wc  need  to  estimate  the  parameters  from 
a  set  of  rotational  displacement  data  with  errors.  This  problem 
has  been  well  studied  (Watson  1988].  In  our  case  it  is  compli* 
catcd  by  the  existence  of  several  rotations  the  number  of  which  is 
also  an  unknown.  Wc  must  first  partition  the  data  into  a  number 
of  elements,  then  estimate  the  constant  rotation  of  each.  The 
latter  problem  is  relatively  easy  (See  Appendix  I).  The  former 
is  a  more  difficult  matter  of  image  classification.  A  measure  of 
the  goodness  of  the  classification  is  given  by  the  total  variation 
from  constancy  of  all  classes. 

Brute  force  optimization  of  the  total  variation  over  all  par¬ 
titions  is  clearly  too  large  a  search.  It  is  trivial  that  the  optimal 
partition  will  be  the  discrete  one. in  which  each  data  point  is 
a  separate  class.  Then  the  total  variation  is  zero  but  no  infor¬ 
mation  has  been  gamed.  Restricting  the  number  m  of  classes 
still  leaves  non-contiguous  elements.  Thus  only  geometrical  di¬ 
sections  are  acceptable.  This  suggests  using  cluster  analysis  in 
some  space  including  the  plane  of  the  motion. 

Analysis.  For  instance  in  the  case  m  =  2  uc  can  consider 
partitions  resulting  from  bisections  of  the  plane  by  straight  lines 
Parameterize  a  line  with  two  real  numbers  (s,<),  say  slope  and 
intercept.  Then  for  each  such  pair  the  data  will  be  divided  io*o 


two  sets.  The  sums  of  squares  of  deviations  from  estimated 
best  constant  rotation  may  be  calculated  for  each  set  and  then 
optimized  with  respect  to  s  and  £  (Thorndike  1989].  This  is  a 
discrete  objective  function  which  jumps  when  the  bisecting  line 
crosses  the  location  of  a  data  point.  Each  additional,  bisecting 
line  adds  2  dimensions  to  the  search  space,  greatly  increasing 
the  complexity.  This  model  is  also  restricted  to  straight  line 
discontinuities. 

Applying  cluster  analysis  directly  to  the  data  space  encoun¬ 
ters  the  difficulty  that  a  single  displacement  datum  (Ax,  Ay)  at 
(x,y)  docs  not  uniquely  determine  a  rotation.  Any  pair  of  data 
determines  a  center  of  rotation  at  the  intersection  of  the  per¬ 
pendicular  bisectors  to  the  displacement  segments.  However, 
the  displacements  may  not  be  compatible  with  the  center.  To 
insure  compatibility,  cluster  only  intersections  (xo,yo)  of  pairs 
of  bisectors  for  data  for  which  the  displacements  are  roughly 
equal,  i  e.,  a*  ~  a*,  where 

a, 

There  are  n(n+ 1)/2  pairs  implying  quadratic  complexity.  Then 
cluster  in  the  velocity  3  space  (xo,yo,a).  To  use  contiguity 
information,  however,  we  must  include  the  coordinates  of  each 
datum,  which  raises  the  dimension  of  the  search  space  to  5,  an 
impractical  le\el  for  the  amount  of  data  and  compute  power 
available. 

Alternatively  associate  each  perpendicular  bisector  with  a 
point  in  such  a  way  that  images  of  lines  which  intersect  at  a 
common  point  have  a  distinguished  configuration  in  the  image 
plane.  One  such  method  is  reciprocal  polars  (Brace well  1990]. 
Parameterizing  lines  by  slope  and  intercept,  (s,<),  note  that  if  a 
family  of  lines  are  concurrent  at  (xo,yo).  then  for  all  s  and  t  we 
have  y0  =  szq  HM,  i  c.,  s  and  t  are  linearly  related  and  the  slope 
and  intercept  of  that  line  in  the  s-£  plane  depend  on  (x<>,yo) 
uniquely.  This  relationship  is  preserved  by  any  affine  transform 
of  the  s-t  plane.  Thus  reciprocal  polar  points  of  perpendicular 
bisectors  of  data  from  distinct  rotations  will  cluster  along  dis¬ 
tinct  lines  in  the  s-t  plane  and  this  plane  may  be  scaled  and 
sheared  so  as  to  enhance  the  separation  of  the  lines.  Figure  2 
shows  the  reciprocal  polars  of  the  data  of  Figure  1 
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Figure  2.  Scatter  plot  of  the  reciprocal  polar  points  of 
all  perpendicular  bisectors  of  displacements  from  Fig¬ 
ure  1  These  fall  into  six  natural  groups  as  labeled  1 
through  6  Where  geometrical  clustering  is  ambiguous, 
addition^  grouping  information  can  be  got  "by  consid¬ 
ering  contiguity  in  the  vector  field,  and  the  size  of  the 
displacements.  Thus  the  outlier  (marked  OL)  on  the  far 
right  is  compatible  with  the  center  of  Group  4,  but  its 
magnitude  is  too  large.  Similarly  the  member  of  Group 
5  whose  reciprocal  polar  clusters  with  Group  o  13  too 
small  for  that  displacement.  The  units  on  the  axes  are 
pixel  numbers  scaled  for  clarity  of  the  plot. 


/(*.- *o)2  +  (y.~yo)2 


1930 


Given  a  cluster,  linear  regression  will  estimate  the  center  co¬ 
ordinates.  We  must  now,  however,  search  for  elongated  clusters 
rather  than  round  ones,  a  nonstandard  image  analysis  problem. 
One  might  hope  to  2nd  a  transformation  of  the  plane  which 
would  map  distinct  lines  onto  distinct  points  and  thus  simplify 
the  clustering.  It  is  elementary,  however,  to  show  that  the  only 
transformations  of  the  plane  which  map  all  lines  onto  points 
are  the  constant  ones.  Thus  there  is  no  transform  which  in¬ 
jects  lines  onto  points  and  ,'ra  are  stuck  with  elongated  dusters. 
This  problem  can  be  attacked  with  specialized  algorithms  such 
as  in  Appendix  II.  The  result  is  a  determination  of  sets  of  dis¬ 
placement  segments  which  have  a  common  center  as  shown  in 
figure  3.  ' 


Figure  3.  The  vectors  of  Figure  1  grouped  by  dot¬ 
ted  lines  into  the  dusters  of  Figure  2  and  labeled  with 
corresponding  numerals  in  cirdcs. 


APPENDIX  I 

For  a  set  of  displacements  xT  -♦  y?  in  the  plane,  assume 
they  are  the  result  of  a  rotation  A(t?)  sampled  with  error  and 
estimate  the  center  ~f>  ;=(xo,y<>)  and  angular  displacement  9. 

The  first  of  two  regression  methods  minimizes  the  total 
square  error, 

S(z o,yo,0)  ==  £  II  y?  -  7  -  '1(51  -  7)  ||’, 

by  setting  the  three  partial  derivatives  of  S  to  zero.  This  results 
in  a  3  x  3  system  of  equations  which  is  quadratic  in  cot  §  with 
explicit  solution  cot  f  =  p  ±  y/TTW  where 
^ ‘hCyin 
c*i>j  —  c*»»» 

The  terms  defining  fi  are  the  covariances  between  the  various 
coordinates  of  the  displacement  endpoints  xj  =:  (xj yj,),  y?  =: 
(zji.Stti).  The  center  coordinates  are  then  found  in  terms  of  the 
centroid  of  the  displacement  endpoints,  the  mean  displacement 
lengths  and  cot  f . 

The  second  method  estimates  the  center  as  the  point  with 
least  total  square  distance  from  the  perpendicular  bisectors  of 
the  displacement  segments.  If  the  perpendicular  bisectors  are 
parameterized  a3  lines  c,x+s,y  a  p„  c,  =»  l,  then  the  total 
square  distance  is  S(xo,yo)  =  E;(c,xo  +  Wo  -  pi)7-  Setting 
derivatives  to  zero  and  solving  for  (xo,yo)  gives 

*«  =  (E3?Ep'c>  - 

y o  =  (Ec?Ep.».  -  E  w<'Ep,ci)/D 


w^lerc'^“  E*?  Es?_(E3'c«)*  =  Ei<,s>nJ7*j  and  7ij  is  the 
an^e  between  lines  t  and  j.  The  angular  displacement  8  can 
then  be  estimated  as  the  mean  of  the  angular  displacements  6, 
of  each  data  displacement  around  the  center  (xo,yj)- 

Both  of  these  methods  appear  to  be  numerically  stable  for 
small  angular  displacements  even  for  the  case  of  distant  centers 
where  the  displacements  are  nearly  parallel. 


APPENDIX  II 


For  a  set  of  data  points  in  the  plane  S  =  {P,  :=  (x,,y,)}J,ml 
which  are  distributed  -with  random  error  along  a  set  of  lines 
{y  =  a,x  -J*  estimate  N  and  cr„6,. 


Rotating  Swath  Algorithm 

1.  Translate  the  origin  to  (xmiym)  £  5,  a  point  closest  to  the 

centroid  of  5 

2.  For  a  given  half  width  w  and  a  slope  angle  let  F{to,$)  be 

the  number  of  data  points  falling  within  the  swath  of  width 
2 in  and  slope  tan<£  centered  at  (xm,ym)<  Compute  this 
function  as  a  two  dimensional  array  for  ^  =  0, 175®,  5°  and 
w  =  Ain,  Wj4,&io  where  W  :=  diamS  and  Aui  ss  W/50. 

3.  Apply  discrete  optimization  to  this  array  of  integers  to  find 

local  maxima  of  F.  One  such  should  occur  fairly  near  the 
low  width  side  of  the  u?,  4>  rectangle  for  each  linear  cluster 
of  points. 

4.  If  there  is  a  distinct  global  maximum  at,  say  let 

Sm  be  the  set  of  points  falling  within  this  swath  (of  width 
2wm  and  slope  tan^f  centered  at  (xm,ym)).  Consider  it 
as  a  seed  population  for  the  dominant  linear  feature  of  the 
data  set. 

5.  Do  linear  regression  on  Sm  to  get  a  slope  and  intercept  of 

the  seed  population  cluster  line.  Add  to  this  population  all 
points  of  S  for  which  the  deviation  from  the  line  docs  not 
exceed  the  maximum  deviation  of  the  seed  population  Re¬ 
compute  the  linear  regression  on  the  expanded  population 
and  discard  any  outliers.  Then  iterate  from  the  expansion 
step  until  no  new  points  adhere  to  the  cluster 

6.  Remove  the  cluster  obtained  in  Step  5  from  the  general  pop¬ 

ulation.  Then  start  over  at  Step  1.  Iterate  until  all  points 
are  exhausted  or  no  new  clusters  arc  produced. 

7.  To  check,  vary  the  initial  point  (xn,,yM)  and  see  if  the  same 

clusters  appear. 
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This  article  describes  the  tranfer  to  an  industrial  site  of  quarry  auto¬ 
mation  work  done  in  the  laboratory.  It  concerns  the  static  simulation 
of  a  French  quarry.  The  models  of  the  devices  are  described  first,  toge¬ 
ther  with  the  methods  used  to  solve  the  problems  inherent  in  the  simu¬ 
lation  of  the  processes  (perturbations,  search  for  equilibrium  point). 
The  simulation  is  described  in  detail,  together  with  its  use  by  an  NLP 
program  to  find  the  optimum  settings. 

I- INTRODUCTION 

Until  the  beginning  of  the  1980s,  quany  automation  work  was  limited 
to  a  combination  of  logical  automation  functions  and  local  regula¬ 
tions.  But  in  1983  the  LCPC  [4]  initiated  a  series  of  studies  intended  to 
optimise  the  operation  of  installations.  Part  of  this  work  consisted  of 
study  of  models  of  the  equipments  and  their  simulation  (2),  (3J.  This 
article  describes,  on  the  occasion  of  the  transfer  of  this  work  to  an  in¬ 
dustrial  site,  the  difficulties  inherent  in  the  simulation  of  this  type  of 
process  and  proposes  solutions.  The  various  points  amplified  are  the 
description  of  the  process,  its  modelling  and  simulation,  and  the  sear- 
ch^  for  the  operating  optimum  by  nonlinear  programming  (NLP), 
using  the  simulator. 


H. THE  PROCESS 


The  installation  consists  of  buffer  silos,  crushers,  screens,  and 
conveyors.  By  acting  on  the  number  of  devices-Sn  operation  and  the 
routings,  several  possible  operating  diagrams  can  be  obtained.  One 
important  characteristic  of  this  system  is  the  recycling  to  silo  B  of  a 
part  of  the  ouput  of  the  three  first  screens :  at  the  equilibrium  regime, 
the  levels  of  the  sil^a  are  constant,  and  a  variation  of  the  setting  of  cru- 
schers  2,  3,  and  4  can  make  the  level  of  silo  B  unstable.  The  control 
conditions  of  this  system  are  i  the  infeed  flowrate  Qi  and  the  gaps  of 
crusher#  gl,  g2,  g3  and  g4.  The  output  conditions  are  the  finished-pro- 
duct  flowrates  Q^,  Q2/4.  Q4/6,  Q^io»  Qi<yi4*  The  perturbations  are 
variations  of  grading  and  type  of  infeed  material  and  of  wear  of  the 
devices. 


HI  •  MODELLING  AND  IDENTIFICATION 

The  models  of  the  devices  are  static  models.  Since  the  Influence  of  the 
conveyors  takes  the  form  of  simple  pure  delays,  they  have  not  been 
modelled. 

3-1  Model  of  crusher 

The  perturbations  are  the  origin  not  only  of  the  instantaneous  errou , 
on  the  outputs  predicted  by  the  model,  but  also  of  the  non-stationarities 
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of  the  system.  The  former;n©t  mesurable,  are  caused  by  the  materials, 
and  the  latter  by  the  metallurgical  qualities  of  the  wearing  parts,  re¬ 
vealed  by.  a  decline  in  the  measured  power  over  time  It  is  difficult  to 
incorporate  an  estimate  of  wear,  which  varies  condiderably  from  one 
set  of  parts  to  another ,  in  the  model.  It  affects  the  value  of  the  opera¬ 
ting  point  (W,  Q).  This  is  what  led  to  a  model  based  on  small  diffe¬ 
rences.  The  tests  consisted  of  setting  to  an  operating  point,  increasing 
and  decreasing  the  clearances  of  the  devices,  and  recording  the  out- 
feed  curves.  This  gives  linear  functions. 


The  cruscher  model,  derived  from  the  results  of  the  campaign  of 
tests,  is  a  linear  variation  model. 

(atUrUli,  W«»r)  4*  »»rUU»M  « f 
W  IniUUvS 

Q  vector  *f*»t>ol  fto  wr,u» 

A  *.[QCOC.Qcn«.qoio.QV«, 

QO'i.Q'M) 
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To  ensure  the  coherence  of  the  models,  the  following  are  imposed : 
w*,w«  W.**KW%W0.4W) 


^2*0 

or  <5(3/14 

<5wwlVu  Q0^-»'»»».Q(yii.Qe«>**oooA*> 

vrfOi  W*  tit  ao-tae  pow*t  of  U>«  trwirt 

The  identification  is  easy :  the  outfeed  values  are  recorded  for  two  dif 
ferent  infeed  values. 


t\+wm) 

From  this  are  deduced :  .  . 

*  Ij’ii 

3-2  Model  of  screen 


Whithen’s  knowledge  model  (1972)  (l)  was  used  .  the  probability  of  a 
particle  of  size  x  not  passing  through  a  screen  of  mesh  width  h  and 
wire  diameter  d  is  considered : 

1L  »  -iLi. 

«T  «x>*(KtaBf(k*4BW  »ixS* 

cw*l  «xii 
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The  parameter  m  represents  the  effectiveness  of  screening  ,  it  de 
pends  on  the  infeed  flowrate.  Function  C  (x)  is  called  the  classication 
ftmetion.  Let  ST  ■*  Uj.  $2  .-.^n)  be  the  vector  of  the  particle  sizes.  Here 
ST  B  (30 14  10  6  4  2),  The  mean  probability  for  a  particle  to  be  oversize  is 
calculated  by : 

a-g-^-1  C(x)<t»  for  i  front  to  tv.  | 

1  *‘4.1 

q»*jL|  c(x)4x 

TSU  44m  •  tutrix  C  ■ 
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This  matrix  may  be  used  to  calculate  the  throughputs  of  materials  in 
each  grading  class  for  oversize  and  undersize. 


2i=CQe 

flB-a-ofit 


Ffc.  <  ScS«jmB<  4Ufrui  ef  ktm*  boM 


Where  I  is  the  identity  matrix  and  fie,  fir,  2a the  infeed,  oversize  and 
undersize  throughput  vectors.  In  the  absence  of  precise  measure¬ 
ments,  the  parameter  m  has  been  taken  equal  to  20,  a  realistic  value 

IV  -  THE  SIMULATION 

The  system  consists  of  a  set  of  machines  interconnected  by  conveyors. 
The  output  variables  of  one  constitute  the  inputs  of  another.  They  are 
calculated  using  models  of  the  devices  in  the  order  upstream-downs¬ 
tream.  However,  because  of  the  recycling  to  silo  B  (see  Fig.  1),  the  out¬ 
put  of  crushers  2, 3,  and  4  acts  on  their  own  infeed.  This  is  why  the 
calculation  requires  several  iterations  until  the  power,  flowrate,  and 
grading  variables  cease  to  change  as  Judged  by  some  criterion.  There 
is  therefore  a  fundamental  problem  here  in  the  simulation  df  these 
processes,  the  problem  of  convergence,  a  problem  that  is  still  unsol¬ 
ved.  The  steady  state  has  been  simulated  as  shown  by  the  following 
flow  sheet  (Fig.  6),  usable  in  the  general  case  (the  devices  are  numbe¬ 
red  in  upstream-downstream  order). 

For  a  particular  set  of  control  values  (gl,  g2,  g3,  g4,  Qi),  in  the  general 
case,  the  infeed  flowrates  of  the  silos  ars  not  equal  to  their  output  flo¬ 
wrates,  and  the  levels,  are  unstable.  To  locate  an  equilibrium  point 
(level  of  silos  constant),  the  sittings  of  crusher  1  and  the  infeed  flo¬ 
wrate  of  the  installation  are  calculated  as  a  function  of  the  settings  of 
crushers  2,  3,  and  4  by  an  iterative  method.  This  problem  of  equili¬ 
brium  in  a  process  involving  flows  of  materials  is  common.  The  me¬ 
thod  of  attaining  equilibrium  used  here  is  shown  in  figure  6. 


V- OPTIMIZATION 


Software  for  optimization  of  the  installation  using  the  simulator  has 
been  developed  to  determine  thi  optimum  settings  to  be  used  to  maxi¬ 
mize  or  minimize  the  production  of  one  class  of  aggregates  or  another. 


| — >  Qe»— 
j  CUwMwflj 


The  optimization,  by  nonlinear  programming,  uses  an  augmented 
Lagrangi&n  method.  The  Lagrangian  is  minimized  without 
constraint  by  a  method  of  Newton  The  criterion  maximized  is  the 
sum  of  the  throughputs  of  the  finished  products.  Contraints  are  used  to 
favour  one  type  of  output  or  another :  lower  and  upper  limits  are  impo¬ 
sed  on  the  throughputs  of  each  finished  products. 

(3o%sicms<o% 
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The  optimization  variables  are  the  settings  of  crushers  2, 3 ,  and  4 
VI- EXAMPLE 

The  user  has  here  sought  to  obtain  mainly  10/14  and  6/10  while  mini 
mixing  the  quantities  of  2/4  and  0/2  produced  (crushers  525,  535,  625, 
645  o  crusher  3, 4,2, 1  •  D  s  30  mm). 

Iteration  number :  224. 

Clearances  (discrete  values)  525, 535, 625, 645  *5  -2  -4  -5. 

Infeed  flowrate :  743  Tph,  required  flowrate :  1  to  1000  Tph, 
Throughputs  of  crushers  625, 535, 625  645  (Tph)  =  241  273  282  584 
Powers  of  crushers  525, 535, 625  645,  (KW)  =  230  166  89  133 

Crushers  gradings  (%) 


0/D 

0/14  < 

0/10  0/6 

■0/4 

0/2 

G525'  100.0 

762 

56.0  43.2 

395 

32.8 

G635  100.0 

71.2 

52.1  40.8 

37.5 

29.0 

G625  100.0 

985 

51.6  2L2 

130 

5.1 

G845  1000 

538 

380  22.3 

14.2 

65 

Finiiliticrojatta 

Class 

Throughput 

Percentage 

Percentage 

(Tph) 

(%) 

desired  (%) 

10/14 

361 

49 

30  to  50 

6/10 

171 

23 

30  to  50 

W 

78 

10 

10  to  20 

1/4 

74 

10 

StolO 

<V2 

60 

8 

0to5 

vn- CONCLUSION 


In  this  article,  on  the  occasion  of  the  work  done  on  an  industrial  case, 
we  have  attempted  to  show  the  difficulties  inherent  in  the  simulation 
of  quarry  installations  and  to  propose  some  solutions. 

The  main  difficulties  encountered  are  in  taking  Into  account  pertur¬ 
bations  in  the  materials,  the  modelling,  and,  in  the  case  of  a  static  si¬ 
mulation,  the  search  for  settings  of  the  devices  such  that  the  flows  of 
materials  are  in  equilibrium,  S.e.  the  levels  of  buffer  stocks  are 
constant. 

The  first  difficulty  was  resolved  by  using  models  of  a  variation  about 
an  operating  point  and  the  second  by  the  use  of  a  corrector  calculating 
the  settings  on  the  basis’of  a  criterion  of  equilibrium. 

This  simulation  work  is  only  a  first  step  towards  the  automation  of 
this  type  of  process. 
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Abstract  In  this  paper,  resonant  nodes  of 
different  waveguides  are  coaputed  with  "edge 
elenents"  .  These  finite  elements  avoid  all 
the  "spurious  nodes",  the  non-physical 
numerical  fields  obtained  from' the  solution 
of  eigenvalue  problems  with  classical  finite 
element  formulations.  Moreover,  comparisons 
with  analytical  results  and  previously 
published, ones  show  the  great  accuracy  of  the 
numerical  technique.  Both  empty  and 
dielectric-loaded  waveguides  are  analyzed. 

I.  INTRODUCTION 

The  finite  element  method  seems  to  be  a 
powerful  tool  for  the  analysis  of  arbitrary 
shaped  waveguides  :  cut-off  frequencies  can 
bo  obtained  as  solutions  from  an  eigenvalue 
problem.  The  most  serious  difficulty  in  these 
studies  is  that  the  computed  solutions  are 
plagued  by  non-physical  (or  "spurious") 
solutions  :  solutions  which  do  not 
automatically  satisfy  the  divergence  free 
condition  implied  by  the  Maxwell's  equations. 
Many  attempts  are  performed  to  circumvent 
these  unwanted  fields  (l)-[3).  It  has  been 
observed  that  discretized  fields  with 
continuous  tangential  components  suppress  the 
"spurious  modes"  (lj  but  no  precise  argument 
was  put  forward  to  explain  the  importance  of 
this  approximation.  Recently,  Bossavit  showed 
the  reason  for  which  "edge  elements"  (a  class 
of  mixed  finite  elements  (4)  (5))  will  not 

generate  "spurious  modes"  (5).  Here,  we  have 

developed  such  a  numerical  approach  (6)  and 
applied  the  technique  for  the  analysis  of 
empty  and  dielectric-loaded  waveguides. 

II.  VARIATIONAL  FORMULATION 

We  deal  with  the  Maxwell's  time  harmonic 
equations  inside  a  2-D  bounded  region  0.  This 
region  contains  lossless  materials  and  is 
surrounded  by  an  ideal  conductor  (o  a  n  *  0) . 

A  weak  formulation  of  the  wave  equation 
for  the  transverse  electric  field  e  ■  (ex,ey) 
in  ft  can  easily  be  derived  : 


rote. rote'  dft  -  ke  2  Jq  €r  e.c'  dft  =  0  (1) 

where  the  values  of  ke  are  the  cut-off 
wavenumbers . 

III.  FINITE  ELEMENT  FORMULATION 

The  region  ft  is  discretized  with 

a  classical  triangular  mesh. 

The  edge  elements  have  the  following 
properties  :  the  degrees  of  freedom  e,  and 
the  trial  functions  w^  are  associated  to  the 
mesh  edges  ;  e,  is  the  circulation  of  "e" 
along  the  edge  "a"  and  w4  can  be  expressed 
as: 

w*  -  A,  gradAj  -  Aj  gradA,  (2) 

where  Aj  and  Aj  are  the  barycentric 
coordinates. 

In  each  triangle,  e  -  (  ex,  ey)  is  : 
ox  -  a,  -  a2y  ;  ey  «  a3  +  a2x  (3) 

where  a,€  R  ;  a26  R  ;  a3€  R 

Substituing  in  (1)  the  discretized  fields 
obtained  with  (3)  for  "e"  and  ,  we 

deduce  a  generalized  algebraic  eigenvalue 
problem  of  the  form  : 

A  U  ■  kj  B  U  (4) 

A  (  the  "stiffness"  matrix)  and  B  (  the 
"mass"  matrix  )  have  dimensions  n#x  nowhere 
nt  is  tne  number  of  edges  in  the  finite 
element  mesh. 

IV.  NUMERICAL  RESULTS 
1.  Empty  waveguides 

A  hollow  rectangular  waveguide  (1x2) 
was  analyzed.  The  number  of  unknown  edges  is 
126.  This  simple  case  gives  spurious  modes 
when  solved  with  classical  finite  elements. 
(3] .  Here,  no  spurious  mode  is  observed.  The 
first  lowest  modes  were  computed  (table  1) 
and  compared  with  analytical  solutions.  The 
relative  error  never  exceeds  0.6%. 
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Modes 

*c 

error(t) 

1.569 

0.05 

TEji 

3.140 

0.03 

3.136 

0.17 

IE,, 

3.506 

0.18 

Table  1  Computed  kcfor  a  rectangular 
vaveguide 

A  similar  calculation  va s  perfo rued  for 
a  hollow  circular  vaveguide  (Radius  :1)  with 
130  unknown  edges.  No  spurious  node  was 
observed.  The  results  are  shown  cn  table  2. 


Modes 

K 

error  (%) 

IE,, 

1.860 

1. 

TEj, 

3.091 

1.2 

TE^ 

3.852 

1.2 

4.273 

1.7 

Table  2  Computed  Jcc  for  a  hollow  vaveguide 

Figure  1  shows  the  distribution  of  the 
electric  field  for  the  T£,j  node. 


Figure  1  Electric  field  for  the  TE„  code 

2.  Dielectric-loaded  vaveguide 

A  rectangular  waveguide  half-filled  with 
dielectric  naterial  (  figure  2)  was  studied. 


A. 


Figure  2  Half  filled  dielectric  vaveguide 

This  example  is  used  in  the  literature 
as  a  test  case  for  calculation  cetnods.  The 
number  of  unknown  edges  is  210.  Spurious 
codes  often  arise  in  this  problen  (2].  Here, 
no  one  occurcd  :  all  the  cocputcd  solutions 
correspond  to  physical  cut-off  values.  Table 
3  exhibits  the  lowest  cocputcd  values  of  kc 
for  different  pcrnittivitics  cf  .  A  conparison 
of  our  results  with  thoso  of  a  previous  work 
(7)  (obtained  by  ceans  of  truncated  series) 
shows  a  good  agreement. 


«- 

Our  Method 

Refer-  £7J 

2. 

2.516 

2.531 

3. 

.  2.140 

2.161 

4. 

1.831 

1.509 

5. 

1.693 

1.729 

Table  3  Lowest  values  of  kc  for  half-filled 
dielectric  waveguide 

V.  COSCICSICS 

"2d ge  elements*  have  been  used  to  scdel 
empty  and  dielectric-loaded  waveguides.  -The 
first  cut-off  frequencies  were  computed  ; 
comparisons  with  analytical  results  values 
and  with  previously  published  ones  shew  the 
efficiency  of  the  method. 

These  elements  avoid  all  the  well'  known 
■spurious  modes"  and  seem  very  premising  for 
the  study  of  more  complicated  problems  in 
high  frequency  applications. 
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Abstract 

Til*  piper  exteais  tie  field  model  lieg  z^roidi  to  tie 
simelatieo  of  enclosure  fire*  to  isclcde  fire-sprlskler 
interact I  ca.  Tie  toloe  fraction  or  Fulerizm-FmleTiaa 
approach  is  ad  to  slallte  tlx  tw  fiuif*.  Tie  tot 
fire  few*  repre*e*t  cot  p&ac  lAile  tie  liquid  water 
particle*  tie  secood.  Tie  wttowtfcal  model  prelected 
nice*  iato  accocsi  tie  tiree  modes  of  Jateractioa 
between  tie  pluses;  drag.  beat-  and  *t*s-tns*fer.  Tie 
ressltlcg  finite-difference  e^atlea  are  sol  red  for 
exist  the  commercially  available  computer  package 
P3XXICS.  which  employs  tie  two-phase  algorithm  IPSA 
(Inter- Phase  Slip  Algorithm).  Coxpar Isocs  between 
riper i  rental  aad  predicted  result *  are  presented. 
Ccrreat  research  Is  directed  towards  Isprovlog  tie 
efficiency  of  both  tie  algorithm  aad  tbeir 
Irpleseatation. 

Introduction 

During  IS  23.  fire  dales  cost  tho  Association  of 
British  Insurers  in  excess  of  £C00  Billion  aad  the 
live*  of  about  1000  people.  Tie  installation  of 
sprinkler  systems  within  occupied  enclosures  has  always 
been  seen  as  an  effective  sens*  of  reducing  fire 
losses.  Fires  in  structures  which  :.'c  protected  by 
sprinkler  systeas  are  often  controlled  aad  extinguished 
before  the  arrival  of  the  local  fire  brigade  resulting 
in  alnjaal  property  damage.  Tbe  loss  of  life  is  also 
greatly  reduced  (1.2]. 

In  order  to  efficiently  coabat  fire  It  Is  necessary  to 
understand  tbe  nature  of  the  Interaction  between  the 
hot  combustion  products  and  the  liquid  water.  This 
enables  fire  engineers  to  optimise  the  design  and 
location  of  sprinkler  devices. 

For  many  year*  physical  experiaentation  into  these 
phenomena  has  been  tbe  main  means  of  lnvestigatlon. 
However,  tbe  amount  of  buzan  and  financial  resources 
required  to  carry  out  a  full-scale  fire  test  with 
coapletely  fitted  enclosures,  such  as  a  room  or 
aircraft  fuselage,  can  be  extresely  expensive  or  even 
iapossible.  Additionally,  it  is  not  always  possible  to 
conduct  enough  fire  tests  to  adequately  deal  with  all 
alternatives,  such  as  the  nature  and  position  of  fire 
sources.  ventilation  alternatives  and  sprinkler 
options.  Xatheaatlcal  aodelling  provides  a  aeans  to 
overcome  these  difficulties. 

Xatheaatlcal  Modelling 

Xatheaatlcal  aodelling  of  fires  and  saoke  spread  in 
enclosures  has  received  a  considerable  aaount  of 
attention.  Field  aodelling  represents  the  most 
sophisticated  aodelling  strategy  available  for  the 
slaulation  of  enclosure  fires.  This  deterministic 
approach  (3,4]  provides  us  with  a  clearer  understanding 
of  the  coup] ex  processes  occurring  within  a  fire 
enclosure.  Studies  of  different  fires  within  a 
specified  coapartnent  geoaetry  can  be  carried  out  with 
ease  and  with  less  expense  than  full-scale  experiments. 
Pield  aodcls  involve  the  nuaerical  solution  of 
recirculating  three-dlaens local  turbulent  buoyant  fluid 
flow  with  heat  and  aass  transfer.  As  a  result  they 
consuae  a  considerable  aaount  of  coaputer  power. 


Fire-sjcimkler  istmetias  also  leads  itself  to  this 
form  of  analysis.  This  problem  Is  considerably  more 
complicated  thaa  the  stra  ght forward  fire  ia  reclomt 
sitsatfea.  There  are  taw  two  physical  phases  which  xstt 
be  incorporated  into  the  over all  mbnaticsl 
description.  These  are  the  gas  phase  isvolvisg  the 
georraJ  fluid  circular  I  co  of  tie  hot  cosbcstioa 
products  aad  the  liquid  phase,  representing  the  water 
droplets  which  have  been  impeded  into  the  fire 
compartment  aad  evaporate.  Pi  vines  studies  by  the 
authors  were  concerned  with  steady-state  s isolation* 
(5.6].  Cere,  tbe  analysis  is  exteaded  to  deal  with  tbe 
transient  nature  of  tbe  phenomena  (Gj, 

The  Mathematical  Description  of  the  Physical  Probiea 
Fire  creates  a  strong  buoyancy  drives  flow  which  gives 
rise  to  large-scale  turbulent  action  controlling  the 
dlffesioa  of  aass  and  nosen t cm.  Water  droplets  injected 
into  this  hostile  environment  Interact  with  the  bat 
gases  by  being  entrained  Into  the  thermal  plume  or  by 
re -direct leg  the  flow  of  the  hot  gases. 

Tbe  independent  variables  used  to  model  this 
fire  sprinkler  icienttjon  phenomena  are  the  width, 
height  and  length;  x.  y  and  z  of  a  cartesian 
co-ordinate  system,  as  well  as  time  t. 

Tbe  15  dependent  variables  requiring  solution  are  the 
six  velocity  cosponents  (Uj.v,.*,)  for  tbe  gas  phase 

and  (u,.v,.w,)  for  tbe  particulate  phase  in  their 
respective  cartesian  direction  (x.y.z).  along  with  the 
pressure  p.  which  is  assumed  to  be  the  same  for  both 
phases.  The  enthalpies  for  the  gas  and  water  phases.  h„ 

aad  hj  respectively,  along  with  the  concentration  of 
water  vapour  within  the  gas  phase,  e.  The  gas  and 
liquid  volume  fractions  r«,  and  rj  arc  solved  for 
Including  tbe  effect  of  evaporation.  The  ’shadow* 
volume  fraction.  rs.  is  the  volume  fraction  in  the 

absence  of  evaporation.  Finally  the  turbo] ence  kinetic 
energy 'and  dissipation  rate  of  the  gaseous  phase  (k.cj. 
Turbulence  in  the  liquid  phase  is  neglected.  The 
shadow  volume  fraction  technique  allows  us  to 
evaluate  the  diminishing  droplet  size  during 
evaporation  (7]. 

Governing  Differential  Equations 

All  these  dependent  variables,  with  the  exception  of 
pressure,  appear  as  the  subject  of  the  differentia] 
equations  of  the  fora: - 

(rlPi©|Miv  orrj  r0  grad  o)  -  rtSj  (1) 

where  o  stands  for  a  general  fluid,  property  such  as 
velocity  or  entnaipy.  r  ami  S  are  diffusion  coefficient 
•?nd  source  terns  respectively,  ar.d  subscript  i  refers 
to -the  phase  In  question  (gaseous  or  participate). 

The  Xass-Conservation  Fount Ion 

The  pressure. variable  is  associated  with  the  continuity 
equation:- 

5^"  (r|P!)»dI»  (r|PjV|)  .  Sj  «> 
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where  m  i*  lie  deultjr  ef  tbe  se. 

The  voleme  fractions  *rt  related  to  ndi  other  by  the 
•space*  sharing  condition :- 

-'f  •  r,  «  l.o  (31 

The  socrce/sisk  ter*  Sj  Is  ibe  rate  of  evaporatioa 
(source  for  the  gas.  si  ax  for  the  liquid). 

JUaHltfV  Ecatleet 

Dae  to  the  eery  suture  of-  the  probien.  l.e.  the 
ieteractica  between  two  phases.  certain  correlations 
oerf  to  be (included  la  the  model  to  close  the  problem. 
These  time-drpeodeat- relations  deal  with  the  interphase 
heat  zad  mass  transfer  sad  the  friction  between  the  gas 
and  the  liquid  phases.  The  assumptions  aade  are  that 
the  gas  and  droplets  are  dispersed  within  the 
control  ^volume  aad  that  the  droplets  arc  spherical.  The 
last  assumption  is  sot  essential  bat  sisplifles  the 
nature  of  the  empirical  Icpet  (6). 

The  Solution  Procedure 

The  above  equations  are  solved  for  csicg  the  coxpater 
program  P90ESICS  (Sj  with  the  ballt-in  equations 
solvers  SSXFLEST  aad  IPSA  (Inter-Phase  Slip  Algorithm). 
The  latter,  a  core  elaborate  solotioa  procedure  sbicb 
is  used  for  multi-phast.  modelling.  Is  able  to  handle 
the  presence  of  two  simultaneously  present  phases 
sharing  a  coxaoa  pressure.  It  evaluates  the  increased 
eLaber  of  governing-  equations  of  the  flow  and  the 
strong  interaction  between  the*,  such  as  interphase 
friction- and  Mass,  as  veil  as  the  space  sharing  of  the 
volcxe  fractions  condition.  For  these  calculations 
PHOEXICS  uses- the  conventional  staggered  grid  approach 
for  solving-finlte-volume  equations  (81. 

Fire-Sprinkler  Simulation 

In  order  to  evaluate  the  validity  of  the  model  outlined 
above.  it  is  necessary  to  cospare  experimental 
fire-sprinkler  data  with  rod el  results.  Two  fire 
scenarios  have  been  investigated.  The  first  concerned  a 
corner  fire  located  in  an  OPEN  office-sized  compartment 
(6.9).  The  slaulatlon  results  agreed  reasonably  well 
with  the  small  aaount  of  experimental  data  available 
for  cosparison  purposes.  The  results  presented  here 
concern  a  bed  fire  within  a  large  CLOSED  hospital  ward. 
The  experimental  results  were  produced  by  the  C.K.  Fire 
Research  Station  (10J. 

The  (ire  compartment  depicted  In  Figure  1  was  a  six  bed 
hospital  room  of  dJeensions  7.325a  by  7.85m  by-  2.7s. 
The  flow  domain  was  fitted  with  a  14x15x11  cartesian 
grid  comprising  of  2310  control  volumes.  The  thernal 
conductivity  and  thickness  of  the  walls,  floor  and 
celling  were  supplied  and  included  In  the  calculations. 
In  addition,  six  lkK  oil  filled  radiators  situated  on  a 
wall  were  modelled  using  a  single  6kK  heat  source  7.85a 
long,  0.3  in  width  and'0:5a  high. 


Figure  1  Hospital  Room  Schematic 

In  the  experiment  the  fire  was  situated  on  a  bed  which 
was  1  35m  wide.  i„75«  in  length  and  stood  0.5m  above 
the  floor.  The  bed-head  was  positioned  at  the  centre  of 


tie  east  wall.  Tie  ijrhilrr  was  located  0.353s  below 
the  ceil  lag  and  O.tKs  frost  tie  east  wall  alocg  t%c 
centre  llae  of  tie  room.  The  water  was  released  at  a 
rate  of  O.S5€xlO-3m*/s  at  an  egle  of  70*  from  tie 
sprisiler  beads  axis  of  symmetry.  This  dispersion 
patters  took  isto  accocst  tie  sprinkler's  deflector 
plate.  It  was  assumed  that  droplets  with  a  cal  fora 
average  di taster  of  In  were  released  with  as  initial 
temperature  of  10'C. 

Tie  room  conditions  prior  to  the  sprinkler  actiratios 
were  obtained  by  modelling  caly  the  fire.  The  results 
fro*  the  single-phase  simulation.  which  lasted  for  -420 
seconds,  were  used  as  laitial  values  for  the  two-phase 
fire-sprinkler  study.  Daring  the  single-phase 
s isolation  the  fire  was  raged  frew  CkX  to  <05df 
according  to  experlsental  data.  The  fire  was  modelled 
as  a  beat  source  ’  occupying  a  fixed  area  cf  0.4Sxx0.5m. 
posit Jored  0.25a  from  the  east  wall.  After  420  seconds 
the  sprinkler  ’was  activated.  During  this  stage,  which 
lasted  for  120 ''seconds,  the  fire  was  maintained  at  a 
constant  4CJdf.  The  simulations  were  performed  with  a  1 
second  time  step. 

Results 

During  the  experiment  gas  temperatures  were  monitored 
at  seven  main  locations.  The  thermocouples  were 
positioned  75mm  below  the  ceiling.  Comparisons  between 
experimental  and  numerical  gas  temperatures  for  two 
sites  are  presented  in  figure  2.  The  thermocouples  were 
located-  between  the  heat  source  and  the  east  wall 
(figure  2-a)  and  in  the  vicinity  jf  the  room's  west  end 
bJgh  corner  (figure  2-b).  These  clearly  indicate  that 
the  correct  trends  in  temperature  variation  have  been 
captured.  Ex peri cental  and  numerical  uncertainties 
concerned  with  the  fire  load  and  the  coarseness  of  the 
numerical  grid  contribute  towards  the  observed 
discrepancies. 


Figure  2  Predicted  and  measured  gas  temps  nt  two 
thermocouple  locations. 

allocated  between  heat  source  and  east  wall 
M located  in  the  vicinity  of  west  end  high  corner 
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Tie  foJS ewleg  tire*  ilepass  *re  uin  tirecrh  tie 
ctfitre  of  tie  roe*  *sd  throrgh  tie  fire  mad  sprJsfeJer 
wercfi.  flfsro  3  *ssd  4  skx  £*t  teiynicw  *ad 
velocity  vectors  before  raJ  120  secoodr  after  sprjekler 
actlratlcn.  These  clearly  sio*r  tie  scatr  la  fcSlch  tie 
spriekler  creates  a  witer  csrtalfl  cccfJalsj  tie  tot 
Cases  to  a  saall  area  of  tie  roc*.  Eveateally  tie  cases 
resote  fre*  tie  fire  *sA  sprJciler  are  cooled  doxa.  As 
caa  be  sees  la  figcre  <*a.  prior  to  sprJ&kler 
actlvatloa  tb*re  exists  ose  larce  re-clrcalatloa  cell 
drlttfl;^^ beat  socrte-  Tw>  sisstes  after  sprinkler 
actl^atloa  (figsre  4-b)  two  major  flows  are  appareat- 
Tbe  first,  gear rated  by  tie  sprinkler  Js  downward*.. 
oeflectlcj  off  tie  bed  aad  alosr  tie  floor.  Tbr  second, 
reaerated  by  tie  fire,  is  aloeg  tie  cel llcr.  These  two 
cerreat  meet  towards  tie  centre  of  tie  roca.  aid! nr  tie 
slxier  aad  coo  Her  process. 


Vector  2.0!  ■/ s 


Figure  4:  Predicted  gras  velocity  vectors 

a)flre  only  (420  sees!  b)fire  and  sprinkler  <540  secs) 

Finally  the  spread  of  the  water  droplets  In  teras  of 

their  volume  fractions  are  shown  In  figure  5  after  120 

seconds. 

A  further  observation  to  emerge  from  these  simulations 
is  that  this  approach  is  very  expensive  in  terms  of 
computing  time.  The  simulations  were  performed  on  a 
Norsk  Data  ND-5900  machine,  roughly  the  equivalent  to  a 
VAX  11/780.  The  calculations  required  75  hours  of  CPU 
time  to  simulate  the  420  seconds  prior  to  sprinkler 
activation.  The  next  120  seconds  consumed  21G  hours. 
These  calculations  were  performed  on  a  relatively 
coarse  grid.  Clearly  If  this  technique  is  to  be  adopted 
as  an  engineering  design  tool  a  means  must  be  found  of 
reducing  this  enormous  computing  effort.  Parallel 
computing  techniques  offer  a  way  of  achieving  this.  At 


Figure  S.  Predicted  volcre  fraction*  of  water 

Thames  Polytechnic  we  have  modified  tbe  Hold  flow 
package  FATVft t>- HA0C3  (li)  to  make  efficient  esc  of 
tbe  mnlti -processor  architects?*  offered  by  the  15X0$ 
Transpater.  Early  work  reveal*  that"  on  test  problems 
involving  cp  to  17.640  computational  cells  and 
utilising  15  transputers,  efficiency  of  over  can  be 
achieved.  On  a  15  processor  transputer  systen  this 
results  in  a  13  fold  speed -cp  (12).  Translating- this 
performance  to  the  above  fire-sprinkler  simulation  and 
using  15  transputers  It  Js  expected  that  tbe  291  hour 
simulation  could  be  performed  in  about  22  hours. 

Conclusion* 

Tbe  results  presented  above  indicate  that 
time -dependent  fire-sprinkler  scenarios  can  be 
siaulated  using  tbe  outlined  volume  fraction  approach. 
Further  studies  are  currently  in  progress  to 
investigate  the  effect  of  flow  rate  and  droplet  size. 
Furthermore,  detailed  grid-refinement  studies  need  to 
be  undertaken.  It  is  envisaged  that  these  and  other 
computationally  intensive  fire  field  modelling 
calculations  will  be  pursued  on  parallel  architecture 
sachines. 
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Abstract 

Polymer  flooding  is  za  oil  recovery  technique,  where  polymer  is 
added  to  water  which  is  injected  into  an  c>3  reservoir  to  increase 
hydrocarbon  recovery.  Polymer  flooding  is  a  complex  physical 
process,  which  can  be  described  by  a  system  of  nonlinear  par¬ 
tial  differential  equations.  A  numerical  method  for  simulating 
polymer  floods  is  presented.  The  method  is  based  on  special 
numerical  techniques,  which  introduce  almost  do  numerical  dif¬ 
fusion  and  which  permit  relatively  large  time  steps.  Locally 
refined  and  dynamically  adapted  grids  are  used  to  accurately 
represent  sharp,  moving  fluid  interfaces  and  to  reduce  computa¬ 
tional  costs. 

Istrgdaciica 

Enhanced  Oil  Recovery  (EOR)  deals  with  cost-effective  tech¬ 
niques  to  increase  oil  production  from  existing  Adds.  An  exam¬ 
ple  of  an  FOR-  technique  is  polymer  flooding  of  Adds  containing 
heavy  oil.  The  efficiency  of  the  recovery  process  can  greatly  be 
improved  by  adding  highly  viscous  polymer  to  the  injected  wa¬ 
ter.  The  polymer  is  costly  and  is  typically  injected  in  slugs. 

Polymer  flooding  is  a  complicated  process  involving  a  fluid 
composed  of  oil,  water  and  polymer.  Water  and  oil  are  assumed 
to  be  immiscible,  while  polymer  mixes  only  with  water.  The 
flow,  of  fluid  through  a  porous,  heterogeneous  reservoir  is  in¬ 
duced  by  the  pressure  gradient  between  injection  and  production 
wells  and  is  affected  by  gravitational  forces.  The  distribution  of 
components  (oil,  water,  polymer)  over  the  reservoir  is  governed 
by  nonlinear  convection  (shock  fronts),  physical  diffusion  (cap¬ 
illary  pressure),  and  adsorption  of  polymer  by  the  rock.  Fluid 
interfaces  may  bf^oxne  unstable  due  to  large  differences  in  vis¬ 
cosity  of  the  different  fluids.  A  fluid  may  form  so-called  menus 
fingers,  which  channel  through  another  fluid  towards  the  pro¬ 
duction  wells.  This  will  clearly  reduce  the  efficiency  of  the  oil 
recovery  process. 

Accurate  simulation  of  polymer  floods  is  quite  difficult  but  of 
vital  importance  to  an  oil  company  wishing  to  optimize  the  j>oly- 
mer  injection  strategy.  Conventional  (finite  difference)  meth¬ 
ods  often  introduce  excessive  numerical  diffusion,  which  badly 
sm cares  sharp  fluid  fronts  and  which  destroys  most  of  the  dy¬ 
namical  behaviour  of  the  simulated  slug  flow.  Our  solution 
method  is  based  on  decoupling  the  equations  during  timestep¬ 
ping  and  treating  each  equation  by  the  most  accurate  and  ef¬ 
ficient  numerical  technique.  All1  calculations  are  performed  on 
locally  refined  grids  in  up  to  three  space  dimensions.  The  grids 
are  dynamically  adapted  during  simulation  so  that  solution  ac¬ 
curacy  is  enhanced  and  computational  costs  are  reduced. 


Mathematical  equations  of  polymer  flooding 

A  mathematical  model  describing  polymer  flooding  of  a  porous 
medium  consists  of  a  coupled  system  of  .nonlinear  partial  differ¬ 
ential  equations.  Let  x  denote  the  position  in  the  reservoir  and 
let  t  be  time.  Pressure  p(x,f)  and  velocity  u(x,  t)  of  the  total 
(incompressible)  fluid  are  governed  by  Darcy  fs  lav 

V.u  =  q(x,<)  ,  u  =  -X(x)A(s,e)(Vp-p(s,e)W(x))  (!) 

with  q  the  well  flow  rate;  fC  the  absolute  rock  permeability;  X  the 
total  mobility  of  the  fluid;  p  a  gravity  term;  and  d  the  reservoir 
depth.  The  fluid  consists  of  two  immiscible  phases:  an  aqueous 
phase  containing  water  and  polymer  and  an  oleic  phase  contain¬ 
ing  only  oiL  Saturation  s(x,  t)  of  the  aqueous  phase  and  concen¬ 
tration  c(x,t)  of  polymer  in  water  satisfy  the  mass  conservation 
equations 

(Kx)|  +  V.(/(.,c)v(x,0)  -  W(D-(*,e)V.)  =  9,(x,l)  (2) 

tfx)  +  V.(c/(3,  c)v(x,  t))-V.m,,c)Vc)  =  c.(x,() 

(3) 

with  ^  the  rock  porosity,  /(s»c)v  the  aqueous  phase  velocity, 
qt  the  water  source  term,  and  c(c)  the  fraction  of  polymer  ad¬ 
sorbed  by  the  porous  rock.  The  tensor  V*  reflects  diffusion  at 
the  oil/water  interface  due  to  capillary  pressure  effects,  while  T* 
models  (molecular)  diffusion  of  polymer  within  water 

Numerical  simulation 


The  total  fluid  velocity  u  and  pressure  p  vary  slowly  with  time 
and  are 'weakly  coupled  to  saturation  s  and  concentration  c. 
Hence,  a  sequential  approach  is  used  to  solve  the  equations 
Each  timestep,  the  equations  arc  separated  into  ah'  elliptic  equa¬ 
tion  (1)  for  velocity  and  pressure,  and  a  system  of  convection- 
dominated  diffusion  equations  (2,3)  for  s  and  c.  Assuming  s 
and  c  are  known  functions  of  x,  equation  (1)  is  discretized  using 
mixed  finite  elements  and  solved  by  an  efficient  multigrid  tech¬ 
nique  (Schmidt  and  Jacobs,  19S8)  Operator-splitting  is  then 
applied  to  the  system  (2,3)  to  deal  separately  with  the  effects  of 
convection  /  adsorption 


<5§  +  V.(Mc)v)  =  +  +  V,(c/(a,c)v)  =  c,. 

91  (4) 

and  diffusion 


-  V.{2>\,,«)V*)  =  0,^-  V.(jy(s,c)Vc)  =  0  (5) 
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Tbe  nazBrcar  system  of  hyperbolic  equatlccs  (4)  ts  solved  by 
tie  method  of  characteristics  combined  wish  ffitaaa  sobers. 
Noelkxaritjes  a  the  system  «  governed  by  the  frcdioncl  funs 
f=rctia=  /(j,c)  =  A, /(A*  -f  A,),  where  A.(*,c)  =  , 

A*(j/  =  (1  — fp/p#  are  the  aqreces  and  oksc  phase  xaobsEiSes 
aad  p,  ,  p,  are  the  viscosities  of  water  (polymer-dependent) 
aad  osL  The  function  /(s,c)  is  S-siaped  (with  a  single  inflection 
past)  aad  has  properties:  /(0,c)  =  0  ,  /(S,c)  =  1  ,  /,(*,c)  >  0 
t  /*(*»<)<  0,  fi»0<s<  1 , 0  <  e  <  1.  Adsorption  of  polymer 
by  the  rode  satisfies  the  conditions:  e(0)  =  0  ,  at(c )  >  0  „ 
o^e)  <0,  for  0  <  c  <  1.  Tbe  velocity  v  can  be  obtained 
directly  from  tbe  total  Said  velocity  u.  The  rndtj-dinsenssonal 
problem  (4)  can  thus  be  reduced  to  cnc-dtmcnsumcl conservation 
laws 

+  »  (sc  -r  c(e)):  -f  (c/(s,c))i  =ajtf$  (6) 

along  streamlines.  These  streamlines  are  defined  as  the  inte¬ 
gral  curves  x(()  of  the  system  of  ordinary  differential  equations 
dxfd£  ~  v(x)/^(x)-  This  system  can  be  solved  very  efficiently 
by  exploiting  the  mixed  finite  dement  representation  of  v. 

If  s  and  c  are  represented  by  constants  on  each  grid  block, 
the  essential  task  is  now  to  solve  a  sequence  of  (1-D)  Riemaan 
problems  along  streamlines.  Writing  (6)  in  quasi -linear  form, 
each  Rieiaann  problem  consists  of  the  hyperbolic  system 

(:HS  //(.«)(:),-(*")  « 

with  two  constant  states  as  initial  data 


(:L  -  {! 


cocntsaesily  discretized  by  mixed  finite  dements  and  solved  by 
maltigrid.  The  coupling  between  s  and  c  and  the  aochaean- 
tscs  in  the  iScfieo  tensor* J>*(s#c),  2y(#,c)  are  handled  by 
applying  iteration. 


The  figure  illustrates  a  two-dimensional  simnlatsca  of  polymer 
flooding  by  the  numerical  method.  Water  (in  gray)  and  poly¬ 
mer  (in  blade)  are  injected  at  the  lower-left  corner  of  tbe  square 
domain  (a  symmetry  element  of  an  oil  reservoir).  Oil  (in  white) 
is  produced  at  the  upper-right  comer.  The  polymer  slug  n  pre¬ 
ceded  and  followed  by  a  water  flush.  The  reserves  is  assumed 
to  be  homogeneous  (Le.,  constant  permeability  and  porosity). 
Local  grid  refinement  is  used  to  increase  the  resolution  of  the 
oil /water  and  water/polymer  interfaces.  At  the  same  tune,  the 
total  number  of  grid  bio  "is  is  reduced.  Fine  grid  blocks  (ob¬ 
tained  by  repeatedly  subdividing  coarser  blocks)  move  along 
with  tbe  polymer  slug  and  the  water  front  as  the  displacement 
process  continues. 


The  Riemanu  problem  (7,8)  has  a  complex  structure,  which  be¬ 
comes  apparent  by  considering  the  eigenvalues  A*  =  /,  ,  A*  = 
Jl(s  -f  ce)  and  eigenvectors  e*  =  (1,0)T  ,  e*  =  (/« Ac  -  A*)T 
of  the  Jacobian  matrix  associated  with  (7).  The  problem  is 
not  strictly  hyperbolic  (i  c.,  A'(s,c)  and  Ac(s,e)  may  coincide 
for  certain  (s,c))  and  not  genuinely  nonlinear  (i  e.,  e’.VA'  and 
ec.VAc  may  change  sign).  It  is  therefore  not  possible  to  apply 
the  (standard)  theory  of  hyperbolic  systems  (Lax,  1972). 

However,.  Johansen  and  Winthcr  (1988)  have  described  the 
construction  of  a  unique  solution  of  the  Riemann  problem  (7,8). 
The  solution  consists  of  constant  states  connected  by  smooth 
segments  (rarefaction  traces)  and  by  discontinuities  (shock  traces). 
The  rarefaction  waves  arc  associated  with  the  integral  curves  of 
the  eigenvectors  e',  ec .  Shock  waves  arc  (physically  meaning¬ 
ful)  discontinuities  satisfying  both  the  Rankin  c- 1  lug  oniot  jump 
condition  and  an  entropy  condition.  The  precise  solution  of  the 
Riemann  problem  consists  of  a  combination  of  (i)  a  c-shock  and 
composite  s- waves  (if  ci  >  e^)  ;  or  (ii)  c-rarcfactlon  waves  and 
composite  s- waves  (if  cl  <  cr)  A  composite  3- wave  is  a  com¬ 
bination  of  an  s-rarcfaction  wave  and  an  s-shock  that  forms  the 
solution  of  the  (scalar)  Bueklcy-Lcvtrtti  problem  Sj+ /(s,c)<  =  0 
for  some  fixed  polymer  concentration  c. 

All  shocks  and. waves  arising  from  the  solutions  of  differ¬ 
ent  Riemann  problems  along  the  streamline  are  followed  during 
the  timestep  of  length  At.  Wave  interactions  are  fully  taken 
into  account  (Gmelig  Meyiing,  1990),  Artificial  smeanng  of  dis¬ 
continuities  is  virtually  eliminated  and  timestep  limitations  of 
CFL-typc  (i.e.,  At/A(  mai(A,,Ae)  <  1 )  are  avoided. 

The  implicit  time-discretization  » (s"+1-a")/At  ,  2^1  a> 
((sc)"'  —  (sc)*)/At  turns  the.paraio/ic  equations  (5)  into  a 
coupled  system  of  elliptic  equations.  These  equations  are  also 


Fig.  Numerical  simulation  of  ’water  prefiush,  polymer  slug  and 
water  post  flush  for  a  homogeneous  reservoir. 
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Abstrcot.  2hc  'kuovledge  of  the  tine-distance 
ovolutioa  of  a  solar  energy,  thermal  storage 
spates  using  rock  beds  crossed' by  an  organic 
fluid  allows  the  acquisition  cf  important 
information  for  the',  opticus,  dimensioning  of 
theraal  solar  energy  tanks*  Because  of  the 
difficulties  arising  during  the  analytieal 
solution  of  the- storage  equations,  no  applied 
the  numerical  solutien  to -the  network  method. 


I.  IltTHODOCTICS’  - 

Tho  physical  system  considered  is  composed 
of  a  vertical  cylinder  containing  the .  stora¬ 
ge  medium  whioh  is  made  up  of  particles,  of  a 
▼cry  small  diaweter'cc  aenpared  to  the.diame- 
ter  of  the  cylinder.  She  flow  of  the''  organic, 
fluid  is  vertical  with  the  warm  part  of  the 
stock  at  the  upper  part  of  the  cylinder  and 
the  cool  part  at  the  base  in  order  to  avoid 
the  natural  convection  phenomena* which  lead 
tc  the -destruction  of  the  thermal  stratifi¬ 
cation.  The  thermal  transfer  equations  for 
physical  system  considered  (storage  equations) 


(1) 


are  reduoed 


will  bo  (Giooucl  and  others,  19.,, 

+  .„(t)  +b^(u-r)=0 

l€§^l+.b2(u-v);=0 

where  “  ®i  ®S  "  % 

u  =  v  =  C£ 

temperatures;  b.  and  b,  are.  parameters  ohars:- 
terlzlnG  the  thermal  properties  of  the 
fluid  and  the  solid;  ^(x.t)  la  temperature 

of  the  organic  fluid;  Ts(x,t)  la  tnnporatura 
of  tho  solid  medium;  Tj_  and  T2  ar9  temperatu- 

raa  at  tho  lower  and  upper  ends  of  the  stock, 
respectively;  w(t)  is  average  flow  velocity 
of  the  fluid. 

Tho  partial-derivative  equations  systea 
obtained  la  a  byperbolio  system.  To  solve  it 
no  use'an  iaprovod  version  of  the  network 
method. 


II.  METHOD  FOR  TH3  NUMERICAL  SOLUTION 
OP  THR  STORAGE  EQUATIONS  SYSTHi. 
COMHJTER.  PROGRAM. 


Goouuso  of  the  difficulties  arising  during 
tho  analytloal.  solution  of  aystea  (5),  no 
applied  tho  nuaorleal  solution  with  tho  net¬ 
work  aothod  (Ixaru,  1979;  Absl  and  others, 
198o). 

The  prlnolple  of  tho  method  oonsists  in  fin¬ 
ding  solutions  along  two  families  of  chareo- 
tarlatioa 

dx  =  ,w  dt  for  the  fluid 
dx  =  0  for  the  aolid 
Syatoa  (1)  turn*  into  sjstoaa  (2)  and  (J) 


(?) 


(dx  =  w  dt  /,•,  (dx  =  0 
[du  w  b,(v-u)dt  {dv  =  b-(u-v)dt 
Uo  suppose  the  portioular  solutions  already 

to™  («(*„.*),  v(t„  - 


v(t,x0)j 


In  order  to  got  easily  proeessable  results, 
the  sets  of  data  have  been  written  in  matrix 
form*  This  procedure  was _ applied  im  order  to 
specify  the  tine-distance  *  distribution  of 
temperature  in  a  condensed  form.  Thus,  the 
variation  of  the  temperature  of  the  liquid 
is  given  in  matrix  fora  by  T,  (I,J)#  where 
the  rows  I  indioate'  the .  behaviour  at  oonsfent 
t=const.  as  a  function  of  distance  x  and 
coluzns  J  the  behaviour  at*  constant  distan¬ 
ces  x=const*  as  a  function  of  time*  This 
approach  was  extended  for  the  functions 
u(x,t)  and  v(x,t),  which  give  matrices  D(I,J) 
and  7(1,  J).  "  . 

The  reduced  temperatures  a  «ftd  v  cam  thus 
be  computed  at  the  points  where  Tc-and  T. 
are  known:  " 


0VCi,d)-®i  TR(i,;0-T1 

u(l,3)=  -TW3J - — ;  v(i,j)=  -S.  _■ - i  (4) 

~2  1  12il 

A  numerical  solution  by  Beans  ,  of  the  net-' 
work  method  requires  sampling  of  input  data* 
In  order  to  get  am  acceptable  accuracy  for 
tho  solution,  this  sampling  should  be  fine 
enough  (16  x.16  points)*  The  coefficients 
A±  (1=  176)  have  also  been  written  in  matrix 


form -A (I  * J)  and  have  been  used  to  compute  the 
lnorenents  du  and  required  by  the  network 
method* 

The  procedure* is  iterative*  Starting  from 
the  given  initial  solution  (u  ,v  ),an  attempt 
is  made  to  verify  the  sya tern ;°if, this  doss 
not  fall  withia  the  required  ll'iits  of  accu¬ 
racy,,  then  du  =  duCAj)  and  dr  =  dv(A/)  are 
oomputed  and  hence:  A 

C“liVx)  =  (n0,vo)  +  (dm,  dv) 
is  the  now  solutiom  with  which  amether 
ottawpt  is  made  to  varlfy  tha  system. 

Tha  quastlties  da  amd  dv  are  alsa  written 
in  matrix  form;  DU(I,J),  D7(I,J). 

The  program  oompatea  the  toeperature  aistri- 
butions  with  double  precisiom. 

Wo  note  tho  following  stages: 

-ooaputotlon  of  the  fluid  velooity  n,  of 
paraaotera  b1(b2,of  tha  initial  U  and  T 

natrloes  UBing  the  initial  Ts  and  Tt  aa tricar 

-  coapujatlo*  of  the  set  of  parametmms  A. 

(1  =  iT6)i  1 

-  computation  of  tho  infinitesimal  quantities 
dU(l,J)  and  37(1, J); 

-  computation  of  aatrioos  U(I,J)  and  V(I,J); 

-  computation  of  tho-final  temperature  dis¬ 
tribution  Ts;(I,J)  and  Tgj(I, J). 


Tho  xo3ulta  of  the  oomputationa  have  been 
diapleyed  in  a  table  of  values  aa  two.aatri- 
00a  U(X,J)  and  7(1 ,J)  whose  columns  represent 
the  time  variation  of  the  reduoed  temperatures 
at  different  dlstonoes  from  the  origin  (x  = 
=oonst)  and  rows  represent  spaoe  variations 
of  the  reduoed  teaperatures  at  different  time 
moments  (t  =  const). 

To  get  a  oonvenieut  solution  of  the  system 
of  partial  derivative  equations,  we  have 
passed  from  parameters  TL(x,t),  Ts(x,t)  to 
tho  funotions  u(x,t),  v(x,t);  these  functions 


Fig. 1. Dependence  of  temperatures  Tj^T^ -on 
tine 


FigTS.  Depandenoe  of  toaporatures  I, ,  T= 
on  distanoo  L  b 


deaeilbe  a  ratio  of  the  temperatures  of  the 
liquid  cad  of  the  solid,  respeotively.to  the 
difference'lji  the  teaperatores  of  the  upper 
and  lower  ends  of 'the  storage  tank  (I,  and- 
2^,  respectively).  d- 

5a- notice  that  IV(x,t)  and  ?c.(x,t)  are 
warped  surfaces  is  the  plane  a(x,t)e[(x0,xf), 
(t0,tf)J,  oorreaponding  to  the  different 
values  of  T^,  Tg  at  those  point  (Figs.l  and 


•W.C0SCLPSIO2S 

The  rain  new  points  of  this  paper  are: 

—  the  use  of  a  simple  iteration  nethod 

-  the  use  of  matrix  fora. 

The  nethod  used  by  the  authors  to  obtain 
the  teaperatnre 'distribution  has  sene  impor¬ 
tant  advantages.  He  notice  the  preoision  of 
the  computations  and  the  relatively  high 
degree' of  generality, which  allows' the  appli¬ 
cability  of  the  nethod  to  a  variety.of  physi¬ 
cal  systems  of  Interest  for  the  thermal  sto-* 
rage  installations  of  the  solar  power  plants. 
The  comparison  of  these  data  with  the  expe¬ 
rimental  results  will  make  possible  the  opti¬ 
mum  dimensioning  of  thermal  solar  energy 
tanks'. 
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ONE-DIMENSIONAL  ICE-FORMATION  PROBLEM. 
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Abstract  -A  simple  numerical  scheme  is 
described  which  deals  with  one-dimensional 
ice  formation  problea.  The  method  is  used 
here  to~ numerically  investigate  how  the  rate 
of  ice  foroation  is -affected  by  (a) convective 
heat  transfer  coefficient  at  the  icc-water 
interface,  (o)  the  effect  of  initial  water 
temperature  and  (c)  the  effect  of  the  coolant 
temperature .  It  is  shown  that  the  heat 
transfer  coefficient  and  the  initial  water 
temperature  have  a  limiting  role  to  play  in 
the  ice  foroation  process  but,  as  expected, 
the  effect  of  coolant  temperature  is  very 
significant. 

I.  Nomenclature 

t  Temperature 
t^  Water  temperature 
tf  Freezing  point  of  water 
tc  Coolant  temperature 

x  Distance  of.  ice-water  interface  from 
the  plate 

r  Radial  distance  of  ice-water  interface 
from  pipe 

Thermal  conductivity  of  ice 
k^  Thermal  conductivity  of  water 
f  Heat  transfer  coefficient  between 

o 

coolant  t  plate 

fw  Heat  transfer  coeff  at  the  ice  water 

interface 
P4  Density  of  ice 

p  Density  of  water 
c4  Specific  heat  capacity  of  ice 
Specific  heat  capacity  of  water 
ht  Specific  enthalpy  of  ice  formation 
^  Thermal  diffusivity  of  ice 
e  Time 

D  Distance  between  the  cooling  surface  and 
boundary  of  the  water  domain. 

Subscripts 
i  Ice 
w  Water 

Vll.  INTRODUCTION 

Ice-formation  ,an  every-day  phenomenon, 
is  easily  achieved  both  naturally  and  under 
experimental  conditions.  However,  the  large 
amount  of  literature  on  the  mathematical 
description  of  the  problem  , referred  to  as 
the  Stefan  Problem,  is  considerable  and 
testifies'to  the  fact  that  the  proper 
prediction  of  the  process  is  ,  on  the  other 
hand  ,  not  so  easy.  This  literature  is  well 
reviewed  by  Crank  (1),  Salcudean  and  Abdullah 
(2)  and  Lunardini  (3).  Because  of  its 
non-dependence  on  the  phase  change  front  the 
enthalpy  method  [41  is  by  far  the  most 


widely  used  method  to  deal  with  phase  change 
problem. There  are, though, two  apparent 
difficulties  in  employing  the  enthalpy  method 
for  the  ice  formation  problem.  The  first  is 
the  specific  heat  capacity  discontinuity  at 
the  freezing  temperature^and  the  second  is  in 
accounting  for  the  convective  heat  transfer 
at  the  ice-water  inter face. Both  these 
problems  have  been  successfully  cackled 
[4,5 )  but  notj  before  indulging  in  tedious 
mathematical  formulations. 

In  another  method  ,the  front  tracking 
method,  the  position  of  the. ice  front  is 
obtained  at  the  end  of  each  iteration  .  The 
difficulty  with  front  tracking  method  , apart 
from  its  inherent,  weakness  of  being  dependant 
on^  the  position  of  the  interface  ,  is  its 
extreme  complexity  in  solving  higher 
dimensional  problems. Fortunately,  though,  for 
most  engineering  ice-formation  applications, 
the  one-dimensional  result  is  sufficiently 
accurate  for  all  practical  purposes  [51 -The 
advantage  of  this  method  is  that  the  position 
of  the  ice  front  ,  which  is  the  most 
important  quantity  ,  is  obtained  directly 
from  the  analysis  and  it  is  not  necessary  to 
infer  it  from  an  interpretation  of  the 
temperature  profiles  as  is  necessary  with  the 
enthalpy  method. 

The  purpose  of  the  present  paper  is  to 
present  a  simple  alternative  front  tracking 
method  and  to  demonstrate  its  use  by 
evaluating  the  role  of  convective  heat 
transfer  coefficient  at  the  ice-water 
interface  and  the  effect  of  initial  water 
temperature  and  the  coolant  temperature  on 
the  rate  of  ice-formation. This  method  has  the 
advantage  of  computing  the  position  of  the 
ice  front  directly  but  is  much  easier  to 
apply  than  the  front  tracking  method. 

Ill  FORMULATION 

Consider  a  simple  model  of  ice 
forming  on  a  semi-infinite  plate  which  is 
adjacent  to  coolant  held  at  a  constant 
temperature  below  the  freezing  temperature  of 
water  (Figure  1) . 
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the  water  temperature  is  assumed  to  be 
uniform  at  t  .  The  enthalpy,  balance  equation 

at  the  ice-water  interface  yields  the 
velocity  of'the  ice  front  as: 


(V  t<)]Ahi 


u> 


ttbere  the  local  temperature 

gradient  within  the  ice  mass  at  the 
interface.  .  _  _  - 

Now  consider  an  element  at  the  interface  on 
the  water  side  ,  in  time  <?a  an  enthalpy 
balance  yeilds  the  rate  of  change  of  water 
temperature  with  time  as  : 


at. 

aa 


-  v  (c.  -  c») 

(D-X)  •  Pw  -  cw 


(2) 


Equations  (1)  and  t2)  are  a  pair  of 
first  order  simultaneous  differential 
equations  which  can  be  solved  to  give  the 
position  of  ice  front  and  the  water 
temperature  during  tha  freezing  or  thawing 
process. 

For  freezing  around  a  pipe  the 
corresponding  equations  are  : 


(3) 


and 


36  (D  w  r)  •  pw  •  c. 


IV  SOLUTION 

Equations  (1)  and  (2)  can  be  solved 
numerically  using  any  of  the  standard 
procedures  for  initial  value  problem.  The 
only  problem  is -the  term  k^dt/dx  in 
equation (1).  At  the  start  ,  when  there 
is  no  ice  then  : 


-SH-vfvO 


Thereafter  the  tempertcure  gradient  can 
be  obtained  from  the  solution  of  the 
conduction  equation  in  the  ice  .  ie. 


(5) 


This  can  be  solved  ,  most  conveniently 
using  a  finite  difference  equivalent  of  (5) 
with  the  boundary  conditions  : 
x  «  0  ;  t  -  t 


x  -  X  ;  t  -  tf 

Thus* the  solution  is  a  combination  of, for 
example, the  Runge  Kutta  Method  and  the 
finite  difference  method. 

A  typical  result  is  shown  in  figure  2. 


Tm»  (Mnuies) 

figure  2:  VtfoOon  Of  Ice-Froftl  And  Woter  Tempefotwe  Wiln  Tm 
V  TEST  PROBLEM 

The  stated  formulation  was  tested  by 
simulating  the  following  conditions:  Ice  is 
formed  on  a  flat  vertical  plate  adjacent  to 
coolant  which  is  maintained  at  a  constant 
temperature.  The  plate  is  in  an  insulated 
water  tank  and  the  water  is  initially  at  a 
uniform  temperature. The  dimensions  of  the 
plate  and  the  tank  are  so  chosen  and  the 
conditions  are  such  that  the  assumption  of 
one-dimensional  freezing  is  justified.The 
various  parameters  and  conditions  used  for 
computations  of  the  results  are  include* 
in  table  1. 


TABLE  I 

PARAMETERS  FOR  THE  TEST  PROBLEM 

Heat  transfer  coefficient  at  the  . 

coolant/ice  boundary (fQ)  2.0  JcW/m  /K 

Thermal  Conductivity  of  Ice(k£)  2.2-10’skW/mK 
Density  of  ice  (px)  517  kg/m1 

Specific  heat  capacity  of  ice(c£)  2.0  kJ/kgK 
Sp.  enthalpy  of  ice  formation (h£)  -333.5kJ/kg 
Thermal  conductivity  of  Water  (k.)  .6/1 0'JkW/oK 
Density  of  Water  (pj  1000  kg/  m1 

Sp.  heat  capacity  of  water (c.)  4.185cJ/kgK 

Freezing  point  of  water  (t£)  0  c 

Distance  between  plate  and  boundary  (D)  1.0a. 


To  assess  the  role  of  convective  heat 
transfer  coefficient  at  the  ice  water 
interface  (f.) ,  the  water  temperature  It.) 

was  initially  106C  and  the  coolant 
temperature  (tj  kept  at  -40  C.  The  various 

values  of  f/used  le  0.01  kW/oJ/K  to  4.0 

kW/m*/K  were  chosen  to  cover  the  entire 
Spectrum  of  likely  practical  values  of  fw- 
The  ace  front  position  at  different  times  for 
the  various  values  of  f^  is  shown  in  table  2. 

Because  the  water  temperature  was  also 
changing  there  was  no  maximum  ice  thickness 
and  ultimately,  all  the  water  will  become 
ice. 
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TABLE  II 

POSITION  OF  ICE  FRONT  AT  DIFFERENT  INTERVALS 
WITH'  VARYING  F 


TIME 

POSITION 

OF 

ICE 

FRONT 

(KM) 

(MIN.] 

1 

iq\ 

r 

00 

17, 

.08 

16. 

.33 

15. 

.42 

15 

.25 

.15 

.53' 

16. 

.02 

19. 

99 

24, 

.73 

23. 

,86 

23. 

.70 

23 

.63 

23; 

.82 

24. 

.15 

29. 

95 

30, 

.56 

29. 

.’72 

29. 

.88 

29 

.82 

29, 

.9? 

30. 

.24 

40. 

00 

35. 

.47 

34. 

,69 

35, 

,02 

34 

.97 

35. 

.10 

35, 

,33 

50. 

07 

39. 

.76 

39. 

.10 

39. 

.51 

39 

.47 

39. 

.58 

39. 

.78 

60. 

,14 

43. 

.65 

43. 

.09 

43. 

.56 

43 

.52 

43, 

.62 

43. 

,80 

70. 

21 

47. 

.21 

46. 

.77 

47. 

,26 

47. 

.22 

47. 

.31 

47. 

,49 

f 

> 

0. 

.01 

0. 

.1 

0. 

,5 

1, 

.0 

2. 

.0 

4. 

,0 

To  evaluate  the  effect  of  initial 


water  temperature  /the  coolant  temperature 
was  put  at  -40°C,  f  at  0.1  kw/mv  K  and  the 

water  temperature  varied  from  5.0°C  to  40°C. 
The  result  is  shown  in  table  3. 

TABLE  III 

POSITION  OF  ICE. FRONT  AT  DIFFERENT 
INTERVALS  AND  VAILING  WATER  TEMPERATURE 


TIME  POSITION  OF  ICE  FRONT  (MM.) 
(MIN.) 


10, 

.00 

16 

.77 

16 

.33 

IS 

.90 

15.47 

14.59 

13. 

.70 

19, 

.99 

24 

.41 

23 

.86 

23 

.32 

22.80 

21.77 

20. 

.75 

29. 

.95 

30 

.30 

29 

.72 

29, 

.15 

28.61 

27.56 

26. 

,56 

40. 

.00 

35 

.28 

34 

.69 

34. 

.12 

33.58 

32.56 

31. 

,60 

50. 

.07 

39 

.69 

39 

.10 

38. 

.53 

38.01 

37.02 

36. 

,10 

60. 

.14 

43 

.66 

43 

.09 

42, 

.54 

42.04 

41.10 

40. 

,22 

70. 

.21 

47 

.33 

46 

.77 

46. 

.23 

45.75 

44.83 

44. 

01 

t  ( 

C)>5. 

0 

10. 

0 

I5.i 

0 

20.0 

30.0 

40.1 

D 

In  a  similar  way  the  effect  of  coolant 
temperature  was  evaluated  by  putting  fw*»  0.1 
kW/mV  K  »,  t  ■  10°  C  and  varying  the  coolant 

temperature  from  -10°C  to  -40°C  .The  result 
is  given  in  table  4. 

TABLE  IV 

POSITION  OF  ICE  FRONT  AT  DIFFERENT 
INTERVALS  AND  VARYING  COOLANT  TEMPERATURE 


TIME (MIN.)  POSITION  OF  ICE  FRONT  (MM.) 


V 

10-00 

6.99 

9.06 

10.63 

13.70 

16.33 

19.99 

9.63 

12.93 

15,76 

20.16 

23.86 

29.95 

11.-91 

16.56 

19,95 

25.25 

29.72 

40.00 

14.55 

19.74 

23,55 

29.59 

34.69 

50.07 

16.97 

22.56 

26.75 

33.44 

39.10 

60.14 

19.16 

25.11 

29.64 

36.92 

43.09 

70.21 

21.16 

27.44 

32.32 

40.13 

46.77 

t  Co 

-10. CO  - 

15.00 

-20.00 

-30.00 

-40.00 

VI  RESULTS  AND  DISCUSSION 

A  study  of  the  results  in  table  2  shows  that 
for  all  practical  purposes  the  value  of  f 

has  no  effect  on  the  rate  of  ice  formation.  A 
lower  value  of  f^  results  in  initial  higher 


rate  of  ice  formation  but  as  the  ice  builds 
up  /the  rate  decreases  compared  to  higher 
values  of  f  > .  The  overall  rate  of  ice 
formation  is  marginally  more  in  the  case  of 
higher  f  values  but  for  all  practical 

purposes  the<difference  is  insignificant. 
Results  from  table  3  show, as  would  be 
expected#  that  with  increasing  water 
temperature  the  rate  of  ice  formation 
decreases.  However  it  is  noted  that  the 
effect- is  quite  small. 

Looking  at  these  results  together,  it  can  be 
concluded  that  heat  transfer  coefficient  at 
the  ice-water  interface  and  the  initial 
temperature  of  water  has  little  effect  on  the 
overall  rate  of  ice  formation.  It  can  be 
concluded  that  ,  for  all"  practical  conditions, 
of  the  heat  extracted  via  the  ice,  the 
portion  required  to  cool  the  water  is  small 
and  can  be  neglected. 

Results  from  table  4  shows  that  the  coolant 
temperature  has  a  significant  effect  on  the 
rate  of  ice -formation  . 

VII  CONCLUSION 

The  method  reported  in  the  paper  is  quite 
simple  and  easy  to  use  yielding  a 
mathematical  insight  into  the  ice  formation 
problem.  Data .runs  done- with  the  method  gave 
useful  information  on  the  role  of  the 
different  parameters  and  input  conditions  on 
the  rate  of  ice  formation. The  value  of 
convective -heat  transfer  coefficient  at  the 
ice-water  interface  has  a  very  limiting  role 
to  play  in  the  process  and  so  too  the 
initial  water- temperature, The  coolant 
temperature  is  the  most  important  parameter 
in,  the  ice  formation  problem. 
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Abstract:  The  regularization  formulation  of  visual 
reconstruction  problems  is  approximated  using  a  mixed 
finite  element  approach.  This  lends  itself  to  interest¬ 
ing  solution  methods:  digital  and  analog.  In  particular, 
the  analog  networks  are  similar  to  but  more  simple  than 
those  proposed  previously. 

I.  INTRODUCTION 

Visual  Reconstruction  is  concerned  with  the  recovery  of 
information  from  images  and  includes  approaches  such 
as  the  recovery  of  depth  from  stereopsis  or  the  recov¬ 
ery  of  shape  from  shading.  A  common  paradigm. is  to 
formulate  such  problems,  within  the  framework  of  reg¬ 
ularization  theory,  as  inverse  problems.  Furthermore, 
using  an  analogy  between  energy  in  an  electrical  circuit 
and  the  objective  functional  of  the  regularization  for¬ 
mulation,  many  analog  networks  have  been  derived  and 
promoted  as  neural  networks  for  machine  vision. 

This  paper  uses  a  mixed  finite  element  approximation 
to  the  regularized  problem,  From  a  purely  computa¬ 
tional  point  of  view,  this  has  several  advantages.  Firstly, 
lower  order  finite  element  basis  functions  can  be  used. 
Secondly,  such  an  approach  allows  various  derivatives  to 
be  explicitly  reconstructed  at  the  same  time  as  the  func¬ 
tion  of  interest.  These  estimates  of  the  derivatives  of  the 
function  can  be  useful  for  segmentation  or  in  the  search 
for  features.  The  approach  can  also  be  shown  to  natu¬ 
rally  incorporate  aspects  found  to  be  useful  from  purely 
experimental  studies  |l|. 

Another  major  contribution  of  this  work  is  to  investi¬ 
gate  analog  network  schemes  for  solving  the  the  mixed 
finite  element  formulation.  The  setting  we  propose  al¬ 
ready  generalizes  that  of  Harris  |2).  We  have  used  vari¬ 
ants  of  the  Arrow/Hurwicz  approach  to  solving  saddle 
point  problems  (first  proposed  as  a  neural  network  model 
by  Platt  (3))  to  derive  a  whole  variety  of  analog  network 
structures. 


We  conclude  by  discussing  work  in  progress,  to  find 
more  efficient  digital  .  approaches  to  solving  our  mixed 
finite  element  formulations. 

II.  VISUAL  RECONSTRUC¬ 
TION  AND  FINITE  ELE¬ 
MENTS 

Suppose  we  are  interested  in  recovering  a  quantity  rep¬ 
resented  by  the  function  ^  and,  from  our  model  of  the 
image  formation  process,  we  can  write  down  the  relation¬ 
ship  between  this  quantity  and  the  observed  data  d  as 
A(ip,  d)  -  0.  Typically,  even  if  one  can  solve  this  system, 
the  inverse  nature  of  the  problem  makes  the  solution  un¬ 
stable  and  we  generally  seek  a  regularized  solution  that, 
minimizes: 

J=  /  (l) 

J  n 

where  the  last  term  is  a  penalty  term  that  encourages 
smoothness  (and  D  is  an  operator  that  usually. is  com¬ 
posed  of  various  derivative  operators).  Much  of  the 
study  of  these  approaches  in  vision  has  concentrated 
upon  the  recovery  of  surfaces  from  sparse  depth  esti¬ 
mates.  This  model  problem  is  the  most  simple  of  the 
visual  reconstruction  problems  and  reduces  to  a  gener¬ 
alized  spline  fitting  problem.  For  example,  a  commonly 
studied  instance  is  the  problem  of  minimizing: 

J=  $£»(*-*)*+  (2) 

a  In  §3$  +  dxdy 

where  is  a  constant  related  to  the  reliability  of  the  data, 
C  is  a  set  of  discrete  points  at  which  we  have  constraints 
upon  the  solution  (e.g.  depth  values  in  stereo)  and  the 
integral  is  a  second  order  smoothness  (Sobolev)  semi¬ 
norm. 
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Even- though  fast  algorithms  exist  for  spline  fitting 
(particularly  in  one  dimension),  in, machine  vision  the 
research  has  concentrated  on  solving- discrete  approxi¬ 
mations  through  local  iterative  approaches.  A  typical 
approach  is  to  discretize- using  finite  elements  and  then 
solve  using  multigrid  iterative  approaches  |4j.  Such  an 
emphasis  reflects  the  fact  that  we  are  interested  in  this 
problem  as  a  model  problem  only  (the  particular  prob¬ 
lems  of  direct  interest  usually  involve  more  complex  and: 
possibly  nonlinear  operators  A).  Furthermore,  local  iter¬ 
ative  methods  are  weU  suited  to  parallel  implementation 
and  are  neural  network-like.  Indeed,  even  more  exotic 
analog  network  methods  have  been  devised  to  even  more 
closely  resemble  biological  approaches  to  vision  [5]  and, 
at  the  same  time,  improve  speed  and  robustness. 

III.  MIXED  FINITE  ELE¬ 
MENTS 

It  is  easy  to  see  that  the  Euler-Lagrange  equation  for  2  is 
closely  related  to  the  biharmonic  operator  (fourth  order), 
and  hence  quite  complex  finite  elements  are  required  |4j. 
The  author  has  used  the  mixed  finite  element  formalism 
to  reduce  the  order  of  the  differential  operators  appear¬ 
ing  in  the  smoothness  functional  (6|.  Consider  the  one 
dimensional  example1  of  minimizing: 

J=  IE, ■«=<>(*< -*)s+  (3) 

§/n(§)s +(&)’<** 

The  start  of  our  approach  is  to  consider  the  derivatives 
as  themselves  being  unknown  functions  that  we  seek  to 
reconstruct.  Thus,  we  seek  to  minimize: 

j=  4e,£cW-*)s+  m 

i/nu5  +  p5<te 

subject  to  u  =  ^,  and  p  =  ■§.  There  are  two  main 
ways  in  which  to  convert  the  above  constrained  opti¬ 
mization  problem  into  an  unconstrained  problem  One 
method  is  to  use  penalty  terms  (and  this  leads  directly 
to  the  formulation  of  Harris  [2),  for  which  1 »  has  derived 
an  analog  network  structure),  and  the  other  is  to  use 
Lagrange  Multiplier  approaches.  However,  it  is  also  pos¬ 
sible  to  combine  the  two  approaches  in  an  Augmented 
Lagrangian  formulation  [7].  For  our  problem,  this  would 
lead  to  the  problem  of  seeking  extrema  of  the  functional: 

J"  fE,ec(fr-*)2+U(»2+p2)  (5) 

1  Although,  for  michin*  vision,  we  ere  clearly  interested  in  for¬ 
mulations  over  a  two  dimensional  domain  0,  we  will  restrict  our¬ 
selves  to  one  dimensional  examples  in  much  of  the  sequel  The 
generalisation  to  higher  dimensions  is  straightforward  but  more 
cumbersome. 


+Au(j§  -  «•)  +  \p(4g  -  p)  + 

§((§-«)2+(H-p)2)<fc 

where  p  is  a  positive  constant,  and  Au,  \p  are  Lagrange 
Multipliers.  It  would  appear  that  we  pay  the  price  for 
low  order  elements  in  terms  of  an  increase  in  the  number 
of  nodal  points  and  in  terms  of  lower  order  accuracy 
in  the  reconstruction.  In  addition,  it  seems  necessary 
to  calculate  the  Lagrange  Multipliers.  However,  it  is 
easy  to  show  that  Au  =  u  and  Ap  =  p.  Furthermore, 
it  may  be  possible  to  use  post-processing  (8)  to  improve 
the  accuracy  of  approximation. 

If  p  =  0,  the  above  problem  reduces  to  a  Lagrangian 
formulation  (and  the  solution  is  generally  a  saddle  point 
rather  than  a  minima).  It  is  possible  to  show  that  if  p  is 
large  enough,  the  problem  essentially  becomes  a  penalty 
based  formulation  (and  the  solution  is  a  minima).  For 
simplicity  we  will  'set  p  =  0  in  the  sequel. 

By  eliminating  the  multipliers,  and  reformulating  the 
constraints  into  weak  constraints,  it  is  possible  to  refor¬ 
mulate  as  the  extremization  of: 

£=  §E«=cW-*)s  +  (6) 

•/n^  +  P5) +  «$  +  &£  A* 

We  generally  discretize  such  formulations  using  linear  el¬ 
ements  to  arrive  at  the  mixed  finite  element  formulation 
in  terms  of  the  nodal  variables  fa,  u,-  and  p,\. 

IV.  ANALOG  NETWORKS 

Platt  [3]  and  Scyman  [9]  independently  proposed  a  vari¬ 
ant  of  the  Arrbw/Hurwicz  approach  [10]  for  finding  sad¬ 
dle  points  of  a  functional.  In  essence,  this  is  a  form  of 
gradient  descent  on  the  primary  variables  and  ascent  on 
the  secondary  variables.  In  our  case,  this  corresponds  to 
the  scheme: 

dfc  _  dL  duj  _  £L  dp;  .dL  .  . 

dt  dipt*  dt  du{*  dt  dp; 

We  have  integrated  such  equations  for  the  problem  6  and 
the  results  are  shown  in  figure  1. 

It  is  relatively  straightforward  to  map  this  analog  ap¬ 
proach  onto  hardware  using  transconductance  amplifiers 
ini-  Simulations  of  these  circuits  using  SPICE  show  that 
the  circuits  correctly  operate.  These  circuits  are  consid¬ 
erably  simpler  than  those  previously  suggested  [2]. 

V.  DIGITAL  IMPLEMENTA¬ 
TION 

The  analog  methods  in  the  previous  section,  when  simu¬ 
lated  on  a  digital  machine,  require  many  iterations  and 
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(a)  Original  Data 


(b)  Reconitructed  Step  Edge 


Figure  1:  Reconstructed  Step  Edge 

The  original  step  edge  (a)  is  reconstructed  showing,  (b)  the  function,  (c)  first  derivative,  after  1000  iterations,  and 
(d)  the  function,  (e)  the  derivative,  and  (f)  the  second  derivative  after  10000  iterations.  Note  the  clear  peak  in  the 
first  derivative  and  the  zero  crossing  in  the  second. 
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are  thus  relatively  slow.  la  tins  section  we  consider  dig¬ 
ital  method?. 

We  consider  now,  for  simplicity,  the  discrete  form  of 
one  dimensional  regularization  using  only  the  first  order 
smoothness  term  (discarding  the  second  order  smooth¬ 
ness  term  from  6):  one  obtains  a  set  of  Ecler-Lagranje 
equations  that  can  be  written  in  matrix  form: 

(?  3)(!)-(?)  •• 

where  D  is  a  discrete  first  derivative  operator,  aou'/* 
and  7-2  are  similar  to  the  identity  matrices.  For  the  case 
p  0  this  becomes 

(  Ph+pDTD  (l-p)DT  \(4\_  ,g) 

{  (1  -P)D  -ll-p)I2)\uJ-  PJ 

(?) 

We  are  currently  studying  methods  of  solving  these  by 
using  conjugate  direction  (including  hierarchical  basis 
functions  (12]  or  other  methods  for  pre-conditioning  [13]) 
and  evaluating  with  alternative  ways  of  dealing  with  the 
indefinite  nature  of  the  system  [14]. 

VI.  CONCLUSION 

There  are  two  major  contributions  c.  this  work  so  far. 
One  is  the  introduction  of  mixed  finite  element  methods 
into  machine  vision.  The  other  is  in  the  investigation 
of  novel  analog  solution  methods  for  the  mixed  formula¬ 
tions. 

These  developments  have  inspired  current  work  on  effi¬ 
cient  (parallel)  solution  methods  of  the  related  indefinite 
systems  using  digital  hardware. 
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ABSTRACT 

In  this  paper,  application  cf  Fewer  Systea  Analysis 
Package  (FA?)  to  the  dlstributico  system  cf  credit  Iron 
and  Steel  ifcvfcs,  Inc.  Is  described. 

7**:xaJor  ala  of  the  paper  Is  to  discuss  the 
modifications  cade  cn  the  solution  algorithos  In  the 
Pc«er  Systems  Analysis  Package,  PAP,  In  order  to  adapt 
then  to  an  Industrial  distribution  systea. 

The  paper  also  includes  a  short  description  of  the 
organization  cf  new  cable  data  base,  known  as  the  Icw- 
voltage  cable  library.  In  the  last  part  cf  the  paper,  a 
systea  plot  routine  developed  during  the  research  is 
described. 


1.  POO  SYSTEMS  ANALYSIS  PACKAGE:  PA F 
1.1.  Development 

Initial  research  on  the  Power  Systecs  Analysis  Package 
(PA?)  was  started  at  the  University  of  Manchester, 
Institute  cf  Science  and  Technology  (LXIST)  in  1913. 
During  this  research,  two  algorithms  were  developed  for 
the  steady-state  solution  of  electric  power  systems. 

The  first  of  these  algorithms  Is  for  the  optima  power 
flow  problem,  which  deals  mainly  with  the  constrained 
ainiaization  of  a  the  systea  operating  cost  /7/.  The 
algoritha  Is  based  the  well-known  method  of  Dcccel- 
Tlnney  /$/  and  has  proved  quite  successful  for  the 
solution  of  systems  operating  under  stress  as  well  as 
under  normal  steady-state  conditions. 

The  other  algoritha  is  the  Past-Decoupled  Load-Flow, 
the  highly  efficient,  fast,  and  reliable  solution 
algorithm  for  the  solution  of  the  power  system  /4/. 

After  1972,  a  parallel  research  was  started  In  the 
Hfddle  East  Technical  University.  The  main  outcome  of 
this  research  was  the  Balanced/Unbalanced  Short-Circuit 
Analysis  and  the  power  systea  data  base  IDSfS  /II/. 

Another  Integral  part  of  PAP  Is  the  transient  and 
dynaafe  stability  analysis  routine  developed  in  1973 
/12/. 

The  first  industrial  application  of  PAP  was  realized 
In  the  Planning  and  Coordination  Division  (PKD)  of  the 
Turkish  Electricity  Authority  (TEK)  in  1979.  The 
package  is  still  In  use  for  the  routine  systea  analysis 
tasks  in  PKD. 

The  package  was  then  adapted  to  Cukurova  Electricity 
Utility  for  system  analysis  and  planning  studies  on  the 
southern  regions  of  Turkey,  and  is  being  used 
successfully. 


1.2.  Fast-Decccpied  Load-Flow  Algoritha 

The  Fast-Decoupled  Lcad-Flcw  equations  are  obtained'  by 
solving  the  well-known  polar  power  mismatch  equations 
using  Kewtcn*s  Method  /4/.  The  oethod  Is  a 
straightforward  application  of  the  general  Newton 
solution  algoritha  for  solving  the  sparse  real  equation 


where,  the  subcat  rices  in  the  coefficient  catrfx 
consist  of  the  Jacobians  of  active  and  reactive  power 
mismatches  with  respect  to  angles  and  scaled  voltage 
magnitudes,  respectively. 

Several  approximations  can  be  cade  in  the  above 
solution  algoritha  /4/_  These  approximations  cay  be 
briefly  listed  as  follows: 


(i)  neglect  the  off-diagonal  subcat rices  N  and  J  by 
applying  the  decoupling  principle,  hence  leaving 
two  separated  equations. 

(It)  keep  the  Jacobian  subaat rices  H  and  L  constant  at 
the  Initial  solution  point,  where  they  ere 
-  evaluated  and  factorized  only  once, 

(ill)  approximate  the  cosine  and  sine  functions  to  be 
evaluated  during  the  calculation  of  the  Jacobian 
suboatrices  H  and  l  by  unity  and  radians  of  the 
angles,  respectively. 

With  these  approximations  the  matrices  H  and  L 
beccee  strictly  the  elements  of  the  nodal 
admittance  matrix  B.  These  matrices  are  then 
called  B*  and  B”,  respectively.  In  teres  of 
these  matrices  further  approximations  are  made  as 
follows: 

(iv)  omit  from  B*  the  shunt  reactances,  and  off- 
nominal  transformer  tap  adjustments, 

(v)  transfer  the  diagonal  left-hand  side  voltage 
matrices  which  arise  from  the  division  of 
measurements  /4/,  to  the  right-hand  sides  as 
Inverses, 

(vl)  neglect  the  remaining  diagonal  voltage  matrices 
on  the  left-hand  sides  of  the  equations  by 
asswfng  that  the  voltages  In  a  systea  operating 
under  normal  operating  conditions  are  near  unity, 

(vil)  neglect  the  series  branch  resistances  while 
calculating  the  elements  of  B*. 
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Tr«-  resalting  Fist-Decoupled  Lead-Flow  equations  then 
■Secoee 

B’  £6  -  4>W  (O 

3”  £V/V  =  4Q/V  (3) 


1.3.  Heltatiocs  of  the  Fast-Decoupled  Load-Flow 
Algorlttii 

Success  of  the  fast-decoupled  Load-Flow  algorltt*,  is 
strongly  based  on  two  features  of  the  systee:  the  X/R 
ratio,  and  the  operating  conditions.  The  X/R  ratio  eust 
be  high,  and  the  systee  oust  be  in  its  normal  steady- 
state  operating  condition.  Violation  of  any  or  both  of 
these  conditions  may  easily  lead  the  algorittm  to  a 
convergence  difficulty  cr  even  to  a  divergence. 
Performance  of  the  algorithm  is  significantly  degraded 
as  this  ratio  is  reduced.  In  extreme  cases  where  this 
ratio  falls  below  0.1,  divergence  is  generally 
observed. 


Table  1  lists  the  performances  of  the  conventional 
Hewton  Load- Flow  algorithm  and  the  Fast-Decoupled 
algorithm  for  the  Eregli  distribution  system,  where  the 
X/R  ratio  Is  progressively  reduced  from  Its  nominal 
value  to  1DX  by  multiplying  branch  reactances  with  a 
varying  scale  factor.  The  tests  are  carried  out  without 
any  control  on  transformer  taps  and  generator  reactive 
power  outputs.  Convergence  tolerance  used  1r.  the  tests 
Is  0.001  HW/HVAR  for  both  tests.  Tests  on  the 
conventional  Hevton  algorithm  are  carried  out  by  using 
the  optimum  load-flow  nodule  in  PAP. 


Table  1.  Kiaber  of  Iterations  for  Eregli  System 
with  Progressively  Reduced  X/R  Ratio. 


X-Scale  Factor 

Newton- Raphson 
Load- Flow 

Fast-Decoupled 

Load-Flow 

1.000 

5 

1 

0.500 

5 

12 

0.250 

5 

35 

0.200 

5 

dlv. 

0.166 

5 

dtv. 

0.1 25 

6 

dlv. 

0.111 

6 

dtv. 

0.100 

6 

dlv. 

0.080 

? 

div. 

It  cay  be  concluded  free  the  above  test  that 
convergence  of  the  Fast-decoupled  Algorithm  is 
unreliable  for  low  voltage  distribution  systems.  The 
algorithm  exhibits  severe  convergence  difficulties  as 
the  X/R  ratio  if.  reduced,  hence  cannot  ba  used  for  the 
analysis  of  Eregli  System  in  its  original  fora. 


Similar  convergence  difficulties  are  observed  when 
branch  resistances  are  multiplied  by  a  scale  factor 
which  Is  progressively  increased  from  unity  to  5.0,  The 
situation  is  even  worse  in  this  case,  since  branch 
Impedances  are  Increased,  thus  increasing  bus  angles. 


The  main  principle  behind  this  alternative  version  Is 
surprisingly  s1*»1e:  neglect  the  resistances  of  the 
series  branches  while  calculating  the  Jacobian  matrix 
B**,  rather  than  B*. 

The  BX  Scheme  Is  Implemented  to  the  Fast-Decoupled 
Load-Flow  Algorithm  In  PAP,  and  its  performance  Is 
tested.  Similar  to  the  test  carried  out  for  the  Fast- 
Decopled  Load-Flow,  the  X/R  ratio  Is  progressively 
reduced  from  Its  nominal  value  to  10%  by  multiplying 
branch  reactances  with  a  varying  scale  factor.  The 
remaining  conditions  In  the  tests  are  not  varied  so  to 
compare  the  performance  of  the  two  algorithms.  In  order 
to  be  exhaustive, .the  operating  condition  of  the  system 
is  varied  In  order  to  observe  Its  effect  on  the 
performance. 

Performance  Indices  obtained  by  using  the  BX  scheme  Is 
listed  in  Table  2.  As  may.be  seen  from  the  Table,  a 
highly  Improved  performance  Is  obtained  as  reported  in 
the  literature  /5,10/. 


Table  2.  Number  of  Iterations  for  the  Eregli  System 
with  Progressively  Reduced  X/R  Ratio  using 
the  BX  Scheme. 


X-Scale  Factor 

Fast-Oecoupled 
Load-Flow  using 
the  BX  Scheme. 

1.000 

T 

0.500 

7 

0.250 

9-10 

0.200 

9-11 

0.166 

10-11 

0.125 

13-19 

0.111 

11-13 

0.100 

13-15 

0.080 

15-19 

t,5.  Short  Circuit  Analysis 

The  short  circuit  analysis  module  In  PAP  Is  used  to 
calculate^  post-fault  bus  voltages,  line,  cable, 
transformer,  generator,  and  induction  motor  currents, 
and  interruption  duties  of  circuit  breakers  under 
balanced  or  unbalanced  fault  conditions. 

Most  of  the  features  of  the  program  are  quite  similar 
to  that  of  a  conventional  short  circuit  analysis 
program,  and  hence  are  not  described  here,  instead,  the 
modifications  made  on  the  program  for  adapting  It  to 
the  Eregli  system  will  be  described. 


a.  Momentary  Duties 

The  momentary  outy  of  a  circuit  breaker  Is  the  oaxlmtxn 
transient  current  expected  to  pass  within  the  fault 
clearing  period.  Depending  on  the  system  X/R  ratio  and 
the  time  Instant  at  which  the  fault  is  Initiated,  this 
current  may  bo  as  high  as  1.7  times  the  normal  fault 
current. 


1.4.  The  BX  Scheme 


To  eliminate  the  limitations  of  the  Fast-decoupled  • 
load-flow  algorithm  an  alternative  version  Is  proposed 
In  Ref  /5/.  This  version,  also  called  the  BX  Scheme, 
provides  a  much  better  convergence  performance  while 
retaining  all  the  attractive  features  of  the  Fast  ■ 
Decoupled  Algorithm. 
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In  general,  for  the  sake  of  simplicity,  only  the  most 
severe  fault  condition;  a  three- phase- to- ground  type 
fault  Is  considered. 


Momentary  duty  characteristics  are  empirical  curves, 
hence  they  are  represented  in  the  algorithm  by  a 
fourth-order  polynomial. 


b.  Interruption  Duties 


3.  SYSTEM  PLOT  ALGORITHM 


The  interruption  duty  of  a  protective  equipment  is  the  Results  obtained  frca  the  load-flow  and  short-circuit 
oaxinua  fault  current  at  the  Instant  of  clearing.  This  analyses  are  plotted  by  a  plot  routine  Integrated  Into 

current  depends  on  a  ratio,  known  as  No'  AC  Decrement  PAP.  This  routine  Is  designed; to  plot  the  overall 

Ratio  (KACD),  as-well  as  its  clearing  time  and  the  single  line  diagram  of  the  Eregll  System  Which  is  of 

system  X/R  ratio.  This  ratio  is  defined  as:  about  160  X  80  cm  dimension,  plotted  by  a  large  size 

'  ■ _  plotter.  '  . 

HA  CO  s  I  t  (4)  Output  of  this  routine  Includes  the  power  flows  in 

ext  taj  t  cables  under  steady  state  operating  conditions, 

generator  loadings,  transformers  with  tap  adjustments. 

where,  I  i  Is  the  total 'fault  current  received  frca 

efEirnal  sources,  that  is  froa  the  utility  Plot  output, obtained  fron  the  short-circuit  study  is 
’system,  exactly  of  the  same  form  as  Uiat.for  the  load-flow, 

except  that  flows  and  voltages  In  this  case  corresponds 
I  Is  the  total  symmetrical  fault  current  to  fault  currents,  post  fault  voltages,  respectively. 
fl&Jlng  to  ground  at  the  fault  point. 

Multiplying  factor  curves  for  Interruption  duties  of  4-  4.  EREGLI  IRON  AND  STEEL  WORKS  DISTRIBUTION  SYSTEM 

cycle,  3-cycle,  and  2-cycle  circuit  breakers  are 

represented  by  again  fourth-order  polynomials.  Eregll  Iron, and  Steel  Works  Distribution  System  Is  used 

to  supply  the  power  demand  of  the  overall  complex 
distributed  In  a  3.5  square-km  area.  The  peak  power 
2.  Low-Voltage  Cable  Library  demand  of  the  site  Is  about  125  MW,  of  which  75  MW  is 

received  from  the  154  kV  utility' buses.  The,  remaining 
The  Power  System  Analysis  Package  (PAP)  has  an  power  is  generated  by  three  thermal  generating'  units 

extensive  data  base  which  stores  all  relevant  physical  Installed  at  -the  site.  ,The  peak  power  demand  Is 

and  electrical  data  concerning  the  transmission  and  expected  to  reach  200  HW< level  in  1995  when  the 

generation  system.  A  detailed  description  of  the  expansion  plan  Is  completed: 

organization  of  this  data  base  is  given  In  /9,11/.  This 

section  therefore,  Is  devoted  only  to  the  description  The  system  configuration  is  mainly  radial,  although 

of  the  low-voltage  cable  library  developed  for  the  some  links  are  drawn  as  security  lines  which  are  kept 

Ere9l1  distribution  system.  open  under  normal  circumstances.  Bus,  branch  and 

transformer  totals  in  the  system  are  about  170,  68,  and 
The  low-voltage  cable  library  Is  so  designed  that  a  120,  respectively.  The  nominal  voltage  levels  in  the 

user  can  easily  access  the  electrical  characteristics  system  are  13.8,  2.4,  0.4  kVolts.  The  system  Is 

of  a  low-voltage  cable  or  line  used  fn  his  study.  To  supplied  from  the  utility  buses  by  four  tie  lines,  two 

perform  a  load-flow  or  a  short-circuit  analysis,  the  of  them  being  at  66  kV  and  two  at  154  kV.  series 

user  needs  to  specify  only  the  length  and  type  of  the  reactors  are  Installed  mainly  for  short-circuit  current 

cables.  limitation  purposes. 

Table  3  lists  the  part  of  the  low-voltage  cable  library  The  utility  system  beyond  the  tie  line  buses  is 

corresponding  to  2.4.  kV  cables.  represented  by  Thevenln  generators,  of  which  the 

subtransient  reactances  are  calculated  using  the  bus 
short-circuit  levels. 

Table  3.  Lov-Volta9e  Cable  Library  (6.3  kV) 


5.  a 

>NCLUSIONS 

Cable 

Resls.- 

React. 

Susc.  (0)fces.  (0)Reac. 

Capacity 

Size 

Ohra/Km  Ohm/Km 

Hho/Kn 

Ohrn/Ka 

Ohra/Km 

Amp. 

The 

following  conclusions  may  be  drawn  from  the 

mro-sq 

experience  gained  In  applying  PAP  to  Eregll  System. 

3*6 

2.9199 

.1640 

.0000 

8.7598 

.4888 

o) 

Fast-Decoupled  Load-Flow  Algorithm  is  unreliable 

3*10 

1.9652 

.1673 

.0000 

5.8957 

.5052 

for  the  solution  of  distribution  systems  with  low 

3*16 

1.5059 

.1673 

.0000 

4.4948 

.4987 

X/R  ratio. 

3*25 

.9580 

.1575 

.0000 

3.8287 

.6332 

125.0 

3*35 

.6594 

.1509 

.0000 

3.2808 

.7513 

150.0 

<h) 

The  BX  Scheme  performs  much  better,  hence  is  more 
suitable  to  distribution  systems. 

3*40 

.5873 

.1476 

.0000 

2.9298 

.7415 

158.3 

3*50 

.4790 

.1476 

.0000 

2.4016 

.7349 

175.0 

(111)  A  distribution  system  engineer  always  tends  to 

3*63 

.3871 

.1476 

.0000 

1.9291 

.7448 

204.2 

deal  with. the  real,  physical  aspects  of  problems, 

3*70 

.3510 

.1476 

.0000 

2.8150 

1.1713 

220.0 

hence  tries  to  avoid  himself  to  be  In  a 

3*80 

.3051- 

.1444 

.0000 

2.4409 

1.1549 

246.6 

sophisticated  computational  environment  with  a 
large  volume  of  data.  Thereforo,  facilities  such 

3*95 

.2608 

.1419 

.0000 

2.0915 

1.1327 

260.0 

as  low-voltage  cable  library  are  highly  useful. 

3*100 

.2461 

.1411 

.0000 

1.9751 

1.1253 

267.0 

3*120 

.2067 

.1312 

.0000 

1.6535 

1.1024 

295.0 

<1v> 

Due  to  size  of  the  system,  plotting  Is  an 

3*125 

.2001 

.1378 

.0000 

1.5945 

1.0958 

300.7 

essential  requirement  in  distribution  system 

3*150 

.1706 

.1378 

.0000 

1.3550 

1.0991 

329.6 

analysis. 

3*160 

.1575 

.1378 

.0000 

1.2566 

1.1024 

341.2 

3*185 

.1444 

.1378 

.0000 

1.2500 

1.2300 

370.0 
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3*200 

.1280 

.1345 

.0000 

1.2730 

1.3583 

387.3 

3*250 
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.1345 

.0000 

1.0400 

1.3287 

440.0 
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Abstract-  A  mathematical  model  of  ion  pair 
active  transport  through  an  unsupported 
liquide  membrane  mediated  by  macrocyclic 
ligands  as  crown  ethers  and  assisted  by  pH 
gradient  has  been  developed. The  overall  me¬ 
chanism  is  complex, consisting  of  both  diffu¬ 
sion  and  complexation/decooplexation  reac¬ 
tion  steps  at  two  possibly  different  inter¬ 
faces.  Modelling  and  simulation  of  these 
extreme  proceases(i.e.,a  diffusional  regime 
and  a  kinetic  regime) , versus  experimental 
data, make  possible  to  determine  qualitati¬ 
vely  and  quantitatively  the  rate  limiting 
process, despite  the  poorly  defined  hydrodi- 
namlcs  of  the  cells  involved  in  experiments. 
The  main  purpose  of  modelling  and  computer 
simulation  was  to  optimize  the  rate  of  trans¬ 
port  in  amino  acids  separation  techniques. 

I.  INTRODUCTION 

The  paper  Is  focused  on  modelling  Ion  pair 
transport  through  liquid  membranes  mediated 
by  crown  ethers  and  assisted  by  pH  gradient. 
The  possibility  of  separating  amino  acids 
from  a  mixture  making  use  of  macrocyclic  li¬ 
gands  of  crown  ether  type  (18-crown-6)  in  a 
technique  of  simple  extraction  or  two  step 
extraction  was  previously  reported  (l).The 
macrocyclic  ligands  which  are  able  to  speci¬ 
fically  recognize  through  controlled  interao- 
tions  according  to  size, shape  and  structure 
cations  and  then,  by  their  means, various 
anions,  has  allowed  the  study  of  amino  acids 
which  amphion  type  molecule  may  appear  either 
as  cation  in  acid  medium  or  as  anion  in  ba¬ 
sic  medium. The  possibility  of  coupling  two 
more  chemical  species, according  to  a  natural 
model  by  means  of  certain  physico-chemical 
affinitie-  without  formation  of  covalent 
bonds  is  reffered  to  as  "recognition”  (2). 

II.  THE  ION  PAIR  CARRIER  MEDIATED 

TRANSPORT 

This  system  has  been  most  widely  studied 
(3-5)  especially  in  relation  with  the  trans¬ 
port  of  bound  monovalent  cations  (C  )  with 
an  external  anion(A~)by  neutral  ligands(L) 
like  orown  ethers  and  cryptands. Generally, 
some  of  the  phases  are  stirred  to  provide 
conveotive  transport. The  process  is  complex 
and  the  identification  of  rate-limiting 
process -Is  complicated  by  the  probable  simul¬ 
taneity  of  some  steps  and  by  other  potential 
pathways  involving  complexation/deoomplexa- 
tion  reactions  in'- the  bulk  phases. Studies  of 
the  kinetics  of  substrate  or  ion  pair  selec¬ 
tive  transport  may  reveal  the  importance  of 
different  steps. 

In  order  to  increase  the  efficiency  of  the 
process  an  original  design  of  the  device  was 
uaed(6).In  the  source  phase  there  is  the  pro¬ 
tected  amino  acid(C+)to  be  transported  and 
the  picrate  anion(A“)  in  an  aqueous  solution 
at  pH«2  ensured  by  use  of  HCl.At  the  source 
phase/ membrane  interface , 18-orown-6  macro- 


cyclic  carrier  complexes  the  amino  acid 
through  the  protonated  cuninic  group. The 
formed  complex  cation  is  'extracted  into  1,2 
dichloroethane  membrane  by  ion  pairing  with 
the  picrate  anion  which  diffuses  through  the 
membrane  to  the  meabrane/receiving  phase 
interface. The  receiving- phase  at  pH*13, trans¬ 
forms  the  protonated  amlnic  group  -  NHt’into 
-NHpjthe  amino  acid  is  released. from  the 
complex  and  passes  Into  receivlng^phase  as  a 
lithium  salt. The  empty  carrier  L. diffuses 
back  to  the  source  phase/membrane  interface 
where  the  whole  cycle  starts  again  and  again 
until  the  equilibrium  between  the  phases  of 
the  system  is  achieved. The  basic  pH  of  the 
receiving  phase  is  ensured  by  using  MOH 
because  the  Li+  ion, being  much  smaller  than 
the  crown  ether  ring' size, is  not  complexed 
by  the  macrooydic  carrier  and  so  it  does 
not  determine  any  additional  reactions  which 
could  Influence  the  transport  phenomenon. 
Consequently, the  acid  medium  of  the » source 
phase  allows_the  formation  of  an  ion  pair 
complexCLC  A“)  whioh  diffuses  through  the 
membrane. The  basic  medium  of  the  receiving 
phase  facilitates  the  dissociation  of  the 
complex  and  thus  the  amino  acid  passes  from 
the  organic  phase,  into  the  receiving  phase. 

A  coupled  transport  of  amino  acid  and  H+ions 
from  source  phase  to  receiving  phase  is 
accomplished. 


II. TRANSPORT  MODELLING 


The  overall  process  can  be  limited  by  the 
diffusion  of  the  ion  pair  complex  through 
the  organic  phase(LM)and  /or  by  the  rates 
of  reactions  occuring  at  the  interfaces  of 
the  system. As, long  as  the  rates  of  complex 
formation  and  dissociation  are  fast  compared 
to.  diffusion  through  the  membrane, the  amino 
acid  flux  is  found  to  bet 


where  P  is  the  permeation  coefficient  of  the 
ternary  complex  and  the  free  crown  ether,K 
is  the  overall  extraction  equilibrium  cons¬ 
tant,  and  [J  s  are  concentrations  at  the 
source  phase/ membrane  (in) and  the  membrane/ 
receiving  phase( out) interfaces, respectively 
in  aqueous(w) solutions, It  is  found  that 

J/W  - o -  (2) 

’+b,K+b„K2 

bi  and  bo 


l+bjK+b^K* 

and  plotted  versus  lg  K, where  a* , 
are  functions  of  the  reaction  evo 


olution 


Accordingly  the  reaction  is  most  effioient 
(fast  and  selective) if  lgK  4... 6. 

III. CONCLUSIONS 

A  diffusion-limited  kinetlo  treatment  and 
complexation/decomplexatlon  limited  processes 
have  been  modelled  and  computer  simulated. 
Most  experiments  have  suggested  a  diffusion- 
limited  process. 
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Abstracts. 

The  paper  .is  dealing-  with  the' convex 
combinations  problem  (CCP) .  Mathema¬ 
tical  analysis  of  the  CCP  is  presen¬ 
ted.  Results  of  the  investigation  are 
applied  to  problem  of  design  of  lami¬ 
nated  composite  materials. 

I.  CONVEX  COMBINATIONS  PROBLEM. 

The  CCP  is  formulated  as  follows: 

find  the  set  L(Y>  of  all  coeffici¬ 
ents  Ctj »i"t •••> ,n>  of  convex  com¬ 
binations  the  given  points  <V4 ,  i- 
■l,...,m)  which  give  the  point  Y 
or  (which  is  the  same) 

find  set  of  solutions  of  the  prob¬ 
lem 


The  left  hand  side  of  the  first. equ¬ 
ation  in  the  <1>  describes  convey  po¬ 
lyhedron  P«conv(7j ,i-i . ,n)-  'Lee- us 
consider  set  (Pn,n-1 . ,N)  of  •*«! sTple- 
xea  of  the  polyhedron  P  containing  the 
point  7  <N  is  the  total  number  of  such 
symplexes)  and  denote  by  Sn(Y>  soluti¬ 
on  of  the  CCP  corresponding  to  the 
simplex  Pn  and  point  7. 

L(7)»conv(§n,n-l,... ,N) 
(conv  denotes  the  “convex  hull1*). 

Let  us  define  distance  between  two 
polyhedrons  A,B  by  the  formula 

D(A,B)  *  max  max  j  x  -  y  J  + 
y*jA\B  xoB 

+  max  max  j  x  -  y  |  <2> 

y«A\B  x«B 


IU22C2<2_2-  Lot  7a->7  when  a->0  and 
7^,7  belong  to  P  for  every  a.  Then 
D<L<7> ,L<7))->0  as  a->0. 


II.  NUMERICAL  ALGORITHM. 

As  seen,,  solution  of  the  CCP  is  re¬ 
duced  to  the  finding  of  the  finite 
number  of  solutions  (5n ,n®l , . . . ,N> . 
For  1-D  problem  these  solutions  can 
be  computated  explicitly  /l/.  Let  us 
consider  the  first  equation  in  the 
CCP  < 1 >  as  1-D  CCP: 


(3) 


Z  viiti-yi  •  v°  - Zv*  • 

i-1  l-l 

In  accordance  with  the  theorem  1, 
its  solution  is 


N 


‘n>0  -  Z  S’"  •  <S> 

1  nj-1  1 

Substituting  <4)  into  the  equati¬ 
ons  from  the  2d  to  the  m-th  in  (1) 
one  obtains  a  new  1-D  CCP  (with  re¬ 
spect  to  <un  ,  nj-1,... ,Nj).  After 
m  steps  one  Obtains  solution  of  the 
CCP  (if  it  is  solvable)  in  the  form 
Nm 

t  -  'S”  H  ,u  , 

whore  N 

un  >0  >  Z  U"  ’ 

V1 

and 

Ni  wi 

V  Z  -Z  SnmX:-I-«n,i  ' 


If  the  CCP  (1)  has  no  solution,  then 
a  1-D  CCP  in  the  algorithm  above  is 
not  solvable. 
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III.  APPLICATION  OF  THE  CCP  FOR 
DESIGN  OF  COTOSITE  MATERIALS 
OF  LAMINATED  STRUCTURE  WITH 
SPECIFIED  PROPERTIES. 

Relationship  between  homogenized  cha¬ 
racteristics  72/  R  of  composites  of 
laminated  structure  and  characteris¬ 
tics  of  their  components  has  the  foll¬ 
owing  form  /3-6/: 


where  i 

Dj  denotes  mechanical  characteristics 
of  the  i-th  material  /4/  (or  direction 
of  fibers  of  the  i-th  layer  /3/>; 
tj  is  the  volume  ratio  of  the  i-th  ma¬ 
terial  (or  fibers  in  the  i-th  layer); 
m  is  the  total  number  of  materials  or 
layers  of  fibers  which  may  be 'used; 

6  and  F  are  known  functions. 

In  accordance  the  with  the  definition 
of  (tj ,«)  one  has 
m 

ti>®  *  ^  ti*1  *  <7> 

i»l 

Problem  of  design  of  composite  with 
specified  properties  is  formulated  in 
the  following  form  /4/» 
find  a  structure  of  composite  (it  is 
described  by  (tj ,i»l , . . - ,m> )  which  gi¬ 
ves  to  composite  a  set  of  required 
homogenized  characteristics  S 
or  (which  is  the  same) 

solve  problem  (6) , (7)  with  respect 
to  (tpi*>l,...,«))  when  A  is  given. 

As  seen,  the  problem  of  design  becomes 
a  CCP. 

The  numerical  algorithm  above  was  re¬ 
alized  as  FORTRAN  program.  It  Remonst¬ 
rated  its  efficiency  for  number  of  ma¬ 
terials  up  to  30  and  number  of  homoge¬ 
nized  characteristics  up  to  10. 

The  problem  of  design  of  composite 
constructions  of  laminated  structure 
with  specified  characteristics  is  as 
follows! 

find  a  structure  of  a  composite  which 
gives  to  the  composite  a  required 
distribution  of  homogenized  characte¬ 
ristics  ?<x)  in  the  domain  Q  /7/. 

It  comes  to  a  CPP  with  right  hand  side 
7<x>  (where  xcQ  is  a  parameter) ,  which 
can  be  approximated  by  finite  number 
of  CPPs  with  right  hand  sides  Yj*Y(Xj>t 
1*1,..., L  on  the  base  of  theorem  2. 
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Abstract  -  A  computer  program  for  fitting  and  discriminating  between 
probability  distributions  has  been  developed  to  run  on  PCs.  The 
program  is  for  those  interested  in  fitting  distributions  to  data,  for 
example,  environmental,  life  testing,  meteorological  and  traffic,  count 
data,  The  program  uses  advanced  state-of-the-art  discrimination  and 
maximum  likelihood  parameter  estimation  techniques  and  these  have 
been  applied  to  a  wide  range  cf  practical  problems. 


Introduction 

Based  on  extensive  research  into  the  application  of  probability 
distributions  to  environmental  data,  a  computer  program  called  PROFIT 
has  been  developed.  Environmental  data  can  usefully  be  summarised 
using  probability  distributions  as  an  aid  to  analysis  and  decision  making. 
In  many  cases  the  most  useful  environmental  indicators  are  the  mean, 
variability,  extreme  events  and  the  probability  of  exceeding  specified 
values.  Such'  indicators  form  the  basis  for  most  environmental 
standards.  Ail  these  indicators  of  environmental  quality  can  be  derived 
from  the  correctly  identified  probability  distributions  and  their 
parameter  values.  PROFIT  is  a  flexible  program  for  applying 
probability  distribution  models  to  data.  It  combines  powerful 
identification  .ools  with  maximum  likelihood  parameter  estimation  in  an 
easy  to  use  package. 


The  program  is  specially  designed  to  be  used  by  those  in  industry, 
government  and  research  who  have  a  special  interest  in  fitting 
probability  distributions  to  data.  The  program  can  be  usefully  applied  to 
those  wishing  to  study  data  sets  aiising  from' reliability  and  quality 
control  tests,  air  and  water  quality  data  sets,  meteorological  data  and 
road  traffic  data.  However,  application  of  the  program  is  really  only 
limited  by  the  applicability  .of  the  distribution  models  included  in  the 
program  to  the  data  in  question. 

In  order  to  simplify  the  program  structure  we  have  selected  the  most 
widely  applicable  probability  distributions  such  as  the  gamma,  Weibull, 
lognormal,  normal  and  exponential  models.  For  the  gamma,  Weibull 
and  lognormal  distributions,  two-  and  three-parameter  distributions  and 
the  ability  to  select  between  these  distributions  have  been  included  in  the 
program. 

While  many  computer  programs  used  for  statistical  analysis  of  data  are 
available,  no  other  program  offers  such  ease  .if  use,  flexibility  and 
advanced  state-of-the-art  techniques  with  extensive  pioven  application  to 
a  wide  range  of  real  data  sets. 


Program  Fwrcre 

The  program  incorporates  many  useful  design  features  based  upon  the 

authors  own 4  experiences  with  the  analysis  of  environmental  data  sets. 

These  design  features  make  the  program  powerful  and  easy  to  use.  They 

include: 

ft)  Flexible  command  structure  allowing  both  interactive  and  batch 
processing  of  dat3.  The  analysis  of  individual  data  sets  can.be 
rapidly  achieved  using  the  program  in  an  interactive  mode.  This 
feature  is  particularly  useful  when  conducting  an  exploratory 
analysis.  However,  .where  a  standard  set  of  outputs  are  required 
or  multiple  data  sets  must  be  analysed- then  batch  files  can  be 
input  to  the  processor. 

(ii)  Easy  to  use  interfaces  based  on  menus  makes  the  program  easy  to 
learn  and  easy  to  use. 

(iii)  Straightforward  formats  for  data  input  speed  analysis.  A  range  of 
simple  unformatted  data  files  can  be  accepted  by  default. 
Alternatively,  to  assist  with  the  development  of  data  bases,  a 
s:mple  hierarchal  data  format  has  also  been  developed. 
Unformatted  data  can  be  written  in  the  hierarchal  format  so  that 
no  preprocessing  is  required  to  take  full  advantage  of  this  feature. 

Civ)  An  on-lme  help  facility  is  available,  with  examples,  which 
provides  user  selected  levels  of  help  for  the  new  and  experienced 
user.  Program  generated  error  messages  provide  detailed 
descriptions  of  problems  and  the  most  likely  remedies. 

(v)  The  processors  will  produce  a  set  of  default  or  standard  outputs 
User  definable  outputs  can  be  easily  added. 

(vi)  A  flexible  plotting  utility  is  available  to  aid  in  the  analysis  of  the 
input  data  and  the  presentation  of  results.  These  include  plots  of 
autocorrelation,  time  series,  error  analysis  and  time  series  plots  to 
aid  in  the  assessment  of  stationarity. 

(vii)  A  selection  of  widely  applicable  distributions  have  been  included. 
These  axe  the  normal,  the  exponential  distribution,  two-  and  three- 
parameter  Weibull  distributions,  and  two-  and  three-parameter 
lognormal  distributions, 

(vm)  A  key  processor  designed  to  perform  the  maximum  likelihood 
estimation  of  parameter  estimates  for  the  probability  distribution 
models  has  been  designed  to  be  applicable  to  the  full  range  of 
potential  parameter  values  for  each  distribution.  The  parameter 
estimation  processor  has  been  tested  on  a  wide  range  of  actual 
environmental  data  sets. 

(ix)  A  comprehensive  range  of  well  tested  state-of-the-art  identification 
and  selection  criteria  have  been  incorporated  into  the  program.  A 
special  rule  based  expert  system  aids  in  the  interpretation  of  the 
results. 

(x)  The  program  also  includes  simulation  tools  to  enable  sampling 
from  the  probability  distributions  listed  in  (vn)  above.  The 
simulation  tool  allows  quantitative  investigation  of  the  ability  of 
the  identification  and  parameter  estimation  procedures  to  predict 
desired  properties  or  derived  quantities  of  probability 
distributions.  This  facility  allows,  for  example,  the  user  to 


investigate  the  ability  of  the  maximum  likelihood  Approach  to 
estimate  a  percentile  of  a  particular  distribution. 

(xi)  Ancier  important  simulation  tool  is  the  application  of  the 
bootstrap  method  to  d2ta  sets  in  order  to  determine  the  variability 
in  a  desired  user  specified  quantity  such  as  a  percentile  or 
parameter  value. 


ntaribulion  Functions.  Slaiiilical  and  Performance  Criletia  and 

I,  Distribution  functions 

For  the  exponential  and  normal  distributions  the  reader  is  referred  to  the 
extensive  literature  available  on  these  distributions  {!]. 

Probability  density  functions  for  the  three-parameter  gamma,  Weibull 
and  lognormal  distributions  for  a  random  sample  are  given  by: 


Gamma: 

(i) 

Weibull. 

» “tell 

(2) 

Lognormal' 

0) 

In  equations  (1),  (2)  and  (3>,  a  represents  the  shape  parameter,  p  the 
scale  parameter,  y  the  location  parameter,  and  T  is  the  gamma 
function.  The  two-parameter  versions  of  the  density,  functions  of  the 
gamma,  Weibull  and  lognormal  distributions  are  the  same  as  In  (1),  (2) 
and  (3),  with  y  ■  0  in  each  case.  In  the  above  equations,  p  >  O.ct  >  0 
and  y  is  less  than  the  minimum  observed  sample  value. 

The  maximised  value  of  the  likelihood  function  is  an  essential  statistic 
employed  in  many  criteria  used  to  discriminate  among  alternative 
models-  For  a  sample  x,,  Xj,  «,  x4  of  n  independently  and  identically 
distributed  random  observations,  the  log-likelihood  functions  for  the 
three-parameter  gamma,  Weibull  and  lognormal  distributions  are  given 
as  follows: 


Gamma: 


Weibull: 


Lognormal: 


logJL  »  -nalogP  -  nloglXa)  ♦ 
<«-l)  £  Iog(VY)  -  £  fep) 

M  M  \  P  / 

logZ-  •  r.loga  -  nalogp  ♦ 

(a-l)  £  log(x,-Y)  -  £  fep] 

M  M  V  P  / 


logl  .  -£log(2M!)  -  £  Mvy)  * 

*  M 
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2a*  i.) 


(4) 


(5) 


(6) 


The  parameters  of  the  three  log-hkelihood  functions  are  estimated  by 
maximum  likelihood  methods.  Since  the  general  maximum  likelihood 
procedure  for  the  three-parameter  gamma  and  Weibull  distributions  will 
frequently  fail  to  converge  when  the  ^unknown;  shape  parameter  is  less 
than  or  equal  to  unity,  a  computationally  efficient  approach  that 
circumvents  this  problem  is  used  (for  further  details,  see  (2J). 


II.  Statistical  Criteria 

Let  x,,  Xy  xM  represent  a  random  sample  of  n  observations.  Interest 
here  lies  in  discriminating  among  nested  and  non-cested  two-  and 
three-parameter  distributions  in  which  the  null  hypothesis  of  interest  is 
H0  :  y  -  0  against  the  alternative  //, :  y  #0.  Denoting  the 
maximised  values  of  the  two-  and  three-parameter  variants  of  a 
particular  log-likelihood  function  as  hgL^  and  fogL,  ,  respectively,  the 
AIC  and  SIC  may  be  expressed,  respectively,  as: 

Choose  the  |jj  parameter  distribution  if 

AIC.  logl,  -  2  bgL,  -  3  CO 

SIC :  bgL,  -  log*  bgL,  -  31ognH  W 

A  new  GIC  procedure  is  proposed  for  the  discrimination  of 
distributional  structures  among  a  set  of  alternatives  by  Bai  et  al.  [3]. 
The  GIC  is  based  on  the  equivalence  between  some  well-known 
information  criteria  and  hypothesis  tests,  and  attempts  to  determine  the 
false  distributions  based  on  sample  information.  Large  differences 
between  the  maximised  values  of  log-likelihood  functions  will  lead  to 
rejection  of  the  distribution  with  the  lower  value.  Discriminated 
distributions  are  separated  into  two  categories,  the  superior  and  badly 
fitting  categories.  The  distributions  in  the  superior  category  perform 
within  an  acceptable  tolerance  level  and  there  are  no  significant 
differences  among  their  performances.  The  GIC  procedure  may  provide 
several  alternatives  rather  than  one  particular  distribution  for  a 
particular  set  of  data.  In  the  event  of  there  being  several  sets  of  data, 
the  distribution  with  the  highest  probability  of  acceptance  in  the  superior 
category  will  be  chosen. 

Let  x,,  Xj,  ,  x,  be  n  independently  and  identically  distributed 
random  observations.  Denote  logLj  as  the  maximised  log-likelihood 
value  of  distribution;  (/-l,  2,  m),  with  the  ordering  given  as 

logl^  >  logl^  >  -  >  loglm  (9) 

Then  distribution  )  will  be  rejected  in  favour  of  distribution  1  if 

GIO.  -2  logL^  ♦  T0  <  -2  logL,  (10) 

where  7*0  >  0.  The  value  T0  is  the  tolerance  level  required  in  order  to 
reject  distributions  as  being  significantly  different  from  each  other,  and 
is  equivalent  to  the  rejection  region  discussed  in  the  previous  section. 
For  the  nested  case,  T0  can  be  expressed  as 

Tt  *  4  (11) 

whets  e  is  the  critical  value  of  the  x 1  distribution  with  one  degree  of 
freedom.  In  the  noa-nested  case,  2",  is  given  by 


T„  -  2f  (12) 

where  s*  >  0  and  the  asymptotic  upper  bound  on  the  significance  level 
is  given  by  the  cumulative  standard  normal  distribution  function 
evaluated  at 

The  motivation  behind  the  GIC  procedure  is  straightforward.  First,  for 
a  given  sample,  select  the  distribution  with  the  highest  maximised 
log-likelihood  value  among  logl{  (J  »  l,  2,  ,m)  This  distribution 
then  belongs  to  the  category  of  superior  distributions  and  is  also  used  as 
a  standard  for  further  inference.  Second,  reject  the  false  alternatives 
among  the  remaining  distributions  »n  terms  of  the  given  tolerance.  The 
distributions  which  perform  within  an  acceptable  tolerance  level  of  the 
best  fitting  distribution  are  retained  in  the  superior  category,  and  the 
distribution  with  the  highest  probability  of  acceptance  over  different  sets 
of  data  will  be  chosen  from  the  superior  category. 
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HI.  Performance  Criteria 


AnExanroie  Application:  Chemical  Cagnupiflg  in  Stream  Walt? 


In  this  study,  loss  functions  recommended  for  assessing  air  quality 
models  have  been  chosen  (see  [4))  to  establish  the  effect' of 
discrimination  criteria  on  the  intended  use  of  the  model.  These  functions 
are  the  relative  bias  (BIAS)  and  the  relative  root  mean  square  error 
(RRMSE)  which  are  evaluated  at  the  upper  percentiles  of  the 
distributions.  For  an  estimate  4,  of  a  quantity  of  interest  q,  these  loss 
function  axe  defined  in  terms  of  deviations  from  the  observed  value  q  in 
each  data  set.  The  definitions  used  for  the  loss  functions  are: 

«w  ■£§(?)  (,3) 

Is  §(*?)!  <,4) 

where  N  is  the  number  of  data  sets.  For  present  purposes,  the  quantity 
,q  denotes  the  upper  percentiles  of  the  sampling  distributions. 

Based  on  the  loss  functions  above,  two  performance  criteria  defined  in 
terms  of  the  relative  root  mean  square  error  (RRMSE)  are  also  used. 
The  first  criterion  is  based  on  the  mean  of  RRMSE  in  the  upper  (U) 
percentiles,  and  the  second  is  based  on  the  mean  of  RRMSE  in  the 
entire  or  full  (F)  percentiles  of  the  distribution.  For  an  estimate  4V  of  a 
quantity  of  interest  q,  these  performance  criteria  are  defined  in  terms  of 
deviations  from  q  in  the  percentiles  of  interest,  where  i  denotes  the 
specific  percentiles  estimated  and  ;  corresponds  to  the  specific  data  set 
fitted.  The  definitions  of  the  upper  percentile  error  and  full  percentile 
error  are  as  follows: 


(15) 

(16) 


Jakeman  et  al  {7]  developed  a  hybrid  modeiting  approach  to  predict 
extreme  concentrations  of  chemical  determinants  in  stream  water 
Jakeman  et  al  (7]  used  the  PROFIT  computer  program  to  identify  and 
estimate  the  parameters  of  the  probability  density  function  (pdf) 
component  of  the  hybrid  model.  We  provide  a  brief  description  of  the 
study  reported  by  Jakeman  et  al  [7)  illustrating  the  application  of  the 
PROFIT  computer  program. 

The  water  quality  data  used  here  comprise  weekly  observations  on 
stream  water  chemical  determmahds  from  the  Birkenes  catchment  in. 
Norway,  There  are  fourteen  year!\of  reasonably  complete  records 
available  for  five  chemical  species:  sulphate,  nitrate,  sodium,  calcium 
and  hydrogen  ions.  The  parameter  distributions  applied  to  characterise 
these  data  are  the  2-  and  3-parameter  gamma  (G2  and  G3),  lognormal 
(L2-and  L3)  and  Weibull  (W2  and  W3).  As  reviewed  in  Jakeman  and 
Taylor*  (S)  these- distributions  are  .generally  capable  of  representing 
skewed  environmental  data. 

Before  embarking  upon  the  analysis,  it  is  important  to  appreciate  that  an 
underlying  assumption  is  theoretically  required  when  pdfs  are  fitted  to 
observations  using  techniques  such  as  maximum  likelihood.  This 
assumption  is  that  the  observations  are  independent  and  identically 
distributed  (iid)  over  the  sampling  period,  i.e.  stationanty  and  low 
autocorrelation  for  observations.  To  obtain  an  indication  of  the 
satisfaction  of  this  assumption,  three  simple  analyses  were  undertaken 
First,  the  variability  of  the  mean  between  the  summer  and  winter 
seasons  was  examined  for  each  annual  period.  Large  differences  for 
most  years  would  imply  a  non-stationary  seasonality  for  the  variable  and 
hence  the  need  to  build  a  causal  model  with  seasonal  rather  than  annual 
predictive  capabilities.  Second,  regressions  against  flow  were 
constructed  for  the  different  variables  for  each  year.  This  would  indicate 
if  short  term  concentrations  were  flow  related  and  hence  if  they  might 
be  better  modelled  taking  this  variable  into  account.  The  regressions 
were  performed  for  concentration  on  flow  and  for  the  logarithm  of  these 
two  variables,  but  invariably  the  latter  produced  higher  levels  of 
explanation.  Third,  autocorrelation  functions  were  computed  for  each 
variable  and  for  each  year. 


where  p  is  the  location  of  the  p -quantile  which  is  chosen  at  the  98  per 
cent  level  for  UPE  and  100  per  cent  for  FPE,  N  is  the  number  of  data 
sets,  and  is  the  sample  size  for  data  set  j .  Foi  present  purposes,? 
errors*  pcrcentlle  {PUnUUK>  f^ated  to  the  uppei  and  hill  percentile 

IV.  Asymptotic  Tests 

The  performance  of  two  well  known  procedures  for  testing  goodness  of 
fit  are  also  considered,  namely  the  chi-square  (CHI)  test  and 
KoJmogorov-Smimov  (KS)  test  Classifying  the  n  observations  into* 
categories,  the  chi-square  statistic  is  of  the  form  (see  (5J): 

CHI  ■  2  07) 

M  nPl 

which  has  an  asymptotic  xl  distribution  with  (k-J-1)  degrees  of 
freedom  when  holds.  The  p,  are  hypothetical  probabilities,  the/, 
are  empirical  frequencies  and  l  Is  the  number  of  parameters  estimated 
for  e3Ch  distribution  (for  further  details,  see  (1}).  For  fitting  the  real 
data  In  Sections  5  and  6  below,  *  -  10  and  l  •  2  or  I  ■  3 ,  The  KS 
test,  which  is  defined  in  terms  of  the  maximum  absolute  difference 
between  the  sample  distribution  function  SJx)  and  the  hypothetical 
distribution  function  Fe(x)  (see  e  g.  (6},  p.  204),  is  given  by 

».  ■  (18) 

Large  observed  values  of  the  D$  statistic  lead  to  rejection  of  the 
hypothesis  F0(x), 


A  summary  of  the  outcome  of  these  exercises,  using  the  median  resutt, 
is  given  m  Table  1.  The  autocorrelations  are  high.  However,  Jakeman 
et  ai  (91  report  that,  for  the  L2  distribution,  first-lag  autocorrelations  of 
0.5  lead  to  increases  in  fitting  errors  for  the  maximum  value  of  25  pei 
cent.  This  Increase  is  relative  to  random  samples  of  the  same  size  from 
the  same  lognormal  distribution.  It  could  be  regarded  as  a  worst  case 
due  to  the  larger  positive  skewness  of  the  lognormal  distribution  over 
other  distributions  used  in  this  paper.  The  table  shows  that  both  calcium 
and  hydrogen  have  substantial  correlation  with  flow.  There  is  little 
doubt  that  short  term  hydrogen  concentrations  should  be  explained  by 
flow  and  possibly  other  variables.  To  a  lesser  extent,  this  is  the  situation 
for  calcium  as  well.  In  the  remainder  of  this  paper,  we  will  therefore 
restrict  attention  to  the  sulphate,  nitrogen  and  sodium  species,  The 
initial  analyses  reported  in  Table  1  indicate  that  annual  sets  of  short 
term  concentrations  of  these  species  are  suited  to  modelling  by  pdfs. 
For  nitrate  there  does  exist  almost  30  per  cent  difference  between  the 
summer  and  winter  means  for  half  of  the  annual  data  sets.  As  we  shall 
see,  the  accuracy  of  our  statistical  modelling  of  extremes  cf  nitrate  is 
lower  than  that  for  sodium  or  sulphate. 

I.  Estimation  of  distributional  properties 

Tables  2,  3  and  4  report  the  results  of  fitting  by  maximum  likelihood 
the  six  distributional  forms  to  the  14  annual  sets  of  weekly  observations 
of  sulphate,  nitrate  and  sodium  concentrations,  respectively.  The  tables 
give  statistics  associated  with  the  rrmse  in  fitting  the  observed  mean  and 
the  observed  maximum.  The  three  statistics  shown  are  the  average  of 
the  rrmse  over  the  14  data  sets,  the  standard  deviation  and  the 
maximum  rrmse  computed. 
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Fig.  I.  Fit  of.  the  L2  distribution  to  weekly  sulphate  concentrations  in  1986  and  1987. 


Fig,  2.  Fit  of  the  \V2  distribution  to  weekly  nitrate  concentrations  in  1980  and  1987, 


Fig.  3.  Fit  of  the  G2  distribution  to  weekly  sodium  concentrations  in  1988  and  1983. 


In  the  case  of' sulphate,  the  L2  distribution  fits  the  observed  mean 
concentration  with  an  accuracy  of  0  9  per  cent  on  average  and,  ;n  the 
worst  case,  the  error  of  fit  is  1.4  per  cent  on  one  of  the  14  data  sets. 
•The  L2  distribution  fits  the  observed  maximum  of  each  annual  data  set 
.-vjtfi  an  average  accuracy  of  6,5  per  cent,  the  worst  fit  over  all  years 
tang  17.6  per '-cent  in  error.  Only  the  W2  distribution  yields 
unacceptable  errors  in  fining  these  quantities. 

For  nitrate,  all  3-parameter  distributions  and  L2  yield  unacceptable  fits 
to  the  observed  maximum,  W2  is  the  most  superior  distribution  but  only 
marginally  to  G2.  For  both  these  distributions,  the  average  error  in 
fining  the  observed  mean  and  maximum  is  close  to  10  per  cent  and  the 
worst  error  on  anyone  data  set  is  around  ,30  per  cent,. 

For  weekly  sodium  concentrations,  only  the  W2  distribution  yields  poor 
fining  errors.  The  3-parameter  distributions  provide  no  substantial 
improvements  over  the  more,  parsimonious  alternatives.  The  G2  and  12 
distributions  fit  the  observed  mean  very  closely  and  better  than  the 
others,  while  the  observed  maximum  is  fitted  to  an  accuracy  of  less  than 
5  per  cent  on  average  and  always  less  than  30  per  cent. 

II.  Discrimination  among  distributions 

Tables  2,  3  and  4  also  show  the  results  of  applying  the  GIC,  AIC  and 
SIC  statistics,  and  the  traditional  chi-square  and  K-S  tests.  Ail  tests 
apply  a  98  per  cent  confidence  level.  Consider  the  sulphate  results  first. 
The  GIC  selects  L2  and  W3  as  not  significantly  inferior  to  alternatives 
in  12  of  the  14  cases.  The  other  two  information  criteria  (AIC  and  SIC) 
are  not  as  definitive.  The  chi-square  test  and  especially  the  K-S  test  are 
more  definitive  preferring  G2  and  L2,  followed  by  W3  and  G3.  L2  is 
certainly  the  most  preferred  or  equally  preferred  by  all 'criteria  except 
AIC.  The  error  results  in  Table  2  only  eliminate  W2.  If  a  2-patameter 
distribution  is  sought,  then  either  L2  or  G2  would  suffice,  our 
preference  being  L2  because  of  its  likelihood  (by  virtue  of  GIC  results) 
being  optimal  or  near  optimal  most  often. 

For  nitrate  concentrations,  Table  6  shows  that  all  the  information 
criteria  prefer  the  G3  distribution  The  chi-square  test  prefers  G2  while 
tli©  K-S  test  finds  all  2 -parameter  distributions  acceptable.  On  the  basts 
of  the  errors  of  fitting  the  observed  means  and  maxima,  as  reported  tn 
Table  3,  only  G2  and  W2  can  be  considered  sufficiently  accurate.  The 
K*S  and  chi-square  tests  late  these  most  highly  also  while  the 
ijuloimauon  criteria  rank  them  reasonably  highly.  The  W2  distribution 
»eems  maigmally  preferable  on  fitting  error  grounds 

The  lesults  of  applying  me  three  information  criteria  and  two  asymptotic 
tests  for  sodium  concentrations  are  quite  consistent.  The  G2  distribution 
is  most  preferred  or  at  least  equally  preferred  to  L2  by  four  of  the  five 
discrimination  methods,  the  cht-square  test  accepting  G2  for  U  of  the 
14  times  and  L2  on  12  occasions,  The  G2  distribution  also  fitted  the 
observed  means  most  closely  and  on  average  fitted  the  observed  maxima 
with  least  error,  However,  the  L2  distribution  appears  almost  as 
acceptable  as  G2. 

Fig.  I  illustrates  the  range  of  the  fitting  performance  attained  by  the  L2 
distribution  when  estimation  is  by  maximum  likelihood  on  annual  sets  of 
weekly  sulphate  concentrations.  The  left  hand  plot  shows  the  worst  fit  to 
the  observed  maximum  concentration  obtained  over  the  14  data  sets  (in 

1986)  while  the  right  hand  plot  shows  the  best  fit  to  this  observation  jin 

1987) .  Figs,  2  and  3  repeat  this  illustration  of  the  range  of  error  in 
fitting  the  observed  maximum  for  nitrate  and  sodium.  In  each  case,  our 
preferred  distribution  is  used,  the  W2  and  G2  distributions, 
respectively. 


Conclusions 

A  computer  program  for  selecting  and  fitting  probability  distribution 
functions  to  data  has  been  developed  which  offers  the  advantages  of 
being  easy  to  use,  flexible  and  incorporating  state-of-the-art  algorithms 
and  numerical  techniques  The  program  has  been  successfully  applied  to 
a  large  range  of  data  sets,  as  illustrated  m  the  example  application.  The 
program  can  be  applied  to  any  data  set  to  which  the  normal,  exponential 
and  2-  and  3-parameter  Weibull,  gamma  and  lognormal  probability 
distributions  are  applicable. 
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Abstract  A  csbodology  foe  xdm-fVrrbn  ©f  rest  fcjdrognjfcv 
corresponding  lc  tool  rim  flo*  k»  been  devsfopsd  |l|  j&i 
prograsrred  for  PC  usage.  The  enderfyfog  crhrmxkal  coia 
employed  to  convert  renf2fl  iso  ssemflo*  a  pressed.  Major 
products  zzd  fora  of  the  awcaad  PC  prfa;;.  IH ACRES,  arc 
..escribed  c*i=g  examples  froea  various  appUcrx-ns  of  the  agfoiiioa 
to  hydrological  sys^ms  where  ester  toed)  <x  da3 j  rasfoa  j=u 
streamfiow-  data  have  bees  zvadabte. 


lareftaira 

Unit  hydrograph  theory  [2]  plays  a  ctaral  rote  in  many  esaNbbed 
numerical  procedures  for  characterising  the  dynamics  of  the  rzmfoH  - 
streamfiow  proass  at  carihmerf  scale,  la  may  procedures,  as  tssiaJ 
step  is  to  subtract  baseflow  frost  the  observed  bydrograpb  is  some 
inruitively  reasonable  bet  fairly  arbitrary  may  to  give  z  residual 
streamfiow.  Then,  a  unit  hydrogmpb  (impulse  response  function)  is 
identified  which  links  the  system  inpet  of  time  varuaocs  is  rainfall 
excess  to  the  system  caps  of  time  sanations  is  residual  streamfiow' 
The  methods  predourjamly  used  for  adjusting  rainfall  to  give  rainfall 
excess,  and  any  necessary  parameters,  are  chosen  or  optimised  to  give  a 
best  fit  between  'observed'  and  modelled  residual  streamfiow.  A 
disadvantage  of  this  overall  approach  is  that  the  dexh  ed  unit  bvdrcgraph 
applies  only  to  a  radicr  poorly  defined  component  of  streamfiow  (i.e. 
observed  streamfiow  minus  a  conjectural  basefiow).  Although  a  unit 
hydrograph  derived  In  this  way  might  characterise  well  the  dynamic 
relationship  between  rainfall  and  streamfiow  for  the  individual  'well- 
behaved’  event  on  which  the  model  was  calibrated,  it  may  perform 
rather  poorly  in  validation  mode  on  other  events.  It  is  common  practice 
with  such  methods,  therefore,  u>  derive  an  average  unit  bydrograpb  for 
a  catchment  from  calibrations  on  several  specially  selected  events  (e  g. 
events  which  exhibit  a  dominant  peak  and  where  the  rainfall  has  a  ‘well- 
behaved’  temporal  profile). 


Cos. d  Sathal  LeScwai  (spiffed  da)  and  iaJesza. 
limned  a=d  Wixrbrad  |I)|  bic^Se  tbs  rdscahp  hsxwa  6e 
hydrafogjuzl  dunarion  aal  distna  d  surface  mss  aedt)  foe 
sma.1  cptend  «±seat  a  WaJes.  J axeman  and  Irfewod  (5] 
mvcssgxg  the  dmnge  a  bjirctcgsuiJ  dsanesns  cf  a  20  hr 
rx.-hmrru  h  the  Asmha  Caps;  Tca».y  crc  a  gened  cd  asma 
escalypt  forest  rtpaszxo  after  a  severe  fcesh  foe.  Lrfoood  and 
Jalq-mn  f6]  assess  the  tepn!  brase  c£  the  css  bydrograpb  ever 
a  30  year  period  for  a  large  la)  erdneu  a  WaSes.  Data  from 
several  small  research  nednexs  aroend  the  wccld  are  being  analysed 
currency  by  IH  ACRES  to  examine  the  hypothesis  that  peak  Rronfxws 
can  compete  aesdy  'c*d*  wsrr  which  c  displaced  by  *cew*  rfo«e 
(see  tes  a  this  papa).  Ocher  pcss&te  apgfeataaes  of  IH  ACRES 
related  to  engineering  fcyirefogy  and  ncer  quafity  hydrology  are  bessg 
considered. 

This  paper  mtrtxfores  the  PC  package.  IHACRES,  dercteped  yoedy  by 
the  Instrme  of  Hydrology  (IH)  and  the  Centre  for  Resource  and 
Eavwemema!  Stohes  (CREST,  the  cany  THACRES  aiso  refierfs  the 
codaborarioa  to  tics  cad  between  IH  and  CRES.  The  paper  begins  web 
drafts  of  the  modd  evoked  by  IHACRES  2nd  then  iSssraes  various 
fearorcs  of  the  package,  employing  sdectod  c umptes  from  case  srodies 
undertaken  by  the  ambon. 

ThS-POdd 

Tbe  oodd  used  n  IHACRES  is  based  oa  mat  bydrograpb  thaxy  (e.g. 
12 J)  which  describes  the  variarioa  of  stfeamflo*  x(0  over  time  t  as  a 
linear  convolution  between  rainfall  excess  c(0  and  the  unit  bydrograpb 
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In  |1),  a  methodology  is  presented  for  deriving  unit  hydrographs 
corresponding  to  (a)  total  streamfiow  and  (b)  the  quick  and  stow 
components  of  streamfiow  separately.  This  methodology  has  been 
programmed  for  PCs  and  the  package  has  been  called  IHACRES 
(Identification  of  unit  Hydrographs  And  Component  flows  from 
Rainfall,  Evaporation  and  Streamfiow  data).  IHACRES  employs  time 
series  of  rainfall  and  streamfiow  data  for  model  parameter  estimation.  It 
does  not  require  selection  of  ‘well-behaved’  events.  Neither  is 
separation  of  baseflow  from  the  hydrograph  required  before  unit 
hydrograph  identification.  From  the  high  quality  of  calibration  and 
validate  model-fits  which  have  been  obtained  ft],  [3-bJ  for  different 
catchment  types  and  data  time  intervals  (e  g.  hourly,  daily),  it  is  evident 
that  unit  hydrog.vhs  can  be  well  identified  by  IHACRES.  The 
identification  of  separate  unit  hydrographs  for  quick  and  slow  flow 
provides  the  facility  to  separate  stream  hydrographs  into  qukc  and  slow 
flow  components. 

The  capability  of  IHACRES  to  derive  reliable  unit  hydrographs  for  toui 
streamfiow,  and  for  flow  components  (quick  and  slow;  separately, 
nukes  it  widely  applicable  in  studies  of  hydrological  systems.  In 
connection  with  several  types  of  investigation  of  environmental  systems, 
IHACRES  has  already  been  applied  to  rainfall  and  streamfiow  data  sets 
from  the  United  Kingdom,  Australia,  New  Zealand  and  the  (>i».ied 
States  of  America,  Jakemao,  Liulewood  and  Whitehead  (4]  use  the 
package  to  assess  the  catchment-scale  hydrological  impact  of  partial-area 
land-use  changes  associated  with  the  forestry  cycle  in  two  catchments  in 


Rainfall  excess  is  the  amount  of  rainfall  which  contributes  to  streamfiow 
after  ’losses’  due  to  evapocranspiratioa  have  been  deducted.  When 
dealing  with  a  continuous  time  representation,  as  in  (I),  the  unit 
bydrograpb  is  the  streamfiow  response  to  a  unit  of  rainfall  excess 
applied  instantaneously.  When  a  discretisation  of  (1)  is  used,  as  in 
IHACRES,  the  corresponding  input  is  unit  rainfall  excess  over  one 
sampling  interval. 


The  assumptions  of  unit  hydrograph  theory  are  well  known,  but  are 
conveniently  summarised  in  (I)  The  methodology  in  IHACRES  also 
invokes  an  approximation  of  the  unit  hydrograph  as  a  combination  of 
exponential  decays.  Another  interpretation  of  this  is  that  rainfall  excess 
is  input  to  a  system  of  linear  reservoirs  or  storages  from  which  there  is 
an  output  which  represents  streamfiow.  The  configuration  of  reservoirs 
may  involve  storages  (hat  are  in  parallel,  in  series  or  both.  User- 
interaction  with  IHACRES  can  identify  the  most  appropriate 
configuration  in  each  case  (see  [1J);  the  most  common  configuration 
encountered  by  the  authors  is  iw©  storages  in  parallel  When  derived  in 
discrete-time,  and  in  the  case  of  two  storages  acting  in  parallel,  the 
relationship  between  rainfall  excess  uk  and  stre3mfiow>  xk  at  lime  slept 
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Fig.  1.  Systems  uugrzm  of  (be  non  common  modd  configuration  used 
by  IHACRES. 

The  pa ckage  also  allows  me  of  2  non-linear  roodd  of  the  relationship 
between  rainfall  and  rainfall  excess.  Rainfall  excess  is  highly  dependent 
on  the  ievd  of  antecedent  rainfall  and  may  also  depend  on  changes  in 
evapotranspirafcon.  The  modd  used  in  IHACRES  b  a  sickle  but  useful 
one  and  the  interested  reader  b  referred  to  [7]  and  (1{  for  details;  an 
overview  cr  the  raodd  between  rainfall  and  rainfall  excess  b  now  given. 

In  piindple.  rainfall  excess  could  be  calculated  by  estimating  any 
’losses*  due  to  evapotrasspiration  and  changes  in  catchment  wetness,  by 
employing  measurements  of  hydrometeorological  variables  and  soil 
moisture.  However,  thb  approach  b  usually  data  intensive  and  not 
necessarDy  accurate  because  of  sampling,  measurement  and  modelling 
errors.  Also,  it  may  not  be  necessary  for  stream  flow  moddKng  over 
short  periods  in  humid  temperate  regions  where  the  effects  on 
streamflow  of  variations  in  evapotranspiration  over  restricted  pen-jds 
are  usually  of  secondary  importance  to  variations  in  rainfall. 

The  approach  adopted  in  IHACRES  b  to  account  for  variations  in 
catchment  wetness  (i.e.  the  ‘ripeness’  of  the  catchment  to  produce 
Streamflow  at  the  time  of  the  causative  rainfall)  by  maintaining  a 
running  index  of  exponentially  weighted  past  rainfall  A  single  optimal 
parameter  which  determines  the  leng*  of  memory  for  exponent!  illy 
weighting  the  past  rainfall  can  be  determined  fairly  rapidly  by  u  er- 


Raicfall  excess  b  calculated  by  multiplying  r[  by  sk  at  each  time  step 
and  then  scaling  to  ensure  equality  between  volumes  of  rainfall  excess 
and  streamflow  over  the  calibration  period.  Thus  effective  rainfall,  uk, 
b  ghen  by 

uk  -  const.  r{  sk  (6) 

where  const,  b  the  scaling  factor. 


Fpatyres  aid  illustrative  applications  of  IHACRES 
I.  Calculation  of  rainfall  excess 

The  package  allows  the  user  to  use  the  non-linear  model  (4)  -  (6)  to 
walailate  rainfall  excess  or  foi  other  calculated  values  of  rainfall  excess 
to  be  accepted  directly. 

Experience  with  the  above  simple  non  linear  model  for  rainfall  excess 
(|7J,  (1),  (3-6])  has  shown  a  to  work  well  in  combination  with  (2)  for  a 
variety  of  catchment  types  and  rainfallfevaporation  regimes.  Fig.  2 
shows  (he  rainfall  and  (he  IHACRES-derived  rainfall  excess  over  a  490 
day  period  from  22  August  1976  for  the  Kenwyn  at  Truro,  a  19  knr 
catchment  *n  southwest  England.  Notice  in  Fig.  2  now,  as  (he  catchment 
wets  up  due  to  recent  rainfall,  rainfall  excess  approaches  rainfall,  aftei 
a  relatively  dry  spell  rainfall  excess  is  only  a  small  proportion  ot 
rainfall. 
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Ftg.2.  Drily  raisfril  2nd  rrisfall  excess  for  the  Keawya  at  Truro  from 
22  August.  1976. 


II.  Calibration-validation  of  IHACRES  models 

Sever *1  methods  of  p2 tzzxzte  estimation  are  available  ia  IHACRES  to 
calibrate  catchment-scale  models  of  streamflow  behaviour  in  response  to 
rainfall  excess.  These  are  basically  various  forms  of  instrument! 
varuble  estimation  (see  (IJ  aad  [8]  for  more  details).  When  any  model 
has  beta  calibrated  it  should  be  validated  using  data  from  another  period 
of  record.  Fig.  3  shows  the  calibration  model-fit  to  daily  data  ova  a 
three  year  period  from  late  July  1982  for  the  894  fan*  Teifi  at  Gian 
Teifi  in  West  Wales.  This  model  has  six  parameters  (four  in  the  transfer 
function  given  by  (2;,  one  for  the  calcularicn  of  the  catchment  wetness 
index  and  one  for  adjusting  rainfall  for  seasonal  temperature-related 
evapotranspuation  effects)  and  explains  about  895#  of  the  variance  in 
the  streamflow.  To  assess  how  well  the  modd  performs  on  another 
period  of  the  Teifi  record.  Fig.  4  shows  it  applied  to  daily  data  for  a 
three  year  period  from  the  beginning  of  August  1976.  Hae,  the  model 
calibrated  on  data  from  1982  to  19S5  explains  about  85  pa  cent  of  the 
initial  variance  of  streamflow  in  the  earlier  period  of  record.  The  modd 
captures  the  dynamics  of  the  Teifi  catchment  hydrology  and  it  b 
cyident,  thaefore,  that  reasonably  consistent  unit  hydrographs  for  iota 
streamflow  and  for  quick  and  slow  flow  components  have  beer 
identified. 


big.  3.  Calibrated  modd  fit  to  daily  Teifi  streamflow  at  Gian  Teifi  from 
25  July  1982. 

However,  careful  inspection  of  Figs  3  and  4  reveals  that  thae  are  some 
systematic  discrepancies  between  observed  and  modelled  streamflow 
For  example,  many  of  the  small  hydrograph  peals  in  summa  seasons 
are  overestimated,  indicating  that  the  modd  could  still  be  improved 
The  fairly  large  difference  between  observed  and  modelled  streamflow 
at  the  beginning  of  the  wet  season  of  1986/87  (i  t.  at  the  start  cf  •’  * 
record  shown  in  Fig.  4)  corresponds  to  the  ‘wetting  up'  paiod  at  the 
end  of  the  1976  Drought  and  thaefore  presents  a  particularly 
demanding  period  of  record  to  modd.  Littlewood  and  Jakeman  |6* 


eranaae  the  30  year  record  of  Teifi  daily  streamflow  and  catchment 
areal  rainfall  to  (a)  improve  the  modd-fit  to  summer  events  and  (b) 
assess  the  temporal  invariance  of  the  met  hydrograph  aad  its  quick  and 
slow  components. 


Time  (deys) 

Fig.  4.  Validation  of  modd  used  for  Fig.  3  on  daily  Teifi  streamflow 
from  I  August  1976. 


HI.  Separation  of  bydrographs  into  quick  and  slow  flow 

Separate  convolutions  of  the  quick  and  slow  first-order  transfer  function 
components  cf  (2)  with  rainfall  excess  generate  the  quick  and  slow  flow 
components  of  streamflow  respectively.  Fig.  5  shows  the  moddled  daily 
total  streamflow  and  the  correspoa.jn0  slow  flow  component  ova  the 
three  year  paiod  from  1974  to  1976  for  the  743  km*  Colne  at  Denham, 
a  southward  draining  tributary  of  the  Thames  in  England.  Much  of  the 
catchment  comprises  Chalk  geology  and  the  proportion  of  Colne 
streamflow  which  can  be  considaed  baseflow  from  a  regional 
groundwata  reservoir  is  thaefore  high.  The  high  slow  flow  component 
of  streamflow  demed  by  IHACRES  shown  in  Fig.  5  accords  wdt  with 
the  provenance  of  a  rdativdy  large  volume  of  baseflow  from  the  Chalk 


Fig.  5.  Moddled  daily  Colne  streamflow  and  slow  flow  at  Denham 
from  l  January  1974, 


In  contrast,  Fig.  6  ihows  the  moddled  hourly  total  streamflow  and  the 
IHACRES  slow  flow  component  ova  a  400  hour  paiod  from  late  July 
1981  for  a  0  OF.  to.  catchment  (Maimai,  M6)  in  New  Zealand.  Heie, 
it  can  be  seen  that  the  quick  flow  (total  minus  slow)  is  the  dominant 
component  during  events,  particularly  at  and  near  peak  flows 
Interestingly,  it  has  been  estimated  from  isotope  tracer  field  experiments 
that  the  provenance  of  most  of  th5  streamwata  at  and  near  peak  flows 
in  the  M6  Maimai  catchment  is  old’  wata  which  was  in  the  catchment 
prior  to  the  causative  rainfall  (9). 

In  orda  to  explain  a  dominance  of  ‘old'  wata  in  peak  streamflows  in 
such  upland  catchments,  where  slopes  are  steep  and  soils  are  thin,  it  is 
necessary  to  invoke  a  displacement  mechanism  of  streamflow 
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Fig.  6.  Modelled  daily  Maimai  (M6)  streamflow  and  slow  flow  from  25 
July  1981. 


generation.  Acceptance  of  both  (a)  the  conclusions  of  the  isotope  tracer 
studies  and  (b)  the  IHACRES  results  indicates  that  the  quick  flow  and 
’old’  water  components  have  more  in  common  in  terms  of  provenance 
than  the -quick  flow  and  ‘new’  water  components.  However,  the 
underlying  conceptualisation  of  streamflow  generation  in  IHACRES  is 
runoff  from  a  catchment  of  variable  wetness  (or,  alternatively,  variable 
contributing  area)  represented  by  an  idex  based  on  an  exponential 
weighting  of  past  rainfall.  Arguably,  therefore,  IHACRES  is  more  in 
accord  with  widely  accepted  theory  of  streamflow  generation  than  ‘he 
theories  of  displacement  necessary  to  explain  the  isotope  tracer  results. 
Furthermore,  IHACRES  hydrograph  separation  gives  results  which  are 
intuitively  reasonable  for  both  high  and  low  baseflow  streams. 

Clearly,  there  are  differences  of  opinion  in  the  scientific  community 
about  the  basic  namre  of  the  dominant  streamflow  generation  process  in 
small  upland  catchments  in  humid  regions.  Application  of  IHACRES, 
using  concurrent  groundwater,  rainfall  and  streamflow  time  series 
observations,  and  comparison  with  tracer  techniques  in  particular  field 
studies,  may  provide  a  way  forward  in  any  attempt  to  resolve  these 
differences. 


IV.  Assessing  and  monitoring  the  impact  of  land-use  change 

If  good  models  of  the  form  given  by  (2)  can  be  established  from  rainfall 
and  streamflow  data  over  different  periods  of  interest,  then  the 
component  flow  characteristic  catchment  properties  given  by  (3)  can  be 
derived  for  each  of  those  periods.  Furthermore,  since  the  instrumental 
variable  techniques  used  in  IHACRES  include  computation  of 
approximate  confidence  intervals  for  the  parameters  in  (2),  it  is 
possible,  by  a  Monte  Carlo  technique,  to  compute  approximate 
uncertainties  associated  with  the  characteristics  in  (3)  over  any 
calibration  period. 

Table  i  shows  how  the  slow  flow  time  constant  (rj  and  relative 
volume  throughput  (I')  for  the  7.7  km1  Monachyle  catchment '  in 
Scotland  change  over  a  four  year  period  (1984/85  *  1987/88)  during 
which  a  part  of  the  catchment  was  artificially  drained  before  conifer 
trees  were  planted  [4).  Each  value  shown  in  Table  1  was  derived  from  a 
time  series  of  daily  rainfall  and  streamflow  for  annual  periods'  starting 
in  mid-June.  While  the  Table  shows  large  uncertainties  on  the 
characteristic  properties  (i.e.  the  ranges  defined  in  parentheses),  the 
mean  values  indicate  that  an  increasing  relative  volume  of  flow  passes 
through  the  slow  flow  storage  following  draining. 

In  another  investigation  [51  of  the  effects  of  land  cover  change  on  the 
hydrological  response  characteristics  of  catchments,  IHACRES  is  being 
applied  to  examine  rainfall  and  streamflow  data  from  a  20  km5  upland 
catchment  in  the  ACT,  which  was  instrumented  following  a  substantially 
damaging  bushfire  in  1983.  Analysis  of  dm  sets  from  1983  until  1990, 


TABLE I 

MEAN  VALUES  FOR  SOME  CHARACTERISTIC  CATCHMENT 
PROPERTIES  FOR  THE  MONACHYLE  CATCHMENT. 


1984/85  1986/87  1987/88 


xt 

42.98 

39.63 

34.95 

(12, 68) 

(12.64) 

vM 

005 

0.15 

022 

(.08. 0.25) 

(0.13, 0.31) 

*  Available  approximate  70  per  cent  confidence  intervals  are  shown  in 
brackets 


when  revegetation  had  largely  been  reestablished,  reveals  the  stronger 
influence  of  evapotranspiration  in  the  latter  part  of  the  data  sets.  Fig.  7 
show*  the  unit  hydrograph  and  its  quick  and  slow  components  for  this 
catchment,  derived  from  model  calibration  over  the  two  year  period 
Immediately  following  the  bushfire. 


Fig  7  Unit  hydrograph  and  compo  ,mt  flows  for  model  calibrated  from 
daily  streamflow  at  Licking  Hole  Creek  from  1  April  1983  to  13 
March  1985. 


Concluding  remarks 

The  paper  has  outlined  several  features  of  IHACRES  and  has  presented 
some  of  its  applications  (existing  and  potential)  covering  a  range  of 
problems  in  hydrological  sconce  and  environmental  management.  The 
success  to  date  of  the  methodology  in  terms  of  goodness-of-fit  to 
measured  streamflow,  both  in  calibration  and  validation  modes,  and  the 
ability  to  characterise  well-defined  quick  and  slow  components  of 
streamflow,  augur  well  for  application  of  IHACRES  in  areas  of 
environmental  and  engineering  hydrology  quite  generally.  Attempts  to 
improve  the  performance  of  IHACRES  include  (a)  investigation  of 
alternative  rainfall  excess  models  —  to  better  model  small  runoff  events 
in  summer  seasons  and  (b)  utilisation  of  local  groundwater  or  soil 
moisture  content  measurements  -  to  account  for  variations  in  the  slow 
flow  component  of  streamflow. 
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Abstract 

It  is  increasingly  recognized  that,  to  be  satisfactory,  decisions  involv¬ 
ing  air  quality  require  the  use  of  stochastic  tools.  This  paper  summa¬ 
rizes  recent  research  findings  and  discusses  the  influence  they  should 
have  on  the  development  and  interpretation  of  statistical  models  that 
will  be  used  to  support  prediction;. assessment,  and  monitoring  pro¬ 
cedures. 


I  INTRODUCTION 


Historically,  the  first  mathematical  models  used  in  air  quality  work 
assumed  that  the  process  of  pollutant  dispersion  within  the  atmo¬ 
sphere  was  a  deterministic  process.  Casual  observation,  for  example 
of  smoke  from  an  industrial  chimney  or  a  cigarette,  shows  on  the  con¬ 
trary  that  the  process  is  stochastic  (be.  random).  Figure  1  shows  two 
time  series  of  pollutant  concentration  in  the  atmosphere  (chosen  from 
many  possible  examples  at  random!).  Both  illustrate  the  inherent  and 
large  variability  characteristic  of  all  such  series.  This  variability  is  ig¬ 
nored  by  dctermmi'ric  models,  such  as  Gaussian  plume  models  and 
many  other  more  complicated  schemes,  which,  though  readily  avail¬ 
able  commcrc.  (invariably  linked  up  with  eye-catching  graphics 
packages!),  have  little  connection  with  reality.  Some  countries  still 
use  such  models  in  legislation  or  in  quasi-legal  regulations.  It  is  to 
be  hoped  that  they  do  not  lead  to  disasters  before  they  arc  replaced 
by  scientifically  sound  models,  and  these  must  be  stochastic. 

The  development  of  stochastic  models  for  air  (and  water)  quality  as* 
scssment  is  now  a  thriving  research  activity  •  and  quite  rightly  so. 
Important  advances  have  been  made  by  Jakeman  and  his  colleagues 
at  ANU,  Canberra  (see  Jakeman,  Bai  Jun  and  Taylo*  1988,  and  ref¬ 
erences  therein).  The  present  paper  summarizes  some  recent  work  by 
the  authors  who  predominantly  approach  the  topic  from  the  discipline 
of  theoretical  physics  In  selecting  material  to  include,  preference 
has  been  given  to  work  that  is  important  in  constructing  stochastic 
models  and  not  necessarily  to  research  results  of  most  fundamental 
scientific  interest. 


Figure  i:  Typ'cal  concentration  time  series  from  field  trials,  (a) 
Mylne  (1989),  (b)  Nielsen  and  Jensen  (1991),  Chatwm  and  Goodall 
(1991), 


II  THE  PDF  OF  CONCENTRATION 


Let  T  be  the  concentration  of  a  species  (foreign  gas,  particles,  ...) 
dispersing  in  the  atmosphere  As  Figure  1  shows,  F  is  a  random 
function  of  position  x  and  time  1.  Its  probability  density  function 
(pdf)  p(0;  x,  t)  is  the  most  basic  measure  of  variability.  In  the  usual 
way,  p(0,x,t)  is  defined  for  0  >  0  by 


WiX.t)  =  -Trlp-oW*.!)  £  < 


and  the  mean  /r  and  variance  a*  of  T  satisfy 


fi  =:  rrJ=  /"#*»-* 
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In  conventional  turbulent  diffusion  work,  n  and  o2  are  usually  de¬ 
noted  by  V  and  (c')3  respectively.  Note  that,  in  general,  n  and  o'* 
depend  strongly  on  x  and,  for  releases  of  pollutant  that  are  not  steady, 
on  t.  In  view  of  widespread  confusion  and  lack  of  precision,  it  must 
be  emphasized  that  time-averages  do  not  occur  in  the  formulation 
above.  Means  arc  defined  by  equations  like  (2)  and  must  be  esti¬ 
mated  from  data  by  appropriate  statistical  methods  which  are  not, 
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in  general,  time-averages.  For  environmental  reasons  there  may  be 
interest  in  the  statistical  properties  not  of  the  (instantaneous)  con- 
centration  F  but  of  Pr,  its  time-average  over  some  defined  interval  T 
(eg  5  min  or  I  hr).  The  pdf  of  Fy  is  not  simply  related  to  p(0;x,t); 
in  particular  it  must  depend  bn  T.  Given  also  that,  except  for  very 
large  T,  the  pdf  of  Fy  is  more  difficult  to  model  and,  in  general,  to 
measure  than  that  of  F,  it  is  timely  to  ask  for  hard  scientific  evidence 
(rather  than  an  appeal  to  conventional  practice,  misguided  legisla¬ 
tion  or  even  anecdotal  wisdom)  that  Fy  for  a  specified  non-zero  T  is 
the  appropriate  measure  of  environmental  danger,  for  example  from 
toxic  gases.  A  continued  failure  to  consider,  this  point  seriously  will 
inevitably  result  in  wrong,  even  fatal,  decisions  being  taken. 


HI  FACTORS  INFLUENCING  PDF  MODELLING 


From  the  point  of  view  of  environmental  decision  support,  it  is  impor¬ 
tant  to  seek  robust  and  practical  adequate  mathematical  models  of 
p(0;x,t);  ideally  such  models  ought  to  be  related  to  the  particular  en¬ 
vironmental  danger  (eg,  flammability  or  toxicity?)  and  they  should 
be  simple,  Jakeman  and  his  colleagues,  and  others,  have  tested  many 
candidates  (lognormal,  gamma,  Weibull, ...)  against  data.  Chat  win 
and  Sullivan  (1989)  showed  that  p($;x,t)  has  the  exact  structure: 

*(*.x»0  =  ?/(*ix»0+(i  -?M*;x,i)»  (3) 

where  7  s  7(x,t)  is  an  intermittency  factor  (equal  to  p/do,  where 
do  is  the  source  concentration),  and  /  and  g  are  themselves  pdfs. 
The  importance  of  the  formulation  (3)  is  that  7  <  1  except  near  the 
source  $0  that  p  «  g,  i.e.  the  concentration  statistics  are  dominated 
by  the  transfer  (e.g.  by  molecular  diffusion  but  excluding  advection) 
of  pollutant  from  air  emanating  from  the  source  to  air  not  emanating 
from  the  source.  This  observation  has  Important  consequences  for 
modellers. 
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Figure  2;  Top  diagrams  show  time  series  obtained  (for  same  concen¬ 
tration)  by  two  different  instruments.  Bottom  diagram  is  a  scatter 
diagram  comparing  the  results,  the  points  are  not  dose  to  a  line  of 
slope  1  through  the  origin  •  in  fact  r  «  0.7  (Chatwm  k  Goodall  1991), 


Another  area  of  research  of  the  same  genre  is  the  magnitude  and 
removal  of  statistical  noise  arising,  again  inevitably,  because  of  the 
limited  sizo  of  datasets. 


If  simple  models  exist,  they  must  enable  parametrization  to  take  ac¬ 
count  of  atmospheric  conditions  and  source  geometry.  In  particular 
there  would  be  great  benefit  to  be  derived  if  it  could  be  shown  that 
there  was  a  simple  relationship  connecting  quantities  like  /1  and  <r5. 
The  physically  well*  based  equation 

(4) 

where  a  and  p  are  constants  and  po  is  the  maximum  value  of  p  at 
a  cross-section  downwind  of  the  source,  is  of  this  type,  and  (4)  has 
been  validated  for  many  datasets  by  Chatwm  and  Sullivan  (1990). 
However  all  these  datasets  were  from  steady  releases  in  laboratory 
experiments;  datasets  from  experiments  more  relevant  to  environ¬ 
mental  hazard  assessment  are  much  rarer  but  work  is  in  hand  to 
see  whether  (4)  can  be  generalized.  Particular  interest  attaches  to 
instantaneous  or  -  more  generally  *  unsteady  releases. 


There  Is  no  space  to  consider  the  summarized  problems  in  depth,  but 
interested  workers  should  consult  the  cited  papers,  and  also  Sullivan 
(1984)  and  Sullivan  and  Yip  (1989). 
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Abstract  -  In  addition  to  experimental  data,  modeling  and  sim¬ 
ulation  of  the  flow  and  transport  processes  are  important  means 
to  describe  and  predict  the  water  quality  in  fluvial  systems.  With 
the'  help  of  a  verified' mathematical  model  one  can  investigate 
the  principles  and  connections  m  an  aquatic  ecosystem  by  sim¬ 
ulation  of  different  process  variants  (  e  g.  transport  of  dissolved 
and  particulate  substances,  determination  of  particle  load,  self- 
punficauon  by  settling  of  particulate  matter,  exchange  processes 
between  bottom  sediment  and  the  water  column,  consequences  of 
water  management  activities)  The  presented  computer  program 
HYMOS  has  been  developed  to  handle  these  problems.  Its  struc¬ 
ture,  its  capabilities  and  an  example  of  its  application  are  shown 
here. 

I«  Introduction 

Rivers  in  industrial  countries  are  often  polluted  with  dissolved 
or  particulate  substances  from  anthropogenic  sources,  and  a  con¬ 
siderable  amount  of  dissolved  matter  can  be  adsorbed  by  the  sus¬ 
pended  sediments  carried  in  rivers.  Hence  the  fate  of  particu¬ 
late  phases  of  nutrients  and  pollutants  is  governed  by  sediment 
transport.  Adsorption-desorption  transfer,  particulate  transport 
as  well  as  deposition  and  erosion  determine  the  path  of  a  sub¬ 
stance  in  aquatic  systems  (5,6). 

There  is  a  great  need  for  powerful  mathematical  models  pre¬ 
dicting  hydrophysical,  chemical,  and  biological  processes  in  flu¬ 
vial  systems.  The  adequate  mathematical  tools  of  these  processes 
are  coupled  systems  of  nonlinear  partial  differential  equations  of 
hydrodynamics  (Reynolds  equations  and  special  convection  diffu¬ 
sion  equations)  connected  with  initial  and  boundary  conditions 
(1,2).  Using  the  HYMOS  computer  code,  one  can  solve  typical 
initial  and/or  boundary  problems  based  on  the  hydrological  spec¬ 
ified  equations  mentioned  above. 

II.  The  HYMOS  program  package 


CONTRA  simulates  groundwater  problems 
SEDIFLOW  simulates  surface  water  problems 

Fig.  1  Program  package  structure 
The  HYMOS  program  package  consists  of  a  core  and  of  several 
satellites  using  this  core.  One  can  divide  the  core  into  a  pre¬ 
processor  (named  PRE),  into  some  basic  modules,  needed  by  all 
satellites,  and  into  a  post-processor  (named  PPG),  set  Fig,  I  The 
numerical  method  to  solve  initial  and  boundary  value  problems 
for  partial  differential  equ  *tions  implemented  in  the  core  of  HY- 
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MOS  is  a  special  finite  element  technique,  called  the  multi-bases 
Galerkin  finite  difference  method  (4)  Until'nov.two  satellites 
of  HYMOS  exist  Which  are  named  CONTRA  and  SEDIFLOW 
CONTRA  deals  with  groundwater  problems  Moreover,  this  pro¬ 
gram  is  able  to  identify  automatically  parameters  oi  'considered 
flow  and  transport  processes  in  porous  media  osir.g  observation 
data.  The  satellite  program  SEDIFLOW  is  .concerned  with  con¬ 
taminant  transport  and  sedimentation  processes  in  rivers  and 
lakes.  In  the  further  presentation  essential  properties  of  SEDI¬ 
FLOW  are  considered  m  more  detail. 

III.  Capabilities  of  the  SEDIFLOW  computer  code 

Using  SEDIFLOW  one  can  simulate  flow,  contaminant  trans¬ 
port,  exchange  and  transformation  processes  as  the  following. 

-  turbulent  flow  in  real  river  segments 

-  settling  aid  transport  behavior  of  particle  fractions  (e  g 
continuous  side  discharge  or  after  accident) 

-  transport  of  dissolved  and  particulate  contaminants  and  nu¬ 
trients  and  their  accumulation  in  bed  sediment 

•  complex  exchange  and  transformation  processes  of  sub¬ 
stances,  like  volatility  on  the  free  surface,  decay  of  sub¬ 
stances  in  the  dissolved  or  particulate  phase  or  m  the  sed¬ 
iment,  sorption  effects  concerning  particle  fractions  or  the 
sediment,  aggregation  and  particle  growth 

•  coupled  transport  and  exchange  processes  of  several  sub¬ 
stances 

The  user  of  the  SEDIFLOW  code  can  choose  between  three 
models  of  turbulent  momentum  exchange. 

.  constant  coefficients  of  turbulent  momentum  exchange 

-  vertical  dependent  exchange  coefficients  (mixing  length 
model) 

•  the  L-c-turbulence  model 

SEDIFLOW  manages  two-  or  three-dimensional  problems.  All 
provided  solutions  vary  with  time.  Steady  state  solutions  are 
found  as  final  states  of  time  dependent  processes. 

SEDIFLOW  also  allows  to  apply  reduced  models.  Reducing  a 
mathematical  model  is  a  common  way  to  get  first  rough  informa¬ 
tion  about  an  ecosystem  as  well  as  to  limit  computational  costs 
(3).  Following  reduced  models  are  implemented  in  SEDIFLOW 

-  two-dimensional  vertical  cross  sections  (2-D  version) 

-  depth-averaged  models  (shallow  water  situations) 

-  static  piessure  assumption  (instead  of  full  momentum  equa¬ 
tions) 

The  application  of  SEDIFLOW  results  in 

•  flow  characteristics ; 

•  velocity  fields  of  the  mean  turbulent  flow 
.  distribution  of  the  isotropic  pressure  (water  level) 

-  quantities,  describing  the  local  state  of  turbulence  (turbu¬ 
lence  kinetic  energy,  dissipation  rate  of  turbulence  energy, 
production  rale,  coefficients  of  turbulent  momentum  ex 
change) 

•  characteristics  of  substances 

-  concentrations  of  an  unlimited  numbei  o' suspended  pal 
tide  fractions,  dissolved  substances  and  partide-sorbed 
or  sediment- accumulated  substances 
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-  secondary  quantities  as  particle  deposition  and  entrain¬ 
ment  rates,  net  deposition  rates,  sediment  accumulation 
rates,  mean  substance  concentrations  in  the  water  body 

IV.  SEDIFLOW  application  example 

In  Figs  2  and  3  are  shown  the  predicted  flow  and  the  settling 
behavior  of  different  particle  fractions  along  a  slow  flow  river  sec¬ 
tion.  In  the  settling  basin  after  a  sudden  increase  of  the  discharge 
cross  section,  the  discharge  velocity  decreases  from  vB  =  0  3  m/s 
at  the  inflow  to  downstream  values  of  about  0  02  m/s  (Fig.- 2). 


£ 

hum 

i 

Fig.  2  Settling  bum,  flow  characteristics 

a)  calculated  horizontal  velocity  field  near  free  surface 

b)  vertical  cross  section,  located  at  inflow  centerline 

c)  depth-averaged  longitudinal  velocity  and  bottom  friction 
velocity 

d)  distribution  of  longitudinal  velocity  at  downstream  cross 
section 

The  total  paitide  mass  of  9  mg/1  measured  upstream  is  separated 
into  four  different  fractions  classified  by  the  particle  settling  ve¬ 
locity  (1st  fraction  -  smallest  settling  velocity).  Fig.  3a  shows  the 
individual  fraction  settlement  along  the  considered  river  section 
Rom  this  result  it  is  evident  that  the  very  light  matter,  corre¬ 
sponding  to  the  first  fraction,  is  eroded  because  of  the  overcntical 


values  of  friction  velocity  at  the  inflow  area  (Fig  2c)  Fig  3b 
compares  the  fractional  particle  settling  with  the  behavior  of  the 
total  particle  mass  using  a  median  value  of  settling  velocity. 


Fig.  3  Settling  behavior  of  individual  particle  fractions 
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Abstract  -  The  classical  forms  of  Darcy’s  and  Fick’s  laws  seem 
to  be  inadequate  to  describe  misc.ble  solute  transport  through 
porous  media  at  higher, density  and  viscosity  gradients  In  addi¬ 
tion  to, numerical  computer  codes  accompanying  laboratory  ex¬ 
periments  to  identify  possible  model  extensions,  it  is  proposed  to 
use  asymptotic  expansion  solutions  of  the  governing  equations  for 
a  two-dimensional  laminar  stable  mixing  layer  flow.  The  method 
is  applied  to  a  non-linear  extension  of  Fick's  law  [1]  which  is 
proved  experimentally,  up  to  now,  for  aone-dimensional  flow  con¬ 
figuration  (2j.  Thus  there  is  a  need  for  reexamining  the  existing 
experimental  data  (3], [4]  and  for  some  new  experiments. 

I,  Governing  Equations 

The  basic  equations  describing  fluid  flow  and  solute  transport 
in  a  porous  medium  are  the  equations  of  conservation  of  total  and 
solute  mass.  These  ate,  for  a  conservative  solute  in  a  source-free 
flow  field  (1] 

2M  +  V(,v)  =  0  (l) 

and 

np^p  +  ^c/rt  +  VJ  =°  (2) 

where  n  is  the  medium  porosity,  p  is  the  fluid  mass  density,  v  is 
the  fluid  velocity  vector,  V  is  the  Nabla  operator,  c  is  the  solute 
volume  concentration  (Af/L3),  J  is  the  soluie  dispersive  mass  flux 
[M/(L7T)]  and  t  is  time.  These  equations  must  be  supplemented 
by  relations  of  conservation  of  momentum.  In  most  problems  of 
slow  incompressible  fluid  flow  m  porous  media,  it  is  assumed  that 
the  velocity  and  pressure  P  are  related  by  Darcy’s  law 

v  =  -i(VP-Pg)  (3) 

where  p  and  k  are,  respectively,  the  fluid's  dynamic  viscosity 
[M/(LT)\  and  the  porous  medium’s  permeability  [L3}  which  is 
assumed  to  be  isotropic  and  homogeneous,  g  is  the  gravity  ac¬ 
celeration  vector.  A  commonly  used  expression  for  the  dispersive 
mass  flux  is 

J  =  -pD(v)V(c//>)  (4) 

where  D  is  the  Bear-Scheidegger  dispersion  tensor  [5],  It  contains 
in  an  additive  manner  the  porous  medium  attenuated  molecular 
diffusion  of  the  solute  and  the  mechanical  dispersion,  the  last  be¬ 
ing  proportional  to  the  mainstream  flow  velocity  over  a  wide  range 
of  interesting  parameters.  As  proportionality  constants  serve  two 
characteristic  quantities,  the  longitudinal  at  and  the  transversal 
«r  dispersivities  [ L )  of  the  Isotropic  porous  medium.  To  close  the 
system  constitutive  relations  are  needed  for  the  fluid’s  density  and 
viscosity  which  are  their  functions  of  the  solute  concentration  c , 
Linear  relationships  between  cand  p  or  p  are  assumed  to  be  valid 
heie,  though  the  real  dependences  are  at  least  slightly  non-linear 
for  most  water  soluble  solids 

P  =  Po  ^1  +  and  p  = s  po  (5) 


with  c,  -  the  maximum  solute  concentration  occunng  in  a  specified 
transport  problem  and  6  =  (p,  —  po)/p^  ,  p,  =  p(c,)  and  eS  = 
{p,~Po)fpo ,  fi,  =  ai(c,),  where  po  and  po  are,  respectively,  the 
density  and  viscosity  values  at  zero  or  some  datum  concentration. 

In  recent  years  a  growing  community  of  expenmentators  has 
been  collecting  material  evidencing  the  inconsistency  of  either 
Darcy's  or  Fick’s  law  or  both  to  describe  the  solute  transport 
at  high  concentration  in  the  case  of  natrium  chloride  ($m<M  =  0.2, 
(e<5)m«  *10)  [2]-(4],  (6),  (7]  Hassanizadeh  [2]  has  tested  seven 
extensions  of  Fick’s  law  (4)  with  the  help  of  a  discretisized  nu¬ 
merical  model  of  the  governing  equations  on  a  one-dimensional 
experimental  set-up.  The  breakthrough  behaviour  of  saltwater  at 
the  outlet  of  a  column  initially  filled  with  fresh  water  was  mea¬ 
sured  and  compared  with  the  calculated  curves  corresponding  to 
the  different  models.  Only  the  following  extension  of  Fick's  law 
met  all  the  experimental  values  satisfactorily 

(l+^|J))J=-pD(v)7(c/p)  (6) 

where  /3  is  a  new  constant  which  could  be  identified  uniquely.  To 
avoid  difficulties  of  the  mentioned  numerical  procedures  and  to 
proceed  further  with  the  verification  of  the  non-linear  solute  mass 
flux  ansatz  ($)  it  is  proposed  to  deal  with  a  mixing  layer  flow 
problem  as  described  below. 

II.  The  mixing  layer  flow  problem 

Consider  the  parallel  stationary  flow  of  saltwater  (z  <  0)  and 
freshwater  (z  >  0)  which  start  to  mix  at  x  =  0,  see  Fig.  1  To 
ensure  almost  horizontal  flow  the  inlet  velocities  of  both  fluids 
have  to  be  related  to  each  other  as  depicted  in  Fig.  1,  (3)  In  (3),  (4j 
steady-state  distributions  of  the  diluted  salt  were  measured  along 
the  vertical  at  some  distance  1  from  the  inlet  edge  which  was  large 
compared  to  the  width  of  the  transition  zone  from  salt  water  to 
fresh  water.  In  (3)  there  was  provided  an  approximate  analytical 
solution  to  this  mixing  problem  based  on  the  classical  forms  of 
Darcy's  (3)  and  Fick’s  (4)  laws,  too  Within  the  frames  of  (3) 
and  (4)  the  proposed  dependence  of  the  transversal  dispersivily 
a t  on  c,  could  not  account  for, the  measured  distributions  either 
It  would  have  implied,  however,  a  challenge  to  the  geometrical 
concept  of  dispersivity  that  should  be  independent  of  the  fluid 
flowing  through  the  porous  medium. 
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III.  Asymptotic  expansion  for  large  values  of  x 
Without  mixing  the  pressure  distribution  P  of  the  introduced 
flow  problem  would  be,  [3]. 

z  >0 .  P  =  P(0,0)  -  -  tngz  V), 

2  <0:  P  =  P(0,0)-^x-Mz  (8) 

i  e.  ^  sharp  interface  between  the  two  fluids  would  be  retained. 
Mixing  causes  perturbation  of  this  ptessure  distribution  which 
further  will  be  denoted  by  p. 

In  the  theory  of  fluid  motions  it  rendered  possible  to  get  a 
partial  insight  into  the  structure  of  the" solutions  of  certain1  flow 
configurations  by  the  method  of  asymptotic  coordinate-type  ex¬ 
pansions  (8).  This  method  is  most  suitable  for  finding  the  solution 
of  the  flow  and  transport  problem  at  large  distances  from  the  in¬ 
teraction  zone  of  the  body  and  the  fluid  instead  of  in  the  whole 
flow  region,  or,  as  in  our  case,  at  large  distances  from  the  inlet 
edge  x  =  0.  In  analogy  to  {Cj,  it  can  be  shown  that  the  pertur¬ 
bation  pressure  p  and  the  dimensionless  concentration  ©  =  c/c, 
can  be  expanded  in  a  power  series  of  x“,/J  for  large  values  of  x  in 
the  following  way: 


P(x,z)  a  p.i(l))  Il/1+pt(j))  *#  +  Pi(<l)  X''11  + ...  (9) 

©(*, «) = ©oh)  + ©ih)  x-'t' + ...  (io) 

where  rj  a  jr/>/4arx  is  the  so  called  similarity  variable.  Substi¬ 
tution  of  these  sums  into  the  system  (1)^3),($),(6)  leads  to  or¬ 
dinary  differential  equations  for  each  of  the  coefficient  functions 
p,,  0„  These  differential  equations  are  successively  solvable  and 
the  solutions  uniquely  determinable  only  for  some  first  coefficient 
functions,  and  logarithmic  terms  in  x  have  to  be  added  to  the 
series  (9)  and  (10),  |9],  However,  the  terms  of  practical  interest 
of  the  ©-expansion,  ©*  and  ©t,  could  be  determined  after  hav- 
ing  linearized  the  respective  equations  by  means  of  an  additional 
$-power  series  expansion.  Up  to  linear  terms  In  xmlli  and  6  the 
concentration  distribution  reads: 


The  first  three  terms  are  identical  with  the  solution  obtained  in 
(3],  In  Fig,  2  there  is  shown  the  influence  of  the  additional  £- 
term  resulting  from  the  flux  ansatz(6)  for  a  fixed  set  of  remaining 
parameters  taken  from  an  experiment  in  (4j,  Unfortunately  this 
term  cannot  be  fitted  with  the  experimental  distribution  for  no 
value  of  fl  either,  although  some  basic  trends  of  the  influence  of 
the  entering  parameters  are  identical  with  those  observed.  Nev¬ 
ertheless,  it  is  believed  that  the  utilization  of  the  asymptotic  ex¬ 


pansion  method  to  solve  the  flow  problem  considered  is  an  appro¬ 
priate  means  to  ‘visualize’  the  effects  of  transport  model  exten¬ 
sions  like  (6)  and  others  on  measurable  quantities  in  a  straight 
forward  manner.  Further  research  endeavour  both  experimental 
and  analytical  should  be  undertaken  It  is  planned  to  study  the 
influence  of  several  extensions  of  Darcy’s  law  (10],(11)  and  its  com¬ 
bination  with  the  Pick’s  law  extension  (6) 


Fig.  2  Influence  of  the  0-term  on  the  vertical  concentration  di$- 
tnbution  at  l  =  1115  cm  (parameter  values  art  taken 
from  (4j:  ar  a  0.135  cm,  »«,  a  0.0414  cm/see,  i  a  0  088, 
«<5  «  0.255,  Pedet  number  Pe  »  1450  ) 
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WATER  ENTRY  SIMULATION  OF 
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Abstract  In  this  paper  finite  difference 
calculations  for  water-entry  cavity  are  per¬ 
formed  for  the  cyl mder, circular  disk, and 
hemisphere , One  of  the  employed  finite  diffe¬ 
rence  schemes  is  S01A~V0F  scheme  on  Cartesi¬ 
an  grid  and  other  two  schemes  are  on  BFC(Bo- 
undary  Fitted  Coordinate)grid  with  partly  or 
fully  contravariant  velocity  express  ion. The 
symmetricity  of  the'  flow  is  assumed, An  time 
dependent  Incompressible  Navier  Stokes  equ¬ 
ation  is  treated.  Monitoring  display  of  the 
contours  of  flow  variables  is  also  attempted 
nearly  simultaneously  with  the  fluid  dynamic 
calculation  on  a  graphic  workstation. 

I.  INTRODUCTION 

As  for  water  entry  cavity  flow  on  a  small 
scale,  the  splash  phenomena  around  sphere 
with  centimeter  size  are  experimentally  stu¬ 
died  by  Kuvab?ra  et  aiUl.and  with  numerical 
approach.such  flows  were  attacked(2»3)  by  NA 
C  schemes  assuming  the  flow  is  axisymmetr ic. 

In  first  part  of  this  paper  the  water  en¬ 
try  cavity  flow  Is  numerically  analyzed  with 
the  boundary  fitted  coordlnates(BFC|  .  Ini¬ 
tially  the  lover  vertex  of  body  Is  tangen¬ 
tially  contact  with  the  upward  liquid  flow 
and  a  body  is  fixed  at  initial  position.  The 
Navier  stokes  equation  and  contlnuty  equa¬ 
tion  are  transformed  to  the  forms  on  the 
generated  BFC  grids. HSMAC  method  Is  modified 
for  the  transformed  forms, whose  results  are 
compared  with  those  on  the  cartesian  grid. 

Following  the  authors*  successfully  ani¬ 
mated  scenes  of  the  splash  of  a  droplet(4), 
but  in  contrast  to  pre  or  post-processing , a 
target  of  the  present  study  is  the  attempt 
to  visualize  the  flow  behavior  simultaneous¬ 
ly  with  the  fluid  dynamical  calculation. 

U.  BASIC  EQUATIONS 

The  symmetry  of  flow  and  incompressibility 
are  assumed.  An  incompressible  Navier  Stokes 
equation  is  written  for  a  general  curviline 
ar  coordinate  as  follows 

8  V  1 

- -  ♦  v  * ( V  V  )  - - v  p  +  y  A  V  ♦  «  (i) 

at  a 

V  ■  V  a  0  (2) 

Physical  domain  Is  transformed  to  a  rectan¬ 
gular  domain  of  general  coordinate  i  .  j?  . 
Equations  (1),(2)  are  nondimensionalized  by 
fi  DVR, where  U.R  are  the  oncoming  fluid 
velocity  and  body  radius,  respectively. 

It  has  been  a  important  subject  how  tac- 
ticaly  a  staggered  differencing  is  applied 
or.  non-cartesian  grids  without  oscillatory 


pressure  or  velocity.  The  reasons  of  the 
oscillation  is  that  the  transformed  velocity 
component  are  not  aligned  with  coordinate 
dircctions{5J.  First, here, the  non-physical 
velocity  formulation  is  applied.  Finite  dif¬ 
ferencing  of  the  convection  terms  employes 
a  third  order  upwind  difference  expression 
Except  this  the  scheme  is  the  modification 
of  HSKAC  scheme  to  a  general  coordinate. 

B.  rn  Relaxation  for  continuity  equation 

lime  integration  of  Navier  Stokes  equation 
(1)  by  explicit  Euler  method  gives  the  re¬ 
newed  velocities  at  a  new  time  level.  Press¬ 
ure  and  renewed  velocities  are  adjusted  to 
satisfy  the  continuity  equation(2)  whose 
residual  is  expressed  with  0=div  V  before 
convergence.  Ihat  is, the  relaxation  process 
is  iteration  between  the  pressure  expression 
which  is  given  by  the  diagonal  dominant 
approximation  and  the  new  estimate  for  the 
velocities  as  follows 

<?  p*-o>  D  / &  t(  «  1  !♦  g  22)  (3) 


{$*d Pt+r)ad p.) 

<£»Spf+??y$P») 

where  «  is  the  iteration  factor  and  gll.g 
22  are  the  metric  coefficient  e.g.  gUs€x* 
£y2  ,and  subsetipts  denotes  the  derivatives. 

Ej-UpmAak  y-Qfta.4msa 

The  markers  without  substantial  mass  are 
shifted  at  each  time  step  with  intcrpolated- 
vclocity  from  velociotics  at  the  cell  sides. 
Then  the  location  of  free  surface  is  simply 
determined  from  the  location  of  surface(SUR) 
cell  beyond  which  no  cell  Involve  the  marker. 

In  the  results  on  BFC  shovn  here, the  sur¬ 
face  tension  is  not  Included  so  far. 

The  gas  pressure  in  the  open  cavity  can  be 
calculated  by  the  Bernoulli  equation.  While, 
if  a  closed  cavity  appears, the  adiabatic 
process  is  applied  for  the  pressure  Inside 
the  bubble  behind  the  obstacle. 

Thus,  the  atmospheric  gas  does  not  inter¬ 
act  with  liquid  flow  except  in  the  closed 
cavity.  In  accordance  with  this,  we  applied 
the  initial  flat  free  surface  condition. 


Numerical  results  are  shown  for  Re=30QQ0, 
40000,  to  compare  with  the  existing  experi¬ 
ment  for  Re=38720(R-8m*,U=2.42m/$  if  ^  =0.01 
cm2/$),The  velocity  vectors  are  shovn  in  Fig. 
i  for  ReOOOOO  .  It  is  natural  that  the 
large  velocity  appears  at  the  peak  near  the 
body. In  contrast  in  Fig. 3  at  large  velocity, 
Re-40000,  such  film  cannot  be  predicetd. 


i 


Fig. 1  Re=30Q0Q  Fig. 2  ReMOOOO 

The  free  surface  profiles  at  typical  tine- 
levels  are  shown  In  Fig.  I .  The  two  peaks 
appears  in  the  developed  state. The  peak  near 
the  body  is  the  leading  edge  of  the  liquid 
filt  along  the  body  surface. Such  liquid  Mia 
is  known  in  the  exper lient(  1 1 .  The  height  of 
the  peak  is  rather  lower  than  the  experi¬ 
mental  value  where  leading  edge  reaches 
75  degree  fron  lower  stagf/tlon  point  when 
Ut/D* 1/2  . 

While,  the  physical  component  formulation 
{ S |  have  been  coded  and  sucessfully  applied 
to  a  cylinder  submerged  In  a  uniform  flow 
by  the  younger  author  Hr. Abe. This  code  is 
expected  to  produce  an  improved  behavior. 


is  an  additional  results  to  the  authors’ 


previous  study|3l.  The  numerical  examples 
are  given  for  the  hemisphere  jiilh  diameter 
12cn  and  the  unifortflov  ve loci ty : U =  3 .834m/ 
s  .  As  in  the_aut*  ors  , previous  report, our 
resultsU)  show  qualitatively  good  ^agreement 
with  the  results  by  SHAC  calculation^). 

In  Fig. 4  the  isobars  are  shown  for  dimen¬ 
sionless  time  t =U t/D  =  3 , 834 -  Even  in  the  sur¬ 
face  profile, we  cannot  find  sharp  peak  di¬ 
ming  upward.  That  is  the  liquid  film  cannot 
be  predicted  by  the  cartesian  mesh  code  as 
far  as  with  the  cell  sizes  Ar=  Az=R/20. 

The  isobars  as’  in  Fig. 4  or  the  equivorti- 
city  lines  can  be  displayed  during  calcula¬ 
tion.  On  our  weorkstation  TlATN-1  the  free 
surface  shape  in  the  surface  modeler  image 
as  in  Fig. 5  can  be  displayed  nearly  in  real 
time  mode.  That  is, the  CPU-time  are  about  20 
seconds, 10  sec  respectively  for  CFO  opera¬ 
tion  and  isobars  displaying.  The  appropriate 
intervalfor  display  is  estimated  with  the 
comparison  of  the  elapse  time  for  contour 
display  with  that  for  the  CFO  calculation. 


Hi  CARTESIAN  GRID  and  SOIA-YQF  SCHEME 


We  introduce  here  the  rc$ults(3lon  cylin¬ 
drical  coordinate  r.z,  respectively, corres¬ 
pond  to  radial  and  axial  coord inate, and  for 
which  equatlons(l,2)  are  rewritten 
In  this  case  axlsymmetrlc  obstacle  such 
as  hemispehre  Is  treated  upon  the  same 
assumption  as  In  the  BFC  formulation.  As 
in  usual  MAC  scheme  a  staggered  differencing 
is  applied. The  finite  difference  approxima¬ 
tion  employs  a  two-point  upwind  scheme. 


A.  Free  surface  condition 
Near  the  free  surface  we  employ  YOFlvolume 
of  fluidl  function, which  represents  the 
liquid  volume  fraction.  The  volume  fraction 
F  satisfies  the  following  equation 


QJ  ,  1  8  (rFu)  d  F  v 

0  t  r  0  r  0  z 


0  (5) 


The  liquid-surface-pressure  measured  from 
gas  phase  can  be  expressed  with  surface  tens 
ion  <?  by  radii  of  curvatures. 


As  described  in  the  previous  section  II 
for  the  BFC  formulation, the  pressure  and  re¬ 
newed  velocities  are  adjusted  to  satisfy 
the  continuity  equation 


Ve  introduce  the  numerical  examples  which 


IV  CONCLUSIVE  REMARKS 
Finite  dlffernce  scheme  on  the  BFC  grid 
predict  liquid  film  along  curved  surface 
.which  could  not  be  predicted  by  the  scheme 
on  the  Cartesian  grid.Moni toring  display 
scenes  on  the  workstation  Is  shown  as  the 
proposal  of  a  visual  aid  for  code  develop¬ 
ment  process. 

Fig»5  Fre.e  Surface  and  Isobars 
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Abstract-'  Identification  of  source  strenghts  of 
closed  basins  is  often  difficult  due  to  the 
presence  of  diffuse  sources  or  the  unreliability 
of  official  information.  A  source  -  receptor 
technique  is  developed  in  this  communication  for 
the  estimation  of  polluting  sources  at  the 
boundary  of  the  basin  from  concentration 
measurements  inside  the  basin. 


I.  INTRODUCTION 


c(*i.y*)  ’known  within  experimental  accuracy  at 
location  (x<.y*)»*~* . N 

where  T  is  the  unknown  pollutant  flux  along  the 
boundary  contour  S  (n  being  the  normal  direction 
to  S) . 

Since  T  (S)  is  unknown,  the  Neumann  problem  can 
be  replaced  by  the  equivalent  Dirichlet  problem 


The,  identification  of  polluting  sources  in 
closed  basins  has  become  more  and  more  important 
in  the  last  decade.  Contaminations  due  to 
industrial  wastes  and- municipal  sewage  waters 
are  presently  threatening  the  survival  of 
biological  life  in  many  lakes  and  interior  seas 
or  can  give  rise  to  dangerous  eutrophication 
phenomena  or  become  centers  of  infections.  Hence 
the  necessity  of  monitoring  water  quality,  of 
identifying  polluting  sources  and  of  classifying 
them.  Unfortunately  the  identification  problem 
can  be  carried  out  using  receptor  techniques 
only.  In  other  wordsvtho  location  and  the  nature 
of  polluting  sources  at  the  boundary  of  the  basin 
are  to  be  determined  using  vthe  information 
provided  by  concentration  measurements  inside 
the  basin.  This  is-due  to  the  fact  that  sources 
can  be  unknown  such  as  diffuse  sources,  generally 
connected  with  cumulating  microcontaminatij>nS; 
or  their  official  strengths  unreliable;- ’as  in 
tho  case  of  more  than  one  country  haying  access 
to  the  basin.  To  tackle  the  problem  we  first 
have  to  setup  a  model  connecting  'concentrations 
in  the  basin  with  fluxes  of  contaminants  at  the 
boundary;  then  a  suitable  algorithm  has  to  be 
constructed  for  the  model  inversion,  i.e.  tho 
determination  of  fluxes  using  experimental 
values  of  the  concentration. 

II.  -MODEL  SETUP 


The  diffusion  of  pollutants  in  water  has  been 
assumed  to  obey  the  following  fickian  law: 


In  other  words  diffusion  has  been  supposed  to 
bo  prevailingly  horizontal,  whereas  a  more 
general  dispersion  law  r,  including  deposition, 
vaporization  and  chemical  reactivity  is  assumed 
to  hold  vertically.  This  term  is  strictly 
connected  with  tho  so  called  water  purification 

power  and  ,  as  a  first  approximation,  it  can  be 
assumed  constant.  Thus  the  overall  problem  is 
defined  by 


A*c 


dx** 


fc-r/c 


(2) 


A 2c-fc  (3) 

c(S)-y(S) 

c<  measured  at.  (x4,y,) 

Once  y(S)  has  been  estimated,  it  is  straight¬ 
forward  to  estimate  T(5). 

III.  FORMAL  SOLUTION 

Tho  solution  of  equation  (3)  can  be  written  as 


&*c,-k  A2c2-0 

c,(S)-0  c2(S)-y(S) 


Poraally 

Ci(Xo.y0)-K  fG,(xt,y0lx,y)dxdy 

where  Gj  is-the  Green's  function  of  the  first 
type  of  the  non -homogeneous  Laplace  equation 
with  homogeneous  DiVi chiefs  conditions. 

If  tho  conformal  mapping 

y^-^O.O) 

(i.e.  f  maps  S  onto  the  unit  disk  with  xq,  yo 
mapping  into  the  origin  , 

t~/(*o*yol**y)“/(2o»2)  )  is  kf.gwn,  we  can 
compute  Gj  from  the  relation  (1) 

Similarly  C2  can  be  computed  using  the  Poisson 
integral 


2* 


1  -r 


2 

0 


l*rg-2r0cos($-$0) 


Y($)d« 


r0S  1;  0S$S2n 

In  fact  if  the  confornal  napping  defined  above 
is  known,  we  can  write  y($)-  Y(/(s))  and  conputo 
u(r0i>0)  froa  tho  Poisson  integral  and  eventually 
obtain 

Thus  in  general  we  can  write  tho  foraal  solution 
of  equation  (3)  as 
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c(x.y  )-*  J  C,{s.yl5.»l)<i5di)*/Cn(?-y|S)Y(S)fS) 

Acre  both  Gj  and  Gji  are  fcsra  if  the  cci.foroal 
sapping  /:C-*{r|  |l^=||*£l>  is  fcwwn. 

Setting 

d(x.y)^e(x.y)~tjc,(x.ylf.rt)didii  (S) 

vc  obtain 

d(x,.y<)-J  C»,(x.y|S)y(S)dS  (6) 

IV.  ALGOFITEHIC  DFVELOrXOT 

There  are  now  two  major  problems  to  be  addressed. 
First  a  procedure  was  to  be  develop*vd  for  the 
numerical  computation  of  the  conformal  sapping 
and  consequently  of  the  d*  and  Cpx  as  defined 
in  equation  (6) . 

Secondly  eguation  (6)  has  to  be  solved.  Being 
a  Fredholm  integral  equation  of  the  first  type 
it  is  a  badly  posed  problem  in  the  sense  of 
Hadanard  and  needs  regularization  before  being 
inverted  [3]. 

A.  Cosrutation  of  the  conformal  carping 

We  shall  use  an  extresua  property  of  conforsal 
napping  to  carry  out  its  nuaerical  evaluation 
(see  (4]  for  the  demonstration) - 
Let  us  consider  the  set  of  sappings 

/  continuous  in  G.N/IK-.  /(p)-0.  /'(P)“  1> 
where  the  norm  ||.||.is  defined  as 

||/|i-max  |/(z)|-  max  l/(z)I 
zcG  zcBG 

and  the  norm  |.l  is  arbitrary. 
dG  indicates  the  closure  of  G  .  It  can  be  proved 
that  there  exists  a  capping  }  such  that 
r-||/||<-ll/ll  v/e//„ 

This  sapping  is  the  conforsal  sapping 

/:C-.D(r)-<U  Kl-r> 

It  I /(*)  I- (*«’(/<*)  Wm*(/(z)]) 

it  can  be  proved  that  r*l  and  }  is  the  sought 

for  conforsal  capping 

}:G-*{z\  llzll^l) 

A  suitable  approxisation  to  }  is  provided  by 

}(.z)-w,<  £a,ui, 

1-2 

with  and  the  coefficients  aj  can  be 

computed  fron 

/s<=J[/(z,)WmJ[/(z«)]Sn\  ,7. 

Vl-mln  /  v  ' 

We  can  either  choose  a  high  value  of  n  or  divide 
dG  into  a  suitable  nusber  of  eleaents  dGm  such 
that  ££.£<*6*  and  use  in  each  of  then  a  low 
polynonial  approxisation  to  }, 


Problem  (7)  is  a  gemeral  ziaiaiatica  problem 
subject  to  ccalisear  ecsstairts.  Thsogh  high 
disessiecai;  it  can  be  solved  vitbset  itsai- 
xccmtable  ccmvergeace  difficulties  csisg  either 
augmented  lagraagiaa  cr  sscccessive  quadratic 
programing  techniques  (  J5J  ,  |6J). 

3.  F.tralsrizitist  ' 

Equation  (6)  is  aa  ill-posed  problem  in  the  sense 
of  Hadanard,  i.e.  arbitrarily  ssall  changes  in 
the  data  can  give  rise  to  considerable  deviations 
in  the  estiaaticn  of  y(S)  .  Here  the  necessity 
of  regularizing  the  problem  ,  i.e.  of  picking 
among  all  the  solutions  y(S)  which  fit  the  data 
d;  vlthin  experimental  accuracy,  the  cne  which 
fits  additional  a-priori  informatics  best.  In 
the  literature  the  additional  information  is 
generally  connected  with  sene  extremum  criterion 
such  as  the  smallest,  the  smoothest,  the  most 
randomly  distributed,  etc-  "“here  is  no  apparent 
reason  for  applying  similar  criteria  in  this 
case.  Instead  an  approach  based  cn  fuzzy  set 
theory  is  employed-  This  approach,  which  has 
been  described  in  detail  elsewhere  [7],  makes 
it  possible  to  use  all  the  qualitative  infor¬ 
mation  available  to  regularize  the  problem  and 
obtain  the  solution  that,  to  an  assigned  level 
of  confidence,  is  consistent  with  both 
experimental  data  and  a-priori  information, 
only  a  concise  outline  is  given  here.  The 
additional  fuzzy  information  available  is 
written  in  the  form 

ft  1*4(y)1 

where  Bt  indicates  some  qualitative  and  /  or 
quantitative  property  of  the  expected  solution 
Y-  BtX-*K  [where  K  is  a  suitable  space  for 
the  set  of  properties  Bt.  Thus  for  instance  K 
might  be  the  set  (  strong  source,  moderate 
source,  negligible  source)  )  and 
/,  :  K~*L 

where  £-{x |x€S,0£ xZ  1) 

In  other  words  /*  is  a  subjective  membership 
function  which  assigns  a  numerical  value  in  the 
range  0  -  1  to  the  elements  of  the  set  K. 

More  complex  formulations  are  necessary  if  both 
f  and  B  are  more  complicated. 

The  case  in  which  f  is  characterized  by  four 
parameters  has  already  been  examined  by  the 
Author  and  a  complete  flow-chart  of  the 
corresponding  algorithm  has  been  described  in 
detail  [7]. 

Thus  the  overall  identification  algorithm  can 
be  written  as 

IMY“dll2"niin 

subject  to 

ft.  Dl»-/lt^l(Y)J-/|,»M 

where  fi,nin  and  *i,nax  arc  values  included  in 
the  range  (0,1)  and  depend  on  our  degree  of 
confidence  of  the  solution  having  certain 
properties.  Thus  the  theoretical  and  numerical 
structure  has  been  completely  outlined. 
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V.  CCtSCLuSICtSS 


A  sethod  f© r  monitoring  coastal  ccatribctiess 
to'tie  overall  pollution  using  cssantratlso 
i«2£crexiKts  isside  closed  basics  has  been 
described.  S isolated  data  a ad  ccsvenieut  fuzzy 
constraints  have  bees  used  to  validate  the  data. 
Store  insight  will  be  gained  frca  the  esc  of  real 
data  vith  the  possible  introduction  of  general 
ferry  bounds.  This  verb  has  been  started  and  is 
presently  underway  in  car  institutes. 
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CONTROL  SYSTEM  DESIGN’  IN  THE  ENVIRONMENT 

J. LStefaa  Issritcte.  61000  Ejs^jaca,  Jamcm  35,  Yugoslavia 


Abstract-  la  tie  week  *  *5e*  to  ccdrcJ  sysla  design  foereal  apyii- 
q&fl  »  presented.  Tie  way,  tie  design  is  carried  oxt,  a ad  proposed 
costrst  spies  (hopefxlly  akoreaSzed)  are  eatsraUy  tightly  dependent 
oo  bnra  njxxi*,  eg.  cbcproglse*  between  "exact*  ud  experi- 
oestal  approaches;  Exear  aad  sonlbeax  treatment,  etc.  Tie  decisions. 
Triad:  are  samly  is  designer’*  tawds,  harg  do<  only  oa  control  theory 
asd  designer's  expertise  (bxrrfedge  and  abcEtSes)  but  also  oa  teci- 
gVa!,  ec ooosk  asd  sodoJogjc  characteristics  of  tie  tariroeroext,  n 
which  a ad/or  for  wBch  control  tjrtea  is  designed.  Ia  opposition  to 
control  system  design  tie  functionality  of  simulation  tools,  wiki  are  ia 
application  example*  tie  oast  frequently  used  ia  a3  phases  of  costrol 
<fesig3,  is  much  more  odepeadestfroca  tie  environment  and  designer. 
Ia  tie  work  dercr 2>ed  aspects  are  discussed  through  modeling,  aaal- 
ycj  and  costrol  Jjitta  design  for  cue  of  several  sabprocesse*  ia  tie 
factory  which  produces  titanium  dioxide. 

I.  INTRODUCTION 

Tie  principal  activity  at  tie  Department  of  Computer  Automation 
aad  Costrol  is  devoted  to  research,  development  aad  applications  ia 
tie  area  of  automatic  (computer-based)  costrol  of  systems  aad  pro¬ 
cesses.  Tbs  orientation  expresses  tie  isles tios  to  cultivate  aad  ex- 
paad  knowledge  ia  tie  costrol  field,  bat  rasci  core  tie  seeds  aad  tie 
passivities  in  ocr  community  Through  tie  applications  we  try  to 
contribute  to  tie  development,  he.  we  solve  cos crete  problems,  siow 
aad  stress  tie  role  of  control  sjstecs  as  well  as  we  help  ia  forming 
staff  for  iadastry  departmenU. 

Ia  tie  paper  modehng,  analysis  aad  control  system  design  for  one 
of  19  ssbprocesses  in  tie  factory  wiki  prodaces  titanium  dioxide  are 
preseated.  Although  tie  fall  automation  of  this  prodcctioa  is  Be»  (as 
I  an  informed)  ray  iateatioa  is  sot  to  go  ia  details,  bat  to  disco.-, 
tie  environmental  aspects  ia  relation  to  costrol  design  methodology 
aad  simalatioa  tools.  While  the  methodology  is  rather  specific  aad 
subjective,  I  beSeve,  that  described  functionality  for  simalatioa  tools 
is  tie  miaimaa  for  saccessfal  work. 

n,  FROCKS  PESCRlPTtOy 

Tie  factory,  where  tie  prodactioa  of  titaaiam  dioxide  is  based  on 
the  hydrolysis  of  salfxie  solution  /I /,  was  built  approximately  15  years 
ago.  Bat  lately,  the  world  treads  towards  automation  which  ensures 
better  supervision  aad  higher  product  quality  convinced  progressive 
staff  ia  tie  factory  to  invest  in  tie  research.  Oar  department  was 
engaged  aad  till  now  we  have  implemented  a  control  system  for  re¬ 
moval  of  ferroas  sulfate  by  crystallization.  The  next  subprocess  we  are 
studying  aad  which  is  discussed  ia  this  work,  ts  the  redaction  of  the 
Omeaite  solution. 


Fig.  1.  Simplified  Technological  Scheme  of  the  Subprocess 


Tie  goal  of  tie  redaction  is  to  redace  aS  tie  ferric  components 
of  tie  Soeu^  Khtiot  to  tie  ferrocs  state  aad  to  convert  a  small 
proportion  of  the  titaaisa  dioxide  to  tie  trivafeat  condition.  Tie  goal 
breached  by  tie  reaction  with  raetaSc  iron.  The  reaction  is  exotermx 
aad  mass-transfer  limited.  So  a  specific,  technology  b  used,  where 
three  paraSd  reactors  with  (tesi)oaUh  character  (metallic  iron  should 
he  added  aad  washed  every  8  hoars;  every  8  days  a  reactor  should 
he  emptied,  washed  and  refilled)  are  connected  and  managed  in  a 
way  which  ensures  continuous  production.  The  simplified  technological 
scheme  of  the  redaction  b  shown  in  FSg.1. 

From  the  input/ontpnt  point  of  view  the  reduction  in  Fig.1  rep¬ 
resents  a  kind  of  continaocs  stirred  tank  reactor.  The  decanter  b 
included  only  to  spread  tie  input  and  the  recycling  flow  in  all  active 
reactors.  The  intermediate  tank  gathers  tbe  outputs  from  the  reactors 
and  enables  heat  removing  with  only  one  cooGcg  system.  The  reduced 
solution  b  taken  from  the  intermediate  tank. 

in.  MODELING 

The  model  base  was  taken  from  tbe  literature  /2/,  where  also  typical 
assumptions  and  simplifications  which  are  suitable  for  the  initial  stage 
axe  described,  e.g.  concentrated  parameters,  perfectly  mixed  solution, 
constant  density,  first  order  reaction.  With  the  additional  simpufica 
tioas,  e.g.  a  cyEadric  form  (with  a  spare  effective  base)  of  the  reactors 
and  unique  reaction  (the  input  solution  reacts  irreversibly  to  form  the 
reduced  solution  with  desired  concentrations)  which  takes  place  only  in 
the  reactors,  first  mathematical  nodd  was  derived.  It  consists  of  three 
differential  equations  for  each  reactor,  three  differential  equations  for 
the  intermediate  tank,  one  differentia!  equation  for  the  cooling  system 
and  one  algebraic  equation  for  the  decanter. 

As  usually  the  first  mathematical  model  did  not  agree  enough  with 
the  measurements.  It  had  to  be  improved,  but  corresponding  physical 
and  chemical  background  was  unknown.  The  project  was  time  and 
funds  limited,  so  there  were  also  very  small  chances  to  get  detailed 
information  about  the  process,  i.e.  to  repeat  the  measurements  with 
better  equipment  and  experiments.  So  the  modeling  process  relied 
only  on  engineers  reasoning  and  experimental  approaches. 

With  the  time  the  equations  of  the  first  model  were  improved  with 
dements  which  are  related  to  additional  nine  differential  equations, 
that  can  be  reasonably  argumeated  (e.g.  two  consecutive  reactions 
and  two  "paraxite*  reactions  slower  simplified  model  reaction,  delays 
because  of  the  transport  and  mixing,  heat  transmission  to  metal&c  iron 
and  vessels,  the  influence  of  tbe  measurement  equipment)  This  eagi 
neeriug  and  experimental  phase  in  modeling  was  much  more  difficult 
and  lasting  especially  because  the  final  model  consists  of  21  paramo 
lers  (6  known,  7  estimated  from  the  available  data  and  measurements 
and  8  unknown). 

In  my  opinion  the  modeling  process  was  not  very  specific  for  our 
environment.  For  the  initial  stage  of  control  system  design,  for 
projects  with  smaller  control  requirements  or  for  time  and  funds  lim¬ 
ited  projects  I  find  such  an  approach  quite  natural.  Obviously  also 
differences  can  occur.  As  the  designer  I  see  some  differences  even  in 
our  environment.  They  are  mainly  conditioned  with  the  ex’stent  mea¬ 
surement  equipment,  the  possibilities  to  experiment  on  the  object  and 
understanding  the  needs  for  experimental  results.  Finally  they  can 
occur  also  because  of  more  complex  reasons,  like  economic  status  and 
valuatiog  of  the  expert  work. 

IV,  CONTROL  PESIGtt 

Th.  *ho!e  control  design  was  influenced  by  the  environment.  At  its 
beginning  there  were  two  main  reasons.  First,  the  economics  dictated 
not  to  invest  too  much  in  the  equipment  what  was  a  severe  limitation 
already  on  the  measurement  equipment.  Therefore,  any  technologic 
changes  were  out  of  question,  although  the  original  factory  plan  differs 
from  the  realized  one  (in  the  plan  each  reactor  has  its  own  decanter,  in¬ 
put  and  recycling  flow).  Secondly,  the  factory  staff  was  rather  content 
with  existent  control.  While  at  the  same  time,  it  was  also  burdened 
too  much,  it  was  not  enough  acquainted  with  the  reduction  and  often 
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did  do!  think  enough  ia  tic  control  tense.  Tic*,  tie  identification 
of  control  goab  was  a  hard  job  aad  could  only  be  done  through  few 
iaterriews,  ini tsuvt  model  b eiuricsr  studies  aad  engineer  reasoniag. 
So,  after  relatively  long  period  three  global  costrol  goals  were  derived, 
Le.  redceed  solution  should  hive  desired  concentration,  tie  speed  of 
production  should  be  controlled  aad  the  temperature  ia  the  reactors 
(’reaction  temperatures")  should  be  constant. 

Fronf  the  costrol  goals  it  caa  be  eas3y  realized  that  tbe  existent 
costrol  scheme  is  not  quite  appropriate.  Namely,  only  the  first  global 
costrol  goal  caa  be  satisfactorily  fulfilled.  The  other  two  goals  axe 
partly  fulfilled  by  operators  (set  past  for  the  levd  control  ia  the  reac¬ 
tors;  influences  the  production  speed)  or  by  technologists  (set  pact  for 
the  temperature  control  ia  the  intermediate  task;  "indirect  control" 
of  the  reactioa  temperature).  Naturally,  such  man-in-tke-loop  con¬ 
trol  demands  quite  a  big  skill  and,  ia  any  case  slower  aad  less  precise 
control. 

Ia  the  first  design  step  the  control  scheme  was  simply  improved  by 
introducing  a  speed  production  controller  (oa  the  basts  of  the  produc¬ 
tion  flow  it  sets  the  set  point  for  the  level  control  in  the  reactors)  and 
reaction  temperature  controller  (on  the  basis  of  the  reaction  tempera¬ 
ture  it  correct*  the  set  point  for  the  temperature  control  in  the  inter¬ 
mediate  task).  This  approach  seemed  to  be  very  attractive  because  it 
reaches  the  main  goals.  However,  such  control  structure  is  not  good 
enough  because  it  "neglects*  the  conditions  ia  the  reactors  (although 
the  temperature  ia  the  intermediate  tank  and  desired  concentration  of 
the  product  are  reached  the  concentrations  in  the  reactors  and  reaction 
temperatures  caa  quite  deviate).  This  is  especially  significant  by  the 
concentrations,  while  the  reactioa  temperatures  are  tightly  connected 
to  them.  So  modified  control  structure,  which  is  shown  in  Fig.2,  r os 
derived.  Its  main  characteristics  axe: 

•  The  concentration  control  is  realized  for  each  reactor  separately. 

•  The  production  speed  control  is  not  needed  i’i  the  staaonary  pant 
(when  the  desired  concentration  is  reached) .because  the  inpat  and 
output  flows  are  the  same. 

•  Due  to  the  fact  that  output  flow  is  not  a  /unction  of  the  concentra¬ 
tion  in  the  intermediate  tank  bat  function  of  the  level  in  this  tank,  a 
proportional  controller  for  quality  assurance  is  included  (it  corrects 
the  production  speed  when  the  desired  concentration  is  not  reached; 
it  should  be  dominant  over  the  production  speed  in  noastadonary 
situations). 

•  The  reaction  temperature  control  is  the  same  as  in  the  first  design 
step. 

Simulation  results  show  that  all  global  control  goals  as  wefl  as  the 
desired  concentration  ia  each  reactor  can  be  satisfactorily  reached.  In 
deed,  some  problems  with  the  reaction  temperature  remain  (there  exist 
three  controlled  variables  and  only  one  control  inpat,  i.e,  the  temper¬ 
ature  in  the  intermediate  tank),  but  they  are  less  critical  because  the 
concentrations  in  the  reactors  are  under  control.  If  the  modified  con 
trol  struct  are  will  proof  itself  in  the  reality,  the  only  arguments  against 
it  could  be  of  the  environmental  character,  l.e.  problems  with  the  ac¬ 
ceptance  (e.g.  familiarity  and  the  necessity  for  handling  two  different, 
i.e.  computer  and  hardware  control  concept). 

V.  SIMULATION  TOOL 

For  the  modeling  and  control  design  procedure  our  simulation  tool 
SIMCOS  /3/  was  used.  It  is  a  CSSL  based  simulation  language  which 
is  permanently  improved  with  some  nonstandard  options,  like  features 
that  make  the  language  closer  to  hybrid  simulation,  features  that  en¬ 
able  simulation  of  digital  control  systems  and  simulation  in  real  time. 

With  no  regard  to  the  interactiveness  (which  is  not  so  important  for 
experienced  users)  a  good  simulation  tool  for  control  system  design 
should  be  as  functional  and  flexible  as  equation  oriented  simulation 
languages  are.  Besides  this  fundamental  demands,  which  are  fulfilled 
in  several  simulation  tools,  some  others  elements  are  necessary.  In 
the  field  of  methods  such  dements  are  real  time  simulation  (including 
possible  link  with  the  "foreign*  objects,  i.e.  with  the  process  for  ver¬ 
ification  of  control  strategy  or  with  the  computer  control  system  for 
verification  of  control  system),  optimization,  curve  fitting,  lineariza¬ 
tion  and  parameter  studies.  Ia  the  field  of  functionality  which  belongs 


to  "infrastructure"  the  necessary  dements  are  hierarchical  blocks,  ac¬ 
cessible  data  base  for  aad  from  other  CACSD  tods  (besides  time  histo¬ 
ries,  transfer  function  and  state  space  description  also  the  topological 
data  base)  and  an  opened  (as  much  as  possible)  architecture  to  indude 
specific  models,  methods  aad  experiments. 

I  find  proposed  functionality  ia  simulation  tods  independent  from 
the  divironment  and  examples.  Namdy,  almost  all  described  func¬ 
tional  dements  would  be' quite  useful  in  solving  the  discussed  aad 
other  examples  I  nude,  aad  I  befieve  also  in  any  other  control  system 
design. 


Fig.  2.  Modified  Control  Structure  for  the  Reduction 

VI.  CONCISIONS 

In  the  work  a  control  system  design  procedure  for  the  reduction 
of  the  ilmenite  solution  is  presented.  It  can  be  interesting  for  the 
community  which  is  envolved  in  the  automation  of  the  titanium  dioxide 
production.  However,  the  main  stress  of  the  work  is  given  to  the 
environmental  aspects  that  influence  the  control  design  procedure. 

In  the  Department  of  Computer  Automation  and  Control  we  try  to 
cultivate,  educate  aad  expand  control  knowledge  in  our  environment 
where  too  small  attention  was  devoted  to  control  problems  in  the  past. 
We  realize  this  intention  mainly  through  concrete  applications  where 
not  only  the  gap  between  theory  and  praxis  should  be  solved  but  also 
several  environmental  effects  should  be  considered.  The  status  in  the 
industry,  like  approaches  to  project  management,  funds  and  control 
knowledge,  has  quite  a  big  influence  on  the  control  systems.  This  fact 
can  also  be  confirmed  by  several  control  systems  which  have  been  im¬ 
plemented  in  our  factories.  Namely,  the  systems  often  do  not  function 
any  more  (the  desired  behaviour  can  not  be  reached  or  they  arc  even 
put  out  of  the  function)  or  they  are  not  quite  actual  nowadays  (like 
m  the  example  where  original  control  structure  does  not  consider  the 
production  speed  as  it  was  probably  not  so  important  in  the  past). 

The  paper  gives  also  a  view  to  the  functionality  of  simulation  tools 
used  in  control  system  design.  Described  simulation  capabilities  are 
quite  independent  from  the  environment  and  concrete  problem  and 
can  be  understood  as  a  hint  to  tool  designers  as  well  as  to  potential 
users. 
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APPLICATIONS  OF  GAME  THEORY  IN  LEGAL  PROCEDURE  OF  COURT 
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ABSTRACT  :  Legal  Procedure  of  Court  is  shown  as 
a  game  theory  problem  with’  Plaintiff  (Complainant) 
and  Defendant  (Accused)  as  Players.  Different 
witnesses,  courses  of  actions,  their  move*  etc.  are 
taken  as  the  strategies  for  these. player^  By  defining 
the  variable  'aij',  we  get  the  pay-off  matrix.  We 
solve  the  game  by  using  any  one\of '  the  methods.  And 
on  the  value  of  the  game,  the -judgement  can  be  passed 
accordingly. 

I.  INTRODUCTION 

A  game  is  simply  a  set  of  descriptive-Tules. -A 
play  of  the  game  includes  every  particular  instance  in 
which  the  game  is  played  from  begining  to  end.  The 
participants  are  the  players.  A  game  -  consists  of' a 
sequence  of  moves  of  the  players,  while  a  play 
comprises  a  sequence  of  choices  made  by  them.  The 
decisive  step  in  the  mathematical  treatment-  of  games 
is  the  normalization  achieved  by  introduction  of  pure 
strategies.  A  pure  strategy  As  a  plan  formulated  by  a 
player  prior  to  the  start  of  a  play,  which  covers  all 
of  the  possible  decisions  which  he  may  face  auring 
any  play  permitted  by  the  rules  of  the  -, game.  The 
expected  course  of  a  play  is  thereby  completely 
determined  by  the  selection  of  a  pure  strategy- by  each 
player  in  ignorance  of  that  chosen  by  any  other 
player. 

Zero  sum  two  person  games  are  played  by  two 
players  and  what  one  player  wins,  the  other  Closes.  A 
zero  sum  two  person  game  with  a  finite  number  of  pure 
strategies  can  be -described  by  an  mxn  matrix  A  a 
(a..),  where  a.,  is  the  payment  to  player  I  by  player 

I!  if  I  chooses  his  i  pure  strategy  and  II  chooses 

his  jth  pure  strategy.  Instead,  during  repeated  plays 
of  the  game,  he  can  be  better  off  by  mixing  his 
various  pure  strategies  randomly,  but  with  chosen 
relative  frequencies.  Subsequently,  a  passage  from 
such  a  'sample1  of  plays  of  the  game  to  the  underlying 
population  reveals  the  notion  of  a  mixed  strategy 
(mixed  actions)  as  a  probability  distribution  over  the 
set  of  pure  strategies.  Mixed  strategies  for  I  and  II 
will  be  denoted  by  x  and  y  respectively,  where 

X  =  (X,,  X2. - xm).  y  «  (y,,  y2. - y„) : 

xfo  o  for  all  i  and  o  for  all  j 
with  EXjSla^y. 

When  I  plays  a  mixed  strategy  x  and  II  plays  a 
mixed  strategy  y,  the  expected  payment  by  II  to  I  is 
G(x,y). 

All  discussion  of  zero  sum  two  person  games 
starts  from  the  Minimax  theorem  which  asserts: 

maxmin  G(x,y)  =  m^nm^x  G(x,y) 

The  unique  minimax  value  of  G  is  called  the 
value  of  the  game  A  and  will  be  denoted  by  v(A). 
*  • 

Mixed  strategies  x  and  y  such  that 
♦  * 

G(x  ,y)^G(x,y  )  for  all  mixed  strategies  x  and 
y  are  called  optimal  or  good  strategies  for  player  1 
and  player  II  respectively. 

II.  LEGAL  PROCEDURE  :  A  GAME  THEORY  PROBLEM 
II.  1  INTRODUCTION  :  In  legal  procedure  of  court  we 
can  make  use  of  'Game  Theory'.  By  this  approach  the 
legal  advisors  and  magistrates  may  have  sound  and 
mathematical  base  in  the  course  of  formulating  the 
judgement.  Sometimes  judgement  is  on  discretionary 
powers  of  judges  and  it  is  possibility  of  confusion  for 
them.  But,  I  hope,  if  we  apply  game  theory  approach 
for  court  procedure,  judges  may  have  easier  and  sound 
base  in  passing  the  judgement.  Also  legal  advisor  may 
give  more  stress  on  certain  point  to  prove  the  issues 


which  are  more  important  for  his  client  benefit. 

In  court  procedure,  - we  have  - 

1.  Two  parties  which  are  treated  as  players; 
Plaintiff  (P)  and  Defendant  (D). 

2.  Both  these  parties  have  conflicting  interests. 

3.  Each  party  has  a  finite  set  of  moves-~i.e.  finite 
set  of  strategies. 

4.  An  issue  in  favour  of  P  is  considered  as  an -issue 
against  D  and  vice-versa. 

Here  we  assume  player  P  as  a  maximising  player  and 
player  D  as  a  minimising  player;  thus  player  P 

washes  to  maxmin  G(x,y)  and  player  D  wishes  to 
x  y 

minmax  G(x,y). 

IK 2 ’notation  AND  DEFINITIONS  : 

Let  strategy  sets  for  P  and  D  be  denoted  by  x 
and  y  respectively.  We  define  the  utility-matrix  as 
follows. 


D 

12  j  n 


a,.  :  is  the  utility  for  player  P  when  P 

V  tU 

chooses  his  i  strategy  and  D  selects 
jth  strategy. 

(i  a  1,2, —  m;  j  a  1,2,  — n) 

m,  n  are  possible  strategies  for  player 

pand  for  player  D  respectively. 

Conventional  game  theory  problems  are  either  to 
search  for  a  saddle  point  or  use  of  mixed  strategy 
method  for  selecting  different  strategies  for  different 
players.  In  real  world  problems  however,  the  entries 
in  the  pay-off  matrix  are  linguistically  and 
qualitatively  defined.  In  legal  procedure  the  case  is 
the  same.  In  these  type  of  situations  it  is  necessary 
to  model  the  problem  with  an  imprecise  pay-off  table. 
In  legal  procedure  context,  to  construct  the  utility 
matrix,  one  may  take  help  of  the  previous  records 
and  experiences. 

As  a  particular  case,  here  we  redefine  the 
variable  a^  as  follows 

a  1,  if  an  issue  is  passed  by  the  magistrate 
in  favour  of  P, 

when  P  adopts  i**1  strategy  and  D  adopts 
l"1  strategy; 

a,,  =  0,  if  an  issue  is  crossed  I.e.  effect  of  i., 

^  strategy  of  P  is  nullified  by  the  jxn 
ftrategy  of  D; 

and  a,,  =  -1  if  an  issue  is  passed  against  P,  i.e. 
^  passed  in  favour  of  D. 

In  this  manner,  we  can  obtain  a  utility  matrix 
and  it  is  treated  as  a  pay-off  matrix  as  in  a  usual 
game  problem. 

After  obtaining  pay-off  matrix,  we  can  solve  the 
game  by  using  any  one  of  the  methods.  Let  v(A)  De 
the  value  of  game.  This  value,  v(A)  will  be  very 
useful  for  magistrate  in  coming  to  judgements. 
Magistrate  may  have  the  following  different 
alternatives  depending  upon  the  value  of  v(A) 
i)  If  v(A)^  0,  the  judgement  is  passed  in  favour  of 

P  i.e.  plaintiff  wins  the  case; 
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ii)  If  v(A)  *  0,  the  decision  depends  upon  the 
discretionary  power  of  judge  and 
iix)  If  v(A)  <  0,  judgement  -is  passed  against  the  P 
and  it  will  be  in  favour  of'D  i.e^  Defendant  wins 
the  case. 

Ill;  REMARKS 

Here  it  is  seen  that  the  pay-off -matrix  A  has  only 
three  types  of -elements  viz.  -1,0  and  +1.  It  implies 
that  all  strategies  have  equal  effect  upon  the  decision 
of  magistrate.  But  it  is  seen  that  different  strategies 
may  have  different  effects  upon  decision,  e.g.  some 
evidence,  witnesses,  situations,  acts,  etc.  are  more 
powerful  than  the  other.  To  take  into  account  this 
fact,  we  may  use  mixed  strategies.  Different 
strategies  may  be  selected  with  different 
probabilities.  If  a  particular  strategy  Is  more 
powerful  in  relation  to  decision  making,  then  one  may 
assign  higher  probability  corresponding  to  it. 

This  approach  is  also  very  helpful  to  legal 
advisor  in  proving  his  client's  side.  He  can  find  out 
the.  more  important  strategies  for  his  client  and  can 
give  more  stress  on  the  same  to  prove  his  point.  Also, 
it  is  assumed  that  strategy  sets  x  and  y  for  players 
P  and  D  are  finite  which  is  obvious  otherwise  there 
will  not  be  an  end  to  the  case. 


IV.  CASE  STUDY 

For  an  illustration,  let  us  consider  the  case.  The 
reference  is  as  follows  :  Reg.  Civil  suit  No.  126  of 
1981.  In  the  court  of  Mr.M.H.Baig, 

B. Sc. (Hons)  LL.B.II 
Joint  Civil  Judge. 

J.D.Barshi  at  Barshi  }  Ex. No. 41. 

Dattatraya  Nagnath  Chikhale,  Age  55  Plaintiff 

Against 

Navanath  Rama  Kumbhar,  Age  40  years  Defendant 
In  this  case.  Plaintiff  and  Defendant  had 


following  different  strategics. 

Plaintiff 

s  Strategies  : 

Sr. No. 

Description  of  Strategy 

1. 

Ownership  of  suit  site 

2. 

Possession  of  suit  site 

3. 

Ownership  in  tree 

4. 

Raise  of  obstrution  by  defendant 

5. 

Permission  from  municipal  corporation  to 
cut  down  the  tree. 

Defendant's  Strategies  : 

Sr. No.  Description  of  Strategy 

1.  Tenancy 

2.  Right  or  interest  in  tree  on  suit  site 

3.  Possession  of  suit  site 

4.  Plaintiff  obtained  the  permission  to  cut 

down  the  tree  by  misleading  the 

corporation. 

With  these  strategies  and  opinions  of  Hon. Judge,  we 
obtain  the  following  pay-off  matrix. 

Defendant 

12  3  4 


1110 
2  110 

Plaintiff  3111 

4  110 

5  111 


Solution  of  the  problem  : 

We  solve  the  game,  whose  pay  of  matrix  is  given 
as  above. 


Defendant 

1 

2 

3 

* 

Row  min, 

1 

1 

1 

0 

1 

(0) 

2 

1 

1 

0 

1 

(0) 

Plaintiff 

3 

1 

1 

1 

1 

(1) 

4 

1 

1 

0 

1 

<o; 

5 

1 

1 

1 

1 

(i) 

column  -  max. 

Ld) 

(1) 

(1) 

(1) 

Here 

max. 

,  (Row  min)  = 

max 

mm  (a  .)  =  1 

- 1 

1 

13 

and  min 

(column  max . 

)  =  mj 

in  max  (a„) 

=  1. 

Which  implies 

max.  min  (a..)  - 
>  3  ij 

min 

J 

max 

(Ii) 

>  =  1 

Hence,  the  game  has  pure  value  v(A)  =  1. 

And,  it  is ■  seen  that  v(A)  'y  0 


Hence,  as  per  our  criteria  the  judgement  is  passed  in 
favour  of  plaintiff  i.e.  in  favour  of  Mr. Dattatraya 
Nagnath  Chikhale  and  then  decree  is  passed 
accordingly. 
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Fortran  Codes  for  Computing  the  Discrete 
Helmholtz  Integral  Operators 

S.  M.  Kirkup 


Subroutines  H02LC,  H03LC,  H03ALC.  The  form  of  the 
three  Fortran  subroutines  are  listed  in  this  section  The  quantities 
•that  the  subroutines  compute  are  determined  by  replacing  T  by  the 
appropriate  element,  ft  by  the  unit  function  and  G  by  the  appropri¬ 
ate  Green's  function  in  (2)-(5).  Results  from  the  application  of  these 
subroutines  to  test  problems  are  given  in  the  reference. 


Address.  Department  of  Mathematics  and  Computer  Science, 
University  of  Salford,  Salford,  UK. 

Abstract.  In  this  paper  Fortran  Subroutines  for  computing  the 
discrete  form  of  the  Helmholtz  Integral  Operators  for  two-dimensional, 
three-dimensional  and  three-dimensional  axisymmetric  problems  are 
described.  The  subroutines  are  useful  in  the  solntion  of  Helmholtz 
problems  via  boundary  element  and  related  methods. 

Introduction  The  Fortran  subroutines  described  In  this  paper 
are  useful  in  the  implementation  of  integral  equation  methods  for  the 
solution  of  the  general  two-dimensional,  the  general  three-dimensional 
and  the  axisymmetric  three-dimensional  Helmholtz  equation 

?Vp)+iv/<p)  =  o  (i) 

which  governs  y?  in  a  given  domain  and  it  is  a  complex  number  termed 
the  wavenumber.  The  subroutines  compute  the  discrete  form  of  the 
integral  operators  L\,  Mk,  Ml  and  Nk  that  arise  through  the  applica¬ 
tion  of  collocation  to  integral  equation  reformulations  of  the  Helmholtz 
equation.  Expressions  for  the  discrete  integral  operators  are  derived 
by  approximating  the  boundaries  by  the  mo3t  simple  elements  for  each 
of  the  three  cases.  The  elements  are  illustrated  in  the  figure. 


Straight  IIm  elenent  Triangular  elewnt  Conics!  element 


The  subroutines  are  named  H02LC,  H031C  and  II03ALC  and 
the  parameters  to  the  subroutine  take  the  following  form: 
SUBROUTINE  HO  2/3/3A  LC  ( 

IFawnum&er  (generally  complex), 

Point  (p  and  the  unit  normal  np  associated  with  p), 

Geometry  of  element  (vertices  which  define  element  and  unit  normal ), 
Quadrature  rule  (abscissae  and  tr (ights  for  computing  integral), 
Choice  of  discrete  form  required  (lk,  Mk,  Ml  and/or  Nk), 

Answers  (the  values  of  the  chosen  discrete  forms)  ) 

The  Helmholtz  Integral  Operators.  The  Helmholtz  integral 
operators  are  denoted  Ik,  Mk,  Ml  and  Nk  and  they  are  defined  as 
follows: 


(E»f*)r(p) 33  /  G»(p,q)i‘(q)<tft, 
r 

P) 

{M*/i}r( p)  a  J  |~(p,q)  M 
r  * 

(3) 

{M*V)r(p) 3  j  C*(p,q)  ft{q)  dS1f 

*  r 

(«) 

{AV.)r(p)*iL/  |^(p, 

(5) 

where  T  is  a  surface,  nf  is  the  unit  outward  normal  to  the  bourdary 
at  q  and  /i(q)  is  a  bounded  function  defined  for  q  £  S.  The  vector  n, 
is  a  unit  normal  associated  with  p.  <7t(p,q)  is  the  frce-space  Green’s 
function  for  the  Helmholtz  equation:  <3*(p,q)  =  jT/^r)  in  two 
dimensions  and  G*(p,q)  as  in  three  dimensions  where  r  =  |rj, 

r  =  p  -  q  and  i  is  the  unit  imaginary  number. 


*••••«•«.**•  Sutro*,tlr'*  H02LC  by  Stephen  Xlrkup  Dec  1**0  * 

««Wte»  the  discrete  font  ot  the  2-dlmensionel 
Helmholtz  integral  operators  IX.Xk.Kkt,  end  Me.  Hence  the  subroutine 
is  useful  Ui  boundary  element -type  methods  tor  tie  solution  of  Helmholtz 
problems.  .  . 

The  subroutine  has  the  fora: 

SUBRCOTIKtHClILCtK.P.ltOaXP.QA.CS.HCaWS.QPOHfL.KILXNO.NQ.AQ.VO. 

•  QLK.QKK.QXXT.QNX.DXSIK, DXSMX, DXSXKT.DISNK)  '  ' 

XEAL**  EPS 
XHTCC «  XU.UHNQ 
LOGICAL  CHECK 

PARAMETER  (EPS-  ,K >  .CHECK-  ) 

The  PARAKETtR  statement 

reel  IPS,  Integer  XU,  Integer  LXxkq,  loglcel  CHECK. 

The  parameter*  to  the  subroutine 

rt*1  reel  QA<2>.  reel  08(2). 

reel  HOBtoW,  loglcel  QPONtL,  Integer  KAXHQ,  Integer  HQ, 
reel  AQ(haXhq),  reel  WQ(KAXNQ),  loglcel  QI*»  loglcel  QKX, 

1CH,lCDic2rXT'  loglcel  qxk,  reel  DXSXX,  reel  CUSKK,  reel  Disxrr, 


Subroutine  HOJLC  by  Stephen  Kirkup  Deo  mo' 

* 

This  subroutine  computes  the  discrete  fora  of  the  3 -dimensional 
Kelaholtz  Integrel  operator*  u.Hk.Hkt.rend  Nk.  Hence  the 
subroutine  la  useful  in  boundary  element-type  methods  for  the 
solution  of  Kelaholtz  problems. 

The  subroutin''  he*  the  font 


SCMCUTINE  KOJLC(K,P,HO»(P,OA.Q»,QC.>K«WO,QPO)«L,KAX»Q.HO, 
•  XQ.YQ.WQ.QtK.OKK.OKCT.CW.DISUr.DISHK.DISWT.OlSHK) 

INTEGER  XU 
REAL**  CPS 
LOGICAL  CKtCK 

PARAMETER  (XO-  .IPS-  .CHECK-  ) 


The  PARAMETER  statement 

reel  XPS,  Integer  XU,  loglcel  CHECK. 


The  pereaetere  to  the  subroutine 

complex  K,  reel  P()J.  reel  HOJtMP(J).  reel  QA(J),  reel  Q»(J), 
reel  QC<3>,  reel  HORKQO).  loglcel  QFONEL,  Integer  haXnq,  reel  HQ, 
reel  XQ(XAXHO),  reel  YQ(KAXHQ),  loglcel  QLX,  loglcel  QMK, 
loglcel  Oxxt,  loglcel  qnx,  reel  Disix,  reel  oxskk,  reel  DISKXT, 
reel  DiSHK. 


Subroutine  HOJALC  by  Stephen  Xlrkup.  Dec  Xt*0 


This  subroutine  computes  the  diecrete  fora  of  the  3-dlmenelonal 
exlsyaaetrlo  Kelaholtz  Integrel  operators  l*,Kk,Mkt,  end  Nk  Hence  the 
subroutine  Is  useful  In  boundary  element-type  methods  for  the 
solution  of  Kelaholtz  problems. 

The  subroutine  has  the  formi 

SUtROUTIHE  HOJALC(K,  P.NCP-HP, QA.QR.NOftMQ.QFONtL, 

*  KAXNCQ.NCQ.ACQ.WCQ.MAXNTQ.NTQ.ATO.VTO. 

«  OLX.QKX.QMKT.QNX.DISUC.DlSKK.DrSHKT.DISNX) 

PARAMETER  (EPS-  ,IU-  .LlHHCQ-  .LIKVTQ-  .CHICK-  ) 

The  PARAMETER  Statement 


reel  EPS,  Integer  XU,  integer  LX HUGO,  Integer  LIKNTQ,  loglcel  CHECK 
The  parameters  to  the  subroutine 

complex  K.  reel  P(2),  reel  NOKKP(l).  reel  QA(2),  reel  08(2), 
reel  M0RMQ(2),  loglcel  OPONZL,  Integer  MAXNOO,  Integer  NCQ, 
reel  ACQ(MAXNCQ),  reel  VCQ(KAXVGQ) ,  Integer  haxhto.  Integer  HTQ, 
reel  ATQ(KAXHTO),  reel  VTO(KAXNTQ).  logical  QIX,  loglcel  qmx, 
logical  QKXT,  logical  QNX,  reel  DISI*.  reel  DXSHX,  reel  WSKKT, 
reel  DISNX. 
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Abstract-  In  this  paper,  we  consider  a  large-scale  non -symmetric 
problem  that  occurs  in  structural  mechanics  combined  with  aero-, 
dynamics  ;  we  present  in  particular  the  Arnolds  method  used  in 
conjunction  w  ith  a  Chebyshev  iteration  technique,  for  computing 
a  few  eigenvalues  of  a  large  real  non-symmeiric  matrix  (ordered 
by  their  imaginary  parts)  and  their  associated  eigenvectors.  The 
work  we  report  has  been  conducted  at  CERFACS  in  cooperation 
with  AEROSPATIALE  Aircraft  Division  and  IBM  France  whose 
supports  are  gratefully  acknowledged. 

Introduction 

Most  of  the  large-scale  nonsymmetric  eigenvalue  problems  that 
arise  m  various  research  or  engineering  fields  like  mechanics  or 
aerodynamics  are  related  to  the  stability  analysis  of  a  physical 
system.  At  present,  the  matrices  that  appear  in  the  largest  cal¬ 
culations  anse  through- the  study  of  evolutionary  problems  like 
the  Navier-Stokes  equations  .or  .the  differential  systems  arising 
in  structural  dynamics'.  As  an  example,'  the  modelling  of  the 
behaviour  cf  a  plane  in  flight  leads  to  a  differential  equation  in, 
time,  which  is  then  discretised  by  a  finite  element  method  The 
computation  of  the  modes  cf  this  physical  system  yields  a  non- 
symmetne  eigenvalue  problem  of  the  form  . 

Ax  =s  Ax,  where  A  6  Adnxn(R-)»  A  €  C  and  x  €  C*\ 
where  the  order  of  the  matrix  A  can  reach  5  x  1(P. 

1  Why  can  nonsymmetric  problems  be 

difficult  ? 

The  matrices  arising  in  stability  studies  are  often  non-normal. 
A  large  departure  from  normality  ||AA*  -  A*A|Jf  leads  to  an 
ill-conditioned  eigenbasis  which  is  always  extremely  difficult  to 
compute.  The  matrix  under  consideration  may  also  have  multi 
pie  or  defective  eigenvalues.  Such  eigenvalues  are  not  necessary 
ill-conditioned,  but  the  calculation  of  their  eigenbasis  requires 
special  methods  like  the  block*  Arnoldi  or  the  Newton  method 
(1).  When  a  few  eigenvalues' and  eigenvectors  of  a  large  nonsym¬ 
metric  matrix  A  are  required,  one  can  use  a  projection  method 
on  a  Krylov  subspace  like  the  Arnoldi  method  or  the  nonsym 
metric  Lanczos  method.  The  spectrum  of  the  reduced  problem 
is  an  approximation  to  a  part  of  the  spectrum  of  'he  original 
problem,  matrix,  which  is  interesting  from  the  point  of  view  of 
storage.  The  Arnoldi  method  is  more  stable  numerically,  but 
produces  a  Hessenberg  matrix  //,  whose  storage  increases  the 
memory  required.  However,  if  the  order  of  II  is  kept  moder¬ 
ately  large,  the  storage  memory  may  not  be  overwhelming.  In 
order  to  retain  good  numerical  stability  pioperties,  we  propose 
a  procedure  based  on  the  Arnoldi  method  with  partial  rcorthog- 
onalisation,  combined  with  a  Chebychev  acceleration  technique 
12)  (6)- 

2  The  Arnoldi-Chebychev  method  with 

redrthogonalisation 

2.1  The  problem  of  the  loss  of  orthogonality 
Let  us  recall  the  incomplete  Arnoldi  algorithm.  If  A  denotes 
a  real  or  complex  nonsymmetric  matrix  of  order  n,  we  seek  a 
rectangular  n  x  m  (m  <  n)  matrix  Vn  such  that  V^AVn  = 
II m  (where  //„  is  a  Hessenberg  matrix  of  order  m)  and 
is  the  identity  matrix  of  order  n  :  the  columns  v»,  ,  tv,  of 

Vm  form  an  orthonormal  basis  of  the  Krylov  subspace  Kn  — 
span{ui,  At>il„«MA,n',Vj}.  This  basis  is  built  recursively  by  the 


following  algorithm,  using  a  given  vector  u.  The  projected  Hes¬ 
senberg  matrix  Hm  is  obtained  at  the  same  time . 

Givcni,‘  =  R? 

fory=rltom  compute* 

hj  =  V}mAvj ,  where  kj  =  (hn ...hki) 
x,  =  Au,  -  VjV/Av,  =  (/„  -  v3v;)AV} 


This  is  an  implementation  of  the  Gram-Schmidt  algorithm  at 
the  step  y,  the  vector  Av}  is  orthonormalised  with  respect  to  the 
columns  of  V3.  It  i«  well-known  that  the  Gram-Schmidt  process 
can  become  unstable  if  these  vectors  are  almost  linearly  depen¬ 
dent.  This  phenomenon,  which  also  appears  in  other  methods 
can  implies  incorrect  eigenvalues  approximations,  by  introducing 
“spurious”  eigenvalues  during  the  computation.  We  avoid  this 
difficulty  by  using  a  recursive  algorithm  with  partial  reorthogo- 
nalisation  (3]  Since  the  dimension  m  of  the  Krylov  subspace  Km 
is  kept  relatively  small  (<  1500),  a  standard  procedure  from  LA- 
PACK  is  used  to  compute  the  eigenvalues  and  the  eigenvectors 
of  //m,  denoted  respectively  by  A»,,  ..,Am  and  by  &»•  »•»$■»•  We 
then  form-the  vectors  x,  =  Vmjr,,  which  are  approximations  to 
the  eigenvectors  of  A  associated  with  Aj  (for  1  <  i  <  m).  The 
normed  residuals  satisfy  the  relation 

||/4i,  -  A.x.ll  =  J.nnleJS.1,  where  em  =  (0,..  ,0,1)T. 

Although  they  aie  mathematically  equivalent,  we  will  see  in  sec¬ 
tion  3  that  the  duecl  and  the  Arnoldi  lesiduals  respectively  de¬ 
fined  by 


=  Pji  “r, =  W’ 

may  not  have  the  same  numerical  behaviour. 

2.2  The  iterative  Arnoldi-Chebychev  method 

Let  a  denote  the  set  of  the  r  wanted  eigenvalues.  Let  r  denote 
the  set  of  the  remaining  unwanted  eigenvalues  The  spectrum  of 
IIm  is  sp(II)  =j  o  U  t  The  r  desired  eigenvalues  can  be  either 
the  dominant  eigenvalues,  or  the  r  eigenvalues  with  greatest  real 
or  imaginary  parts.  The  iterative  Arnoldi  Chebychev  method  is 
an  hybrid  algorithm,  first  proposed  by  Saad  |6)  The  incomplete 
Arnoldi  algorithm  is  lestarted  with  a  new  vector  of  the  form 
x*  -  p*(A)*o  (where  p*  is  Chebychev  a  polynomial  of  degree 
k  -  I),  m  order  to  increase  the  rate  of  convergence  of  the  r 
eigenvectors  x,  associated  with  the  set  <r.  These  vectors  can  be 
associated  either  with  the  r  dominant  eigenvalues,  or  with  the  r 
eigenvalues  with  greatest  real  or  imaginary  parts.  The  vector 
is  taken  as  a  linear  combination  of  the  r  vectors  x,. 

Because  ot  the  three  terms  recurrence  satisfied  by  the  Chebychev 
polynomials,  the  computation  of  the  vectoi  zk  is  not  expensive 
in  terms  of  an  arithmetic  operations  count. 

3  Numerical  results 

We  use  the  following  notation  . 

•  k  is  the  degree  of  the  Chebychev  polynomial. 


•  m  Is  the  dimension  of  the  Krylov  subspace,  i  e.  the  order 
of  the  projected  Hessenberg  matrix  Hm. 

•  n  is  the  order  of  the  matrix  A. 

•  r  is  number  of  required  (“wanted”)  eigenvalues  (contained 
in  the  set  a). 

•  An  Arnoldi-Chebychev'step  (abbreviated  to: “AC.  step”) 
denotes  the ,  computation  of  a  Hessenberg  matrix  Hm  in 
the  restarted  Arnold)  algorithm. 

The  following  example  comes  from  a  model  of  a  plane  imflight. 
The  interesting  modes  of  this'  system  are1  described  by  complex 
eigenvalues  whose  imaginary  parts  lie  in  a  frequency  range  chosen 
by  the  engineer.  We  demonstrate  results  for  the  computation  of 
eigenvalues  which  have  the  largest  imaginary  parts.  The  matrix 
A  is  sparse  with  a  block-structure.  For  large  n,  the  matrix 
has  many  multiple  and  possibly  defective  eigenvalues:  Figure  l 
shows  the  general  form  of  the  spectra  of  these  matrices  —  such 
spectra  are  typical  in  stability  analysis.  The  Arnoldi*  Chebychev 


Aero  example  on  Convex  C-220  (k  5=  200  m  = 

20  n  =  2000) 

Eigenvalue 

AC 

Residuals 

CPU  time 

index 

step 

Arnoldi  direct  _ 

ts) 

1 

2 

1 

0.731  D-01  0.731  D-01 

0.193  D-fOO  0.193  D-fOO 

0.69 

1 

2 

2 

0  305  D-frOl  0.305  D+Ol 
,  0.467  D-03.  0.467  D-03 

0.67 

1 

2 

3 

0  233D-0S  0,125  D-07 

0  607  D-07  0  481  D-05 

0.75 

Table  1:  Aero  example  on  Convex  C-220  with  parameters,  k  = 
200,  m  sa  20,  n  =  2000  and  r  =  4 

method  is  found  to  be  very  efficient  when  the  order  of  the  ma¬ 
trix  is  approximately  less  than  2000.  Ibble  4  shows  some  results 
for  a  matrix  of  order  2000  with  a  ratio  a  =  ^  and  r  a  4,  and 
where  the  stopping  criterium  is  defined  to  be  art  Amoldi  residual 
less  than  or  equal  to  10"7.  In  this  case,  we  note  that  the  direct 
and  the  Arnoldi  residuals  still  agree.  Unfortunately,  when  we 
increase  the  order  of  the  matrix,  we  obtain  unstable  results  — 
neither  the  Arnoldi  residuals  nor  the  direct  residuals  are  correct. 
Note  that  the  departure  from  normality  increases  rapidly  with 
the  order  n,  as  shown  on  Figure  2.  The  eigenvalues  are  sorted 
by  decreasing  imaginary  part  and  are  represented  by  their  in¬ 
dex  on  the  horizontal  axis.  The  Arnoldi  residuals  that  lake  the 
value  zero  (at  machine  precision)  are  not  plotted.  The  non-zero 
Arnoldi  residuals  are  plotted  as  functions  of  the  eigenvalues  in¬ 
dex  on  the  vertical  axis.  When  n  =  8000,  m  =  800  and  k  s  200, 
the  direct  residuals  are  also  roughly  constant  (order  10,J),  whilst 
the  Arnoldi  residuals  have  a  very  oscillatory  behaviour  as  shown 
on  Figure  3. 


Figure  1:  The  spectrum  of  the  matrix  Aero  for  n  =  1500 


Figure  2:-  Departure  from  normality  of  the  Aero  matrix  with  the 
order  of  A  variable  from  90  to  925  with  Ni  =  "A  ^ 


^  BA *8,  ' 


Figure  3:  Arnoldi  residuals  for  the  first  Amoldr-Chebychev  step 
with  the  parameters  t  =  230,  m  =  800,  n  a  8000,  and  r  =  4 


conclusion 

•  above  experiments  demonstrate  that  the  departure  from 
mality  is  an  essential  parameter  when  computing  c.genvab 
of  nonsymmetnc  matrices.  Further  work  is  being  carnal 
to  understand  the  mstab.hties  described  above,  notably  the 
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Los  Angeles,  CA  90089-1191  U.S.A.  Moffett  Field',  CA  94035  U.S.A. 

where  Nr?®  is  (3)  calculated  with  one  of  the  contributing  velocity 
Abstract- Direct  numerical  simulations  of  homogeneous  turbu-  fields  truncated  to  P  and  the  other  to  Q.  Details  of  such  calculations 
lence  are  used  to  analyze  energy  transfer  among  scales  of  motion  in  axe  provided  in  {2],  For  homogeneous  turbulence  the  regions  P  and 
spectral  space.  A  physical  space  represertation  of  such  a  spectral  are  usually  chosen  as  spherical  wavenumber  bands.  Similarly  trun- 

energy  transfer  is  devised  and  applied  to  the  analysis  of  an  eddy  eating  velocity  u*(k)  in  (4)  to  a  spherical  shell  K  results  in  a  quantity 

viscosity  with  a  sharp  spectral  cut-off.  TKPC(k)  which,  after  averaging  over  K ,  is  interpreted  as  the  energy 

transfer  to  the  band  K  resulting  from  nonlinear  interactions  of  scales 
-1  Introduction  in  K  with  scales  in  V  and  Q. 


Statistically  homogeneous  turbulent  flows  are  conveniently  repre¬ 
sented  in  spectral  (Fourier)  space.  In  such  a  representation  dynami¬ 
cally  important,  elementary  nonlinear  interactions  involve  three  dis¬ 
tinct  modes  with  their  wavenumbers  forming  a  closed  triad.  Under¬ 
standing  these  interactions  is  of  paramount  importance  in  the  theory 
of  turbulence  since  essentially  all  turbulence  closures  rely  on  assump¬ 
tions  about  the  nature  of  the  nonlinear  interactions.  Recently,  using 
results  of  direct  numerical  simulations  (DNS)  Domaradzki  and  Ro- 
gallo  (1)  (2]  analyzed  the  energy  transfer  in  homogeneous’  turbulence. 
They  concluded  that  beyond  the  energy  containing  range  the  energy 
was  transferred  among  scales  of  motion  similar  in  size  but*  that  the 
interactions  responsible  for  this  local  energy  transfer  were  nonlocal 
in  h-  space.  The  importance  of  such  nonlocal  triadic  interactions  in 
the  evolution  of  turbulent  flows  has  been  confirmed  by  Yeung  and 
Brasseur  (3)  who  also  provided  analytical  arguments  (4]  supporting 
conclusions  drawn  from  DNS. 

Despite  the  usefulness  of  spectral  representation  as  a  theoretical 
and  numerical  tool  in  turbulence  research,  various  quantities  (veloc¬ 
ity,  energy,  vorticity,  etc.)  in  the  physical  space  often  provide  a  more 
natural  description  of  turbulent  flows.  Thus  it  is  of  interest  to  have 
the  physical  space  representation  of  the  nonlinear  transfer  processes 
that  dominate  the  spectral  space  dynamics.  One  such  representation 
has  been  proposed  by  Domaradzki  et  al.  (5).  In  this  paper  we  discuss 
other  possible  ways  of  representing  detailed  spectral  energy  transfer 
in  the  physical  space. 

2  Intcrscalc  Energy  Transfer  in  Spectral  Space 

The  equation  for  the  energy  amplitudes  £|u(k)|*  e  ^u*(k)u*(k) 
Is: 


3  Interscale  Energy  Transfer  in  Physical  Space 

Inverse  Fourier  transform,  signified  by  tilde,  of  JVn(k)  is  the  sum 
of  the  convection  and  pressure  terms  in  the  Navier-Stokes  equation 
in  the  physical  space  coordinates 


tfn(x)  =  -«.(x) 


flfin(x)  dp(x) 

6Z{  ~  fan  ' 


(5) 


Similarly,  using  ^T>c(k)  we  can  define  its  physical  space  coun¬ 
terpart  c(x)  as  well  as  $**®(x)  which  is  the  inverse  Fourier 
transform  of  Nr?®( k)  truncated  to  the  band  K.  N?®{x)  can  be 
interpreted  as  the  contribution  to  the  rate  of  change  of  the  velocity 
field  u„(x)  at  a  point  x  made  by  the  nonlinear  interactions  involving 
modes  from  the  bands  V  and  £.  Note  that  these  interactions  influ¬ 
ence  all  modes  k  which  can  form  a  triangle  with  two  other  modes 
such  that  one  is  in  P  and  the  other  in  Q.  $?*c(x)  represents  a  con- 
tnbution  to  the  rate  of  change  of  £t*(x)  which  is  made  by  all  modes 
from  K  interacting  nonlinearly  with  modes  in  V  and  Q. 

The  rate  of  change  of  the  turbulent  energy  e(x)  =  ^iin(x)tin(x) 
at  a  point  x  caused  by  the  nonlinear  interactions  is 


Mi  =  6.(x)l»„(x).  (6) 

Our  goal  is  to  decompose  (6)  into  contributions  from  the  interac¬ 
tions  among  modes  from  predefined  wavenumber  bands  KyPt  and  Q 
i  e.  to  find  a  physical  space  counterpart  of  T**fi(k)  which  Itself  is 
the  result  of  such  a  decomposition  of  the  transfer  T(k)  performed  in 
the  spectral  space.  Despite  uniqueness  of  such  a  decomposition  in  the 
spectral  representation,  the  procedure  it  ambiguous  in  the  physical 
space.  Possible  definitions  are: 


|iWk)|<  =  -5^>|iu(k)|’+r(k)  (i) 

where  ti*(k)  is  the  velocity  field  in  spectral  space,  with  the  ex* 
plieit  dependence  on  time  omitted,  the  asterisk  denotes  complex  con¬ 
jugate,  v  i$  the  kinematic  viscosity,  and  T(k)  is  the  nonlinear  energy 
transfer 

T(k)  =  fl.K(k)Wn(k)).  (2) 

In  the  last  equation  W*(k)  is  the  nonlinear  tenn  in  the  Navier- 
Stokes  equation 

«,(k)  =  (-l/JJfttaC k)  j  <*Vi,(p)u4k  -  p) .  (3) 

where  tensor  P^jm(k)  accounts  for  the  pressure  and  incompress¬ 
ibility  effects.  The  summation  convention  is  assumed  throughout. 

Detailed  energy  transfer  to/from  mode  k  caused  by  its  interac¬ 
tions  with  wavenumbers  p  in  a  prescribed  region  V  of  the  wavenum¬ 
ber  space  and  q  =  k  -  p  In  another  region  Q  is 

Tr's(k)  =  i!t(u;(k)W„r'e(k))  (4) 


#1cts(J o=«f(x)j»r«(x),  (7> 

(8) 

ftOT(x)=S.WS,fft(x),  (9) 

where  i£(x)  i>  the  inverse  Fourier  transform  of  ti„(k)  truncated 
to  the  band  K. 

Function  ffVQ(x)  is  a  straightforward  counterpart  of  T  7C(k), 
with  a  product  of  S*  and  JVja  taken  in  the  physical  rather  than  in 
the  spectral  space.  However,  since  ta/'tk)  vanishes  outside  X,  the 
multiplication  in  the  spectral  spate  implicitly  truncates  Ms  (k)  to 
the  same  band  so  that  TCT0(k)  expresses  transfer  to  the  modes  in 
K  only.  In  f*”«(x)  the  effect  of  nonlinear  transfer  to  modes  outside 
K  is  present  in  the  term  A f*a. 

An  explicit  truncation  of  JV„,c(k)  to  K  and  multiplication  by 

seemi  to  rectify  this  problem  resulting  in  (8).  The  drawback  of 
this  definition  is  that  it  does  not  satisfy  a  natural  condition: 

£T*’a(x)  =  ^,  (70) 

K,?,Q 
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rhich  Is  satisfied  by  both  (7)  and  (9). 

Function  ^^(x)  may  be  interpreted  as  a  fraction  of  the  rate 
of  change  of  the  total  energy  e(x)  due  to  variation  of  mode*  in  K  as 
they  are  affected  by  nonlinear  interactions  with  modes  from  'P  and 

q. 

Thus  none  of  the  above  definition*  is  an  exact  counterpart  of  the 
spectral  transfer  TXPC(k)  but  (9)  is.the  most  appealing  candidate. 

An  interesting  special  case  is  obtained  by  dividing  a  wavenumber 
space  into  two  disjoint  regions  K  (k  <•*«)  and  P  (J k  >  ke).  Quantity 

Ts0$(x]k,)  =  Tk"M  +  (Jl) 

provides  a  physical  space  representation  of  the  rate  of  change  of 
energy  of  large  scales  (F  <  ke)  due  to  their  nonlinear  interactions 
through  wavenumber  triads  which  have  at  least  one  of  the  legs-in  the 
region  7V  This  is  precisely  the  energy, transfer  process  which  is  the- 
subject  of  the  subgrid-scale  modeling. 

We  have  computed  transfer  functions  (7)  and  (11)  for  the  statisti¬ 
cally  isotropic  velocity  field  obtained  in  direct  numerical  simulations 
performed  with  a  resolution  of  128*  modes  (maximum  wavenumber 
F  ss  64),  The  low  wavenumber  band  Q  remains  always  fixed  and  is 
chosen  to  cover  the  entire  energy  containing  range  (0  <q<  10).  Fig* 
uxe  1  shows  one  plane  from  the  full  transfer  (7)  repre*enting  in  the 
physical  space  the  energy  transfer  to  eddies  in  the  band  23  <  k  <  28 
caused  by  their  interactions  with  eddies  in  the  bands  20  <  p  <  25 
and  0  <  q  <  10.  The  transfer  function  Is  spatially  intermittent  and 
is  predominantly  positive,  indicating  a  flow  of  energy  from  the  larger 
scales  p  to  the  smaller  scales  F. 

We  have  attempted  to  correlate  this  physical  energy  transfer  with 
a  number  of  simpler  quantities  (rate-of-straln,  dissipation,  energy, 
etc.)  calculated  from  the  velocity  field  truncated  in  such  a  way  as  to 
contain  only  either  large  or  small  scales.  We  found  that  the  energy  of 
the  velocity  field  truncated  to  large  scales  0  <  k  <  10  correlates  very 
well  with  the  energy  transfer  among  small  scales  shown  in  figure  1. 
Correlation  of  other  calculated  quantities  with  the  energy  transfer, 
notably.the  square  of  the  rate-of-strain  tensor,  was  generally  much 
worse.  Therefore  we  conclude  that  the  energy  transfer  among  small 
scales  occurs  mostly  at  those  physical  locations  which  contain  large 
amounts  of  turbulent  energy  rather  than  at  the  locations  of  high 
strain  rate,  an  unexpected  result.  Indeed,  .until  this  paradox  is  re* 
solved,  we  can  not  be  confident  that  the  particular  measure  of  energy 
transfer  that  we  have  used  is  the  appropriate  one. 

We  have  used  formula  (11)  to  calculate  subgrid-scale  (SGS)  en* 
ergy  transfer  for  the  Mime  field  with  the  cutoff  wavenumber  F«  =  10. 
A  plane  from  the  full  SGS  transfer  field  is  plotted  in  figure  2.  The 
transfer  is  characterised  by  the  presence  of  both  negative  and  positive 
regionj.  These  indicate  energy  flux  from  and  to  the  large  scales  re¬ 
spectively  due  to  their  interactions  with  the  smaller  scales.  Standard 
subgrid-scale  eddy  viscosity  models  predict  transfer  in  one  direction 
only,  from  large  to  small  scales. 
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Fig  2.  Subgtid  scale  energy  transfer  in  physical  space  T$<;s(x|Ft) 
for  K  =  10. 


4  Conclusions 

We  have  devised  a  physical  space  representation  of  the  energy 
transfer  processes  among  scales  of  motion  belonging  to  three  distinct 
wave  lumber  bands  in  the  spectral  space  and  conclude  from  it  that 
the  energy  transfer  among  small  scales  is  highly  intermittent  in  the 
physical  space  and  correlates  well  with  regions  of  significant  large- 
scale  energy. 

As  a  particular  case  we  have  calculated  a  subgrid  scale  energ) 
transfer  in  isotropic  turbulence.  The  SGS  transfer  exhibits  regions 
of  energy  drain  frem  large  to  small  scales  as  well  as  significant  regions 
of  reversed  energy  transfer  jjjroni  small  to  large  scales.  Classical  eddy 
viscosity  models  assume  that  transfer  is  always  from  large  to  small 
scales,  contrary  to  the  results  of  direct  calculations. 
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Abstract  A  discussion  is  made  of  a  method  which 
may  economize -the  computation  to  solve  two-point 
closure  equations  for  anisotropic  turbulence.  A  meth¬ 
od  based  on  generating  random  fields  to  evaluate  a 
certain  wavevector  integral  and  a  time  integral  is  pre¬ 
sented;  The  use  of  Gaussian  fields  for  the  former  has 
been  so  far  found  ;to  yield  a  satisfactory  approxima¬ 
tion  when  applied  to  a  closure  equation  for  two  di¬ 
mensional  Navier-Stokes(NS)  turbulence. 

I.  INTRODUCTION 

Among  attempts  to  construct  a  statistical  theory  of 
turbulence  in  terms  of  low  order  moments,  there  are 
approaches  based  on  renormalized  perturbative  ex¬ 
pansions.  Some  of  the  so-called  two-point  or  spectral 
closures  thus  obtained  have  been  found  to  be  in  good 
agreement  with  experiments. 

Although  these  closures  are  in  principle  applicable 
to  a  wide  class  of  turbulences,  there  are  algebraic  and 
computational  difficulties  in  solving  closure  equations 
applied  to  realistic  anisotropic  turbulences.  One  way 
to  make  closures  more  accessible  to  a  wide  class  of 
turbulences  is  to  develop  an  efficient  method  to  solve 
such  closure  equations. 

We  present  here  a  method  based  on  the  evaluation  of 
integrals  by  generating  random  fields.  We  apply  this 
method  to  the  Lagrangian  renormalized  approxima 
tion  (LRA)(1).  We  confine  ourselves  here  to  the  equa¬ 
tion  for  single-time  moments  in  the  LRA.  The  struc¬ 
ture  of  this  equation  is  essentially  similar  to  those  in 
various  closures  including  the  direct-interaction  ap¬ 
proximation  (DIA)[2],  the  abridged  -  Lagrangian  - 
history  -  DIA[3],  and  the  local-energy-transfer  the¬ 
ory  I ‘Ij,  and  the  method  discussed  here  is  in  principle 
applicable  not  only  to  the  LRA  but  also  to  these  clo¬ 
sures. 

II-EQUATION  FOR  SINGLE-TIME  MOMENTS 

The  LRA  is  a  two-point  cloc-ier  approximation  ob¬ 
tained  by  a  simple  truncation  of  Lagrangian  renor¬ 
malized  perturbative  expansions,  and  is  free  from  any 
ad-hoc  adjusting  parameter.  The  LRA  has  been  so  far 
found  in  good  agreement  with  experiments  and  diicct 
numerical  simulations  regarding  not  only  single-time 
quantities  but  also  the  two-time  Lagrangian  velocity 
auto-correlation  function  [5,6,7). 

Let  us  consider  homogeneous  turbulence  of  an  in¬ 
compressible  fluid  of  unit  density  obeying  the  NS 
equations  with  zero  mean  flow  and  no  external  force. 
Let  Q  bo  the  Lagrangian  covariance  defined  by 

Qij(xst,x‘,t')  =<  vl(x,(,|()6i(x',t,|f')  >,  (t  >  t'), 


in  which  v(x,t|s)  is  the  velocity  at  time  s  of  the  fluid 
elementdhat  was  at  x  at  time  t,  and  let  Q,*(k)  = 
R.,(k)  Q,k{ k)  where  P,,(k)  =  6,,  -  k.kjh'2, 

0(,(k,f,O  =  (2ir)-°  JdDr01,(x  +  r,t,x,t')e-kr, 

D  is  the  dimension  of  space,  and.we  use  the  summa¬ 
tion  convention  for  the  repeated  indices. 

In  the  LRA,  the  evolution  of  the  single-time  moment 
Qi;(kjt,f)  obeys 

(jt+2vk’1)Q„(k;t,t)  =  D„(k,t)  + 

»o(k,t)  =  f  ds7j,(k,t,s),  (1) 

Jh 

A 

7},(k,t,s)  =  £Mimn(k) 

p.q 

x  {‘lMrca(p)Gml(p,  t,  s)Q„c(q,  1,  s)Q,a(~k,t,  s) 

—  2Afatc(k)Gia(-k,  t>s)Qm&(p,/,s)Qnc(q,  t,s)}, 

(2) 

A/imn(k)  =  -(i/2)(kmP,„(k)  +  knPU  k)J, 
where  p  is  the  fluid  viscosity,  t0  is  the  initial  time, 
the  symbol  £p,t,  denotes  the  integrals  over  p  and  q 
satisfying  k  =  p  +  q, 

Q„(k,(,s)  =  Gim(k,t,s)Qm,(k,$,s),  (3) 

and  G  is  the  so-  called  Lagrangian  response  function 
We  omit  here  writing  the  LRA  equation  for  G,  for 
which  the  reader  may  refer  to  Rcf.l. 

III.  MONTE  CARLO  METHOD 

Let  us  consider  the  evaluation  of  Ti/  defined  by  (2), 
for  given  Q  and  G.  Let  £  be  a  Gaussian  random  field 
with  zero  mean  and  covariance 

^  Cl (k. s)s; (p. s)  >=  J|(+pQv(k,S,s), 

and  let  {,(k,t)  =  G',j(k,(,sjfJ(k,.s),  then  we  have, 
because  of  (3), 

^  Ci (k 1 0C;  (P,  -■)  > -“  k  t  pQi; (k,  1 ,  s),  (t  >  s).  (4) 

If  we  define  z  as 

A 

-i (k)  —  G,m(k,t,s)  Af,n,r(k)  C;(p»s)Cfc(q»s), 

p.q 
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then 

A 

Tf;(k,t,s)  =<  ^imn(k)(»(p)^Xn(^j 0K‘(*“^) 

p,q 

A 

+  2  C  {^jW;m„(k)2m(p)^„(q,t)}?J(-k,0  >  .  (5) 

p.q 

Now  we  introduce  an  approximation  for  the  average 
<  /  >  of  a  random  field  /  that 

<  /  >«<  /  >Mi  (6u) 

where 

</>m=t r-£/m,  (66) 

*‘‘mar  "T. 

m—l 

/m  represents  the  value  of./  by  the  m-th  realization 
of  the  Gaussian  random  field  {,  and  Mmaz  is  the  total 
number  of  the  realizations,  i.e.,  we  approximate  the 
average  <  >  in  (5)  by  the  Monte  Carlo  average  <>*? 
of  finite  number  of  realizations. 

An  approximation  of  (1)  may  be  obtained  by 

N 

A';(k,  0  SS  Sn)Asn, 

nsO 

and  by  evaluating  T,,  with  (5)  and  (6)  at  each  s„, 
where  so  =  <o,  aw  =  f,  and  A s„  is  an  appropriate 
weight  corresponding  the  discretization  of  the  time 
interval. 

The  above  Monte  Carlo- method  lias  been  tested 
by  applying  it  to  the  LRA  equations  for  two  dimen¬ 
sional,  isotropic  and  anisotropic  NS  turbulences  (8j. 
In. practice  the  computation  was  simplified  by  us¬ 
ing  the  stream  function  arid  vorticity  formulation.  It 
has  been  found  that  the  values  of  the  energy  spec¬ 
trum  and  the  Lagrangian  veloicty  auto-correlation 
obtained  by  using  this  method  are  in  good  agreement 
with  file  LRA  solutiori  and  direct  numerical  simula¬ 
tions,  even  with  M™*  =  l- 
The  value  of  the  integral  in  (2)  over  p,q  for  given 
values  of  the  subscripts  depends  not  only  oh  i,m,n 
but  also  on  the  other  subscripts  «,6,c,  j,  so  that  the 
use  of  (2)  in  geneial  requires  us  to  evaluate  the  in¬ 
tegrals  for  all  possible  values  of  the  subscripts.  On 
the  other  hand  the  integrals  in  (5)  depend  only  on 
i,m,n  and  not  on  the  other  extra  subscripts.  Thus 
the  evaluation  in  (5)  is  in  general  much  easier  than  in 
(2),  in  particular  ,  .  three  dimensions. 

The  above  method  is  applicable,  at  least  in  principle, 
not  only  to  NS  turbulence  with  zero  mean  flow,  but 
also  to  turbulent  shear  flow,  Rossby  turbulence,  MUD 
turbulence,  etc..  In  its  application  to  inhomogeneous 
turbulence,  we  need  generate  random  Gaussian  fields 
satisfying  equations  like 

<  6(MKi(P>*)  >=  Qii( k,P,«,s),  (7) 


for  given  Q;;(k,p,t,s),  instead  of  (4).  Under  an  ap¬ 
propriate  discretization,  this  is  equivalent  to  generat¬ 
ing  fields  satisfying  equations  like  <  {m{„  >=  Qm„  in 
a  symbolic  notation,  where  the  subscripts  m,  n  stand 
for  the  wavevector,  time  and  r  or  j  in  (7)  "h  {-field 
may  be  obtained  by  putting  {m  =  Am„yn  here 

AmjAnj  ~  Qmn>  (6) 

and ymy„.  >=  6m„.  It  is  not  difficult  to  find  A 
satisfying  (8)  by  imposing  Amn  =  0  for  m-<  n. 

The  idea  of  using  random  fields  may  be  applica¬ 
ble  also  to  the  time  integration  in  (1).  Suppose  that 
the  Gaussian  random  {-fields  at  time  t  and  s  are  cor¬ 
related  as  (4)  for  s  =  s„,  n  =  0,1,...,  AT.  Let  {a} 
and  {6}  be  sets  of  real  random  numbers  independent 
of  each  other,  and  satisfying  <  ama„  >=<  (hm-  < 
bm  >)  f>».>=  <5mn,  b„>  0  for  any  n,m,  and  let 

N 

{i(k)  =  £a,N/^{i(k,s„), 

nsO' 

Gtj{ k)  =  ^(6„—  <  bn  >)Gi>(k,t,s„)Asn, 

n=0 

*i(k)  =  G;m( k)  X  Mm/*(k)  {>(p){l(q), 

p,u 

then  the  r.h.s.  of  (5)  with  z  replaced  by  z  yields.an 
approximation  for  D1;(k,t),  (not  T,j).  The  average 
<  >  may  be  then  approximated  by  the  Monte  Carlo 
average  <  >  defined  by  (6)  in  which  fm  depends  not 
only  on  the  realized  random  field  {  but  also  on  {«} 
and  {6}.  This  method  of  evaluating  D,,  has  the  ad¬ 
vantage  that  it  does  not  require  the  evaluation  of  inte¬ 
grals  over  p,  q  at  each  s„  .  The  performance/efficiency 
of  this  method  remains  to  be  tested. 
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Abstract  -  Using  an  optimally  convergent  representation,  a  low 
dimensional  model  is  constructed  (Aubry  et  al,  1988)  which 
embodies  in  a  streamwisc-invariant  form' the  effects  of  streamwisc 
structure  (Berkooz  et  al,  1990).  Results  of  Stone  (1989)  show 
that  the  model  is  capable  of  mimicking  the  stability  change  due  to 
favorable  and  unfavorable  pressure  gradients.  Results  of  Aubry  et 
al,  (1990)  suggest  that  polymer  drag  reduction  is  associated  with 
stabilization  of  the  secondary  instabilities,  as  has  been  speculated 
Results  of  Bloch  &  Marsden  (1989)  indicate  that  drag  can  be 
reduced  by  feedback,  and  that  this  is  mathematically  equivalent  to 
polymer  drag  reduction.  * 


Objective  analysis  of  experimental  measurements  indicates  that 
there  are  recurrent  streamwisc  rolls  present  in  the  wall  region  of  a 
turbulent  boundary  layer,  at  legist  in  the  quadratic  mean  sense 
(Conno  &  Brodkey,  1969,  Kline  et  al,  1967).  Representation 
theorems  (Lofcve,  1955)  permit  optimal  expansion  of  the 
instantaneous  velocity  field  in  the  wall  region  in  terms  of  these 
streamwise  rolls  (Lumley,  1967).  -  Without  involving  ourselves  in 
the  question  of  the  source  of  these  y.T>lI$,  we  ask  how  they  will 
.  behave  dynamically.  Severely  truncating  our  system,  and  using 
Galerkin  projection,  we  obtain  a  closed  set  of  non-linear  ordinary 
differential  equations  with  ten  degrees' of  freedom '  The  methods 
of  dynamical  systems  theory  are  applied  to  these  equations.  Loss 
to  unresolved  modes  is  represented  by  a  Heisenberg  parameter 
(Aubry  etal,  1988;  Berkooz  etal,  1990). 

We  find  that  for  large  values  of  the  Heisenberg  parameter  fiarge 
loss),  we  obtain  stable  streamwise  rolls  having  the  experimentally 
observed  spacing  For  smaller  values  of  the  parameter,  we  have 
traveling  waves  (corresponding  to  cross-stream  dnft  of  the  rolls), 
we  also  find  a  heteroclinic  attracting  orbit  giving  rise  to 
interraittency;  and  finally  a  chaotic  state  showing  ghosts  of  all  of 
the  above. 

The  intermittent  jump  in  phase  space  from  one  attracting  point 
to  the  other  resembles  in  many  respects  the  bursts  observed  in 
expenments.  Specifically,  the  time  between  jumps,  and  the 
duration  of  the  jumps,  is  approximately  that  observed  in  a  burst, 
the  jump  begins  with  the  formation  of  a  narrowed  and  intensified 
updraft,  like  the  ejection  phase  of  a  burst,  and  is  followed  by  a 
gentle,  diffuse  downdraft,  like  the  sweep  phase  of  a  burst.  During 
the  jump  a  spike  of  Reynolds  stress  is  produced,  as  is  observed  in 
a  burst,  although  the  magnitude  is  limited  in  our  model  by  the 
truncation  of  the  high  wavenumber  components. 
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The  behavior  is  quite  robust,  much  of  it  being  due  to  the 
symmetries  present  (Aubry’s  group  has  examined  dimensions  up 
to  128  with  persistence  of  the  global  behavior,  Aubry  &  Sanghi, 
1989).  We  have  examined  eigenvalues  and  coefficients  obtained 
from  experiment  (Herzog,  1986),  and  from  exact  simulation 
(Mom,  1984),  which  differ  in  magnitude  Similar  behavior  is 
obtained  in  both  cases;  m  the  latter  case,  the  heteroclmic  orbits 
connect  limit  cycles  instead  of  fixed  points,  corresponding  to 
cross-stream  waving  of  the  streamwise  rolls.  The  bifurcation 
diagram  remains  structurally  similar,  but  somewhat  distorted. 

The  role  of  the  pressure, term  is  made  clear  -  it  triggers  the 
intermittent  jumps,  which  otherwise  would  occur  at  longer  and 
longer  intervals,  as  the  system  trajectory  is  attracted  closer  and 
closer  to  the  heteroclimc  cycle.  The  pressure  term  results  in  the 
jumps  occurring  at  essentially  random  times,  and  the  magnitude  of 
the  signal  determines  the  average  timing.  This  clarifies  the 
question  of  whether  bursting  scales  with  wall  variables  or  with 
outer  variables  -  evidently  the  structure  of  a  burst  scales  with  wall 
variables,  while  the  time  between  bursts  should  scale  in  a  complex 
wav  with  both  inner  and  outer  variables  (Stone  &  Holmes,  1990a, 
b). 

Stretching  of  the  wall  region  shows  that  the  model  is 
consistent  with  observations  of  polymer  drag  reduction  (Aubry  et 
al,  1990),  m  which  one  of  the  accepted  mechanisms  is  the 
stabilization  (by  the  extensional  viscosity  associated  with  the 
polymers)  of  the  large  eddies  in  the  turbulent  part  of  the  flow, 
allowing  the  eddies  to  grow  bigger  and  farther  apart,  as  observed. 
Aubry  et  al  (1989)  tned  stretching  the  eddy  structure  in  the  wall 
region,  producing  drag  reduction,  and  found  the  bifurcation 
diagrams  morphologically  unchanged,  except  that  the  bifurcauons 
occuned  for  larger  and  larger  values  of  the  Heisenberg  parameter. 
This  suggests  that  the  motions  giving  nse  to  the  bifurcations  arc 
more  and  more ,  unstable,  the  more  the  region  is  stretched, 
requiring  a  larger  and  larger  value  of  the  Heisenberg  parameter  to 
stabilize  them.  Now,  the  Heisenberg  parameter  represents  the  loss 
of  energy  to  the  unresolved  modes  as  well  as  loss  to  any  other 
dissipation  mechanism,  such  as  viscosity  or  extensional  viscosity. 
Hence  the  findings  of  Aubxy  et  al  (1989)  arc  completely  consistent 
with  the  idea  of  the  larger  eddies  being  less  stable,  and  able  to 
grow  to  this  larger,  less  stable  size  due  to  the  stabilizing  effect  of 
the  polymer. 


Change  of  the  thud  order  coefficients,  corresponding  to 
acceleration  or  deceleration  of  the  mean  flow^  changes  the 
heteroclinic  cycles  from  attracting  to  repelling,  increasing  or 
decreasing  the  stability,  in  agreement  with  observations  (Stone, 
1989). 

The  existence  of  fixed  points  is  an  artifact  introduced  by  the 
projection,  in  the  exact  equauons,  the  rolls  always  decay  aft'  r  a 
period  of  growth.  The  projection  incorporates  statistically  the 
birth  and  death  of  individual  rolls,  producing  a  stationary 
situation  However,  a  decoupled  model  soil  displays  the  nch 
dynamics  (Moffat,  1989;  Holmes,  1990a,  Berkooz  et  al,  1990). 
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Has  son  of  rrirtvdy  cs^Sc  nsodd  cccSi  be  esed  as  2  "Had: 
box'  ia  feed-bad:  sys^ms  to  cocroi  tbs  bocadry  byes.  »  «s3 
as  bring  used  to  pcct&t  jrcsscrt  sod  sress  flsssrSocs  21  tie 
n-?H  23d  the  dice:  of  vzriocs  dr2g  rcdxrica  srhrnrs.  icegge 
back  ri  gccftacaocs  wih  the  proper  phase  cn  tfcby  6c  brssag. 
tAe  heterociiajc  j=?  to  tbs  «b=r  fixed  poet).  tJaros?gtbs 
drag.  It  is  also  possible  to  speed  c?  the  bearing.  i^crcasg 
nuxisg  to  cocrol  sepexxa  (Bloch  &  J-irsdea.  1989). 

la  recest  work.  Berkeez  (1990),  xa  coCiboeaaaa  wsh  Hofcres 
aad  Lesley,  has  shown  that  several  asscropooss  cads  oa  an 
insarive  basis  in  the  uxxk  of  Adxy  c/  cay  be  jariacd  fccrxSr. 
namely:  that  the  Hrisexbcrg  model  used  gives  the  ccacct 
dissipation  whhin  a  constant  of  oeder  enriy,  2s  asssmed;  6at  the 
Leocarf  stresses  may  be  neglected  ia  the  case  of  modeF**  vrthco 
streamwise  v-jfarion,  2s  asssacd;  (bat  the  previoes  resah  bolds 
for  aa  arbitrary  camber  of  rigcsfsncoons  when  00  srreamwise 
v2riatioo  is  present;  that  models  with  00  saeasrnse  variarioo  ia 
effect  average  the  srieamwisc  dynamics,  as  conjectured  by 
Holmes. 

Errly  work  of  Bloch  &  Marsdta  (19S9)  showed  that  sysuas 
with  bomochnie  anoaoes  tdc  ia  principle  coacoUabie,  with  a 
certain  type  of  control  iapoL  Ocr  efforts  ia  this  dirrerion  are 
concerned  oa  determining  the  feasibility  of  control,  and  trying 
to  understand  the  possible  gains  ia  terms  of  drag  redaction  and 
mixing  enhancement  To  this  end  Bcxkooz.  Holmes  and  Lcrnley 
introduced  the  notion  of  short  term  tracking  time  T_sl  This  is  a 
measure  of  the  time  over  which  a  dynaxaeal  sys^ms  model  tracks 
the  true  dynamics  accuracy.  T^  is  of  fundanxcral  importance  in 
the  control  application,  and  it  mast  be  of  the  order  of  the  wd- 
region  time  scales  to  make  control  possible.  They  then  showed 
that  dynamical  systems  based  on  the  Proper  Orthogonal 
Decomposition  have,  on  the  average,  the  best  T _s  foe  a  given 
number  of  modes  (Bcxkooz,  1991). 

Berkcoz  (1990)  has  made  rigorous  estimates  using  the  proper 
onhortcal  decomposition  showing  that  a  structured  turbulent 
flow,  such  as  the  wall  layer,  has  a  phase  space  representation  that 
remains  within  a  thin  slab  centered  cn  the  roost  energetic  modes 
for  most  of  the  time.  However,  exits  from  this  region,  which  is 
all  that  our  !ow-dimcns;onal  models  include,  should  not  be 
ignored,  since  they  typically  correspond  to  violent  events,  such  as 
the  bursting  phenomenon.  Bcrkooz  and  Holmes  arc  trying  to 
develop  a  theory  in  which  deterministic,  low-dimensional 
dynamics  governing  the  low  modes  apply  most  of  the  time, 
passages  from  and  returns  to  this  bring  modeled  probabilistically. 
This  might  be  viewed  as  a  dynamical  closure.  They  plan  to  test 
their  theory  on  problems  including  the  32  and  54  dr^msional 
projections  of  Aubry  &  Sanghi  (1989). 

Campbell  &  Holmes  (1990)  arc  continuing  their  studies  of 
symmetry  breaking  (0(2)  -»  D4)  in  systems  with  structurally 
stable  hetcroclinic  cycles.  They  have  proved  that  no  analytic 
(second)  integral  of  motion  exists  in  a  certain  limiting  case  and  that 
only  two  pairs  of  the  continuum  of  0(2)  symmetric  hetcroclinic 
cycles  persist  in  general.  They  are  studying  the  bifurcations  from 
these  survivors.  This  work  is  relevant  to  our  models  of  interacting 
coherent  structures  in  boundary  layers  with  discrete  spanwise 
symmetry,  such  as  that  caused  by  riblets.  This  is  to  our 
knowledge  the  fust  analytical  contribution  to  our  understanding  of 
the  drag  reduction  causal  by  riblets. 
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APPLICATION  O?  BESQ2MAU5AIH)S  CHOU?  TO  TCHBULEXCZ 
SIMULATION  CS3XG  CONDITIONAL  AY'EHAGIXG- 


W.D.McCceb, 

Dey*zi:r>e=i  c£  FHycc*,  Umrccty  of  Zdmhmgj,  Edmlcrgi  Z53  3JZ,  U.K- 
TaAukace  presets  tic  trchefj^probfem  of  the  aocHvrrr 


5dd  vHi  ed2fi<  chaotic  bdaroc.  Abort  a  c^ktl  7jJk 
cf  tic  Bepc&2t  rr~:Vr,  tV*  relochy  £dd  U(r,{)  noo  o 
predkisbJy  with  poets  ,  a  *«.  Oa  tie  cAba.  Btai,  if  -arc 

Former  tzaas&ea  tic  t  jc  4  with  respect  to  wxTerrmber 
k,  then  tic  jcc£2e=j  beets?  5  cf  masy  degrees  cf  freedom: 
tie  Fccrie?  aoics  U(k,t/»  tie  zriemi  0  <  k  <  k±  *  c 

«  =  jT  2^S{i}a~  (1) 

wiere  <  is  tic  dissipation  rate,  s*  is  tic  Hacsatk  viscorit y,  and 
£(i)  ix  tie  energy  spectasa.  TK*dc£zxtioz  ensures  that  5*  » 
of  tic  Mac  order  of  magnitude  u  tic  Kolmogorov  cxdptlxn 
wavenumber. 

Iz  general,  for  all  bat  tic  simplest  turbulent  fidds  sad  low¬ 
est  Reynolds  numbers,  tiexc  are  too  many  degree*  of  freedom 
for  cozjldc  numerical  simulation  cf  tic  Karier-Stokes  equation 
(XSE)  to  be  possible.  Computers  are  sot  large  enough  for  tHs 
purpose,  cor  are  they  Ekdy  to  be  so  ia  tie  foreseeable  future. 
Tics  tic  theorist  is  faced  with  tic  important  and  interesting 
question:  can  we  derixe  some  analytic  method  to  reduce  tie 
camber  cf  raodes  wLich.must  be  resolved  by  tic  computer?  Ia 
tHs  way  we  could  Hope  U » male  cal  edition  feasible  ia  a  Hybrid 
fashion,  with  a  solution  to  tic  problem  wbici  is  neither  wholly 
asalytic  cor  wholly  cumericaL 

Tic  obstacle  which  Ees  ia  the  way  of  such  aa  approach  is 
the  well  known  phenomenon  of  nonlinear  caring.  This  may  be 
seen  as  follows.  Consider  the  solenddal  form  of  the  NSE  ia 
wavenumber  space.  This  takes  the  form  [lj: 

= iw  k)  /  ejuMqafr  -  i.‘)+/.(k,0. 

.  <J> 

where 

AWk)  =  (2.r'foD^(k)  t  k,D*(  k)]f  (3) 

and  the  projection  operator  (which  arises  is  the  process  of  elimi¬ 
nating  the  pressure  and  the  incompressibility  con  ditios  together) 
is  given  by 

Dot  =  Safi  -  ktkfi  |  k  |'3,  (4) 

where  is  the  Kronecker  delta.  We  choose  the  stirring  forces 
/•(k»t)  to  satisfy  the  usual  requirements  for  a  well-posed  prob¬ 
lem.  That  is,  they  act  directly  on  the  fluid  only  at  the  lowest 
value#  of  the  wavenumber-  Although  they  arc  therefore  some¬ 
what  arbitrary  in  form,  the/  are  only  introduced  in  order  to 
allow  us  to  consider  the  simplest  situation,  which  is  turbulence 
which  is  homogeneous,  isotropic  and  stationary.  We  shall  also 
restrict  our  attention  to  the  case  of  zero  mean  Add. 

From  equation  (2)  it  is  now  obvious  that  the  nonlinear  term 
couple#  the  mode#  corresponding  to  different  wavenumber#  to¬ 
gether.  In  principle,  it  follow#  that  all  modes  are  coupled  in  this 
way,  although,  in  practice  it  is  generally  assumed  that  appre¬ 
ciable  interaction  only  occurs  between  mode#  which  arc  dose  to 
each  other  in  wavenumber.  Indeed,  it  if  the  assumption  of  local¬ 
ness  of  energy  transfer  In  wavenumber  which  underpins  the  idea 
of  an  energy  cascade  and  which  leads,  by  way  of  dimensional 


aaalycs,  to  tie  experimentally  observed  Kolmogorcfi' spectrum; 

It-c 

(s) 

where  a  is  tic  constant  of  propcrt/oaaHy. 

Now  fot  us  ccurinate  modes  by  tie  appEcaixo  of  tie  Rc&or- 
rarHsatioa  Group.  Wc  begin,  by  decomposing  the  Telocity  £dd 
into  cxpEdt  scales  for  k  <  kj  and  smpSdt  scales  for  kj  <  k  <  k& 
tins:  ~ 

Um{ k)  =  I/-(k)  for  0<k<k! 

=  C?(k)  for  kx<k<kc,  (6) 

wicrc  ij  is  defined  by 

kl  =  (l-A)*=.  (7) 

with  the  bandwidth  parameter  A  satisfying  the  condition  0  < 

A  <  1. 

In  principle,  the  resormalkaiion-group  approach  can  be  car¬ 
ried  out  as  follows: 

(A)  Solve  the  NSE  cn  kx  <  k  <  io-  Substitute  that  solution 
for  the  high-k  modes  back  into  the  NSE  oa  0  <  k  <  kx.  This 
results  in  an  increment  to  the  viscosity:  v9  — *  vx  —  u0  +  Sv^. 

(B)  Rescale  the  baric  variables  such  that  the  NSE  on  0  <  k  < 
k2  looks  like  the  original  N#  viex-Stokcs  equation  on  0  <  k  <  ko. 

These  two  stages  are  then  repeated  to  e£  ruinate  th*  effect 
of  high  wavenumbers  progressively  in  a  series  of  bands  k^t  < 
k  <  k,,  until  the  rescaled  viscosity  no  longer  changes  (i-e.  has 
reached  a  "fixed  point*-). 

This  procedure  is  appealingly  rimple,  and  has  a  clear  physical 
interpretation,  bat  it  has  not  proved  easy  to  put  it  into  practice 
in  the  turbulence  problem.  Certainly,  the  method  of  iterative 
averaging  has  shown  that  a  fixed  point  may  be  obtained,  with 
reasonable  quantitative  results  [2, 3].  But,  this  method  has  been 
open  to  the  criticisms  that  the  baric  averaging  technique  rued 
was  obscure  and  that  there  was  an  unexplained  dependence  on 
the  bandwidth  parameter  A.  However,  recently  these  criticisms 
have  been  answered  by  the  introduction  of  a  conditional  average^ 
which  is  then  evaluated  a#  an  approximation  in  which  A  play* 
the  part  of  a  small  parameter  [4j.  It  is  this  development  which 
concerns  us  here. 

Let  us  introduce  a  conditional  average  which  smooths  out 
the  effect  of  the  high-fc  modes,  while  keeping  the  U"  constant. 
We  represent  it  by  an  operator  |  U“]  and  denote  its  effect 
on  the  first  shell  of  wavenumber*  to  be  eliminated  by  <>o,  thus: 

,i[U*  I  U~]UmUfl...U1  =<  V.V,...U,  >0 .  (8) 

It  then  follow*  that  this  operator  has  the  properties: 

<  U;(k)  >»=  t/-(k),  (9) 

<  !/.-UTO(k  -  j)  >.=  ICOWk  -  j).  (10) 

It  is  at  this  point  that  we  encounter  the  difficulties  associated 
with  nonlinear  mixing.  We  now  wish  to  evaluate  averages  of  the 
above  kind  over  the  high-k  modes  and  express  them  in  terms  of 
global  mean  quantities,  such  as  the  energy  spectrum.  Evidently 
the  problem  we  face  is  that  the  U+  field  is  not  independent  of 
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tieC-£ddwij<h*exreiclE^<x=tu=t.Tietwo&J2nl;e 
corded  tesctirrtirorgh  the  secEaexr  term  znepatsca  (2). 
Wetad3etH»<=5c=bjbywrit=stleizsh.l=cdabte=. 

edaaewrirUV+stlm* 

^CM)  =  1*CMHA:(M>.  (11) 

Sere  V*  £f*&Mef  tie  mme gened  type aaU+ and  beJougx  to 

**:3C  tarbrSeut  cnarmhis.  Eesccjtlullenacp^Krixi 
esiejijiditctpjjtia: 

<K.*(M)>=0,  (12) 

<  VCMPSNK3  >=<  ^(MJff/flc'.O  >  .  (13) 

However,  tic  essential  feature  of  V+  a  that  ii  »  rot  coupled  lo 
tie  low-fc  modes.  Tins,  through  (11)  we  introduce  the  fcndion 
A+  to  tike  account  of  code  coupHng.  We  aote  that  it  fflm 
ixzx&ltd y  feom  (12)  that  A+  to*  zero  mean  g=dir  gjohri  *t- 

to  orde  to  complete  ocr  specification  of  the  two  new  fields, 
ire  state  tier  properties  under  conditional  averaging  as 

4[UMU-)V+(M)  =  0  (14) 

tad 

^[OMU-lA+(fc,0  =  O(Xr,m>L  (15) 

We  coir  make  use  of  tie  above  equations  to  evaluate  condi- 
tioaal  moments  of  tie  U+  to  term*  of  unconditional  mome nts  of 
ttc  Vf,  tad  to  this  way  wc  decompose  the  KSE  into  aa  equa- 
tioa  fox  the  expEdt  scales  bat  containing  the  term  <  U+U*  >0 
as  representing  the  coupling  to  implicit  modes,  along  with  aa 
iterative  solution  fox  the  latter  quantity,  which  cao  be  shown  to 
be  Eaeaxly  dependent  oa  U“(le,<).  to  this  way,  we  can  derive  aa 
expression  of  the  toexemeat  to  the  fluid  viscosity  as  the  sum  of 
aa  infinite  series  to  the  unconditional  moments  of  the  V*  field. 

In  order  to  extend  this  calculation  to  progressively  lower 
bands  to  wavenumber,  we  hare  to  make  three  approximations. 
First,  we  evaluate  convolution  integrals  over  intermediate  times 
oa  the  assumption  that  the  U*  evolve  muds  more  rapidly  than 
(oa  aTeiage)  the  U  .  Second,  we  truncate  the  expansion  to  mo¬ 
ments  of  the  U*,  oa  the  grounds  that  the  expansion  is  bounded 
by  a  power  series  to  the  velodty  field  in  the  dissipation  range  of 
wavenumbers.  Thirdly,  and  this  is  the  main  new  feature  of  the 
work,  we  relate  the  V+  field  to  the  <J+  by  assuming  that  the 
coupling  between  distinct  Fourier  modes  is  local  in  wavenumber. 
That  is,  we  assume  that  A  is  large  enough  for  U(fco)  lo  be  inde¬ 
pendent  of  U(k,),  and  at  the  same  time  that  A  is  small  enough 
for  us  to  represent  the  Fourier  components  to  the  band  by  means 
of  »  fint-ordo  Taylor  acrica.  In  thi*  way,  wc  import  both  upper 
and  lower  boundr  on  A,  when  we  male  the  identification 

K.+(M)  =  WM  +  (fc-k.)-Vltf(M) I..*  +0(A’).  (16) 

Tie  RG  procedure  may  then  be  implemented  band  by  band, 
with  the  scaling  transformation 

**♦*  =  (1  -  A)fc*  !=  kkn%  k  =  kn^l1dt  (17) 

for  each  iteration  n.  Am  aaaumption  tbit  the  energy  rpeclrum  ii 
a  power  law,  followed  by  power  counting,  lead*  to  the  following 
expression  for  the  effective  viscosity 

''(KV)  =  a'»('l‘K'l%(k<),  (18) 


wbertbecoeSaenSy^yjrspTeabytberecurfSoQ  rriaijccihr? 

KrtiOT  =  hS'S’JllX)  -h  (19) 

^•(*0  =  5J5 

for  tie  wrenmicr  basda  0  <  V  <  1,1  <  f,T  <  tr\  where 
r=|lf-ri,and 

(J<  =  iW>_Hi«V>(f_i-,)m„.)  (21) 

Tie  coefficient  L{VtT)  contains  purely  geometrical  factors  and 
is  given  to  detail  to  ft). 

As  a  test  cf  the  method,  wc  have  calculated  the  KctoaogoroT 
spectral  constant,  as  defined  by  equation  (5).  Over  a  range  of 
nines  of  A  for  which  the  theory  is  valid,  we  find  a  —  1-6,  to 
good  agreement  with  experiment.  At  large  nines  of  A,  one  may 
observe  the  breakdown  of  the  first-order  Taylor  series  approxi¬ 
mation;  whereas,  at  small  nines  one  sees  the  effect*  of  mode  con- 
pErg,  which  would  invalidate  the  baric  assumption  that  U(k«) 
is  independent  of  U(fcj). 


Dependence  of  Kolmogorov  constant  on  bandwidth 


aa  0,1  or  as  04  as  us  or 


A 
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Pl»<  GmcU&os «x41j raxsfet  of  E*cjy  zad  E«£rity* 


Ycokhj  MURAKAMI 
PtyKtaqi  c(  Kilhau&il  Soom. 

Cc3^t  of  EipKtrij, 

loniKiy  o(0«b  P«f((t*w,S*hi,i51,  J»fu. 

Altftnd 

«TT«  lit  c(  ead  ooie,  are  ippw  to 

ii  Fourier  jp*te  of  tuzbxfexl  idd*  *!sdi  ire  obubtd 
aumrri ca2y  it  rtUurdr  low  Kcyxc/J*  nski  (Si  ~  30).  The 
effects  of  destroying  the  corrtUlzof'j  be-xxem  nodes  »uj  these 
crtkyJi  to  both  etttjj’  uf  fef&dt y  inafei  «e  umlijit«L  It 
b  shears  that  one  method  which,  abo  preserves  the  heEdty  spec- 
tree*  »  the  roast  effective  to  change  the  fcc a  of  both  transfer 
spectra  uf  that  *t  wtlf  as  if  it  w;e  no(oHtmjf. 

L  INTRODUCTION 

We  iitotisite  the  role  of  phase  correlations  oa  the  uoa- 
Eiear  transfer  ia  this  universal  xasge.  We  treat  the  foDowiag 
problem:  //  «  destroy  the  eerreUtioxs  fetrerx  modes  rx  scute 
sales  keeyity  tie  czcryy  sjectnl /cm  cttif.eitlly.  how  ere  tie 
egiciczcy  ef  tie  exeryy  tnxsfer  (Le.  exeryy  c*sc*ie)  effected? 
This  destruction  bit  be  regarded  a s  aa  idealization  of  the  ej- 
i frail  random  disturbances  which  inevitably  exist  ia  laboratory 
ex  per  in  eats  ard  natural  phenomena.  Hence  this  problem  is  r<- 
lated  to  the  srrscfcrc/slabtEty  problem  oa  the  turbulent  equilib- 
rian  state.  Aloxg  thb  Eae,  we  hare  iavestigaled  rirVoas  aspects 
of  the  dyutnie*  of  the  decaying  disturbed  tcrbufence{I,2].  Here 
we  do  tot  consider  the  dynamics  b»t  apply  reshufflings  to  wri- 
exs  ranges  of  Foaner  space  ia  order  to  estimate  the  effect  to  not 
only  the  energy  transfer  bat  also  the  hehdtj  transfer  which  was 
aot  treated  ia  Murakami  et  s(2). 

IL  NUMERICAL  PROCEDURE 

We  used  the  data  obtained  by  direct  an  clerical  sin  ala  lions 
of  the  incompressible  Navier-Stokes  (NS)  equation  with  periodic 
bonedary  conditioas  by  pseudospcctral  method  (.V  =  «V  = 
0  015).  Details  of  the  aomerica)  scheme  are  girea  ia  Poliflc  and 
Shtilnan[3]  and  Folifle[<].  Oae  field  (RH)  corresponds  to  the 
statbtically  steady  state  generated  by  a  time-independent  and 
ran  don  helidty  fordng  exerted  at  large  scales  (  3  <  k  <  4  ). 
The  field  has  been  obtained  after  aronnd  six  tarnorer  time  (1000 
time  steps  with- At  as  0X01  ).  The  othet(MH)  is  the  same  as 
the  above  except  using  morimsm  helidly  fordng.  The  charac¬ 
teristic  parameters  of  these  fields  are  given  ia  the  Table  I.  Note 
that  we  treat  the  relatively  low  Reynolds  number  (Rx  =  26.5 
and  33.4)  tnrbnlence  owing  to  the  limitation  of  the  resolution. 

TABLE  I  The  characteristic  parameters  of  the  tnrbnlent  fields. 
E:  total  energy,  0:  total  enstrophy.  Hi  total  helidly,  R\  — 
y/lO/ZEf(vy/(i):  Taylor’s  micro-scale  Reynolds  namber,  r  = 
(yr/ l"*£(t)dh)  fy/iH/i ;  tninovn  time. 

TABLE  I 


Field 

E 

n 

// 

Rx 

(*)  RH 

1.29 

35X 

0.79 

265 

(b)  MH 

1.43 

27A 

8.47 

33.4 

*  "This  wrl  was  performed  with  partial  icpport  of  V  S  Depart  meat  of 
tettu  Grant  No  DE-FG02&ER13&7  AQ  compn  tat  torn  were  performed 
at  Lawrence  Livermore  National  Laboratory 


We  give  lie  ddbilm  of  three  diJerext  methods  fee  exeryy- 
ccxseirixj  tiusitmutm,  which  we  call  robfibg  (£*(£)  = 

mm 

(1)  HR  txtiici  KeBaty  jprctrta  is  pm  by  ff(k)  =  i(k)- 

C<-i)  =  <**)  *  the  aagje 

brtweea  these  two  vectors  (the  heEdty  associated  phase).  We 
ooly  charge  tie  odextadoa  of  the  teal  part,  f(Eo'(i))  a  s  txi- 
finrmly  random  way  ia  a3  dt:ecUozs’fi-2:).  paJ'jfc)  = 
«d(Erf'(f)|  =  |Eri(t)H 

(2)  HC  meticL  V*e  preserve  the  angle,  d{k)  between  real  and 
imxpxxsy  part  of  t(t),  bet  rotate  thb  pair  in  a  rxifbrslj  raa- 
dom  way,  thrs  preserving  both  energy  E(k)  aad  heEdty  Zf(fc). 
CKe^(w)I  =  IRef(h)|,  |lm?(*)|  =  |Iat(t)]  and  #(jt)  =  #£).] 

(3) 54  artW.  We  apply  the  foOoiring  transformation: 

g ri'(t)  =  (\Zo£(t)/jEn(t)aRri;t)  jid  fai f(k) 

—  (\/(l  —  o)£li)/jfa£Tt)!)Im-(i) ,  wt«.  a  t*  .  .aiiornlj  ri>- 
dou  «Bb «  l*t*m  Oudl.  =  f(Bn(t)).f(IaS'(i)) 

=  f<fer(t))«d£'{f)  =  £(t).] 

We  apply  each  xeshsSixg  separately  and  calculate  both  energy 
and  helidty  transfer  aad  compare  the  results  on  the  undisturbed 
case  and  the  filtered  spectrum  where  some  modes  ia  a  range  are 
set  to  zero.  We  report  the  resells  oa  two  ranges.  IS  <  k  <  32 
and  6  <  k  <  Jl.  to  two  fields:  (a)  random  helidly  field  (RH) 
and  (b)  maxinen  hdidty  field  (MH). 

III.  RESULTS 

A.  Turbulent  fields 

Ia  Fig.  ]  both  energy  and  hefcdlj  specra  lor  (b)  MH  b  shown. 
The  dolled  lines  shows  ac(H(k))  —  {ljy/z)E(kj  which  is  the 
standard  deviation  of  the  heEdty  spectrum  ia  the  qnasi-Gacssiaa 
approximation^, 4]  while  thesoEd  line  shows  the  roaximam  he¬ 
lidly  2t£(L).  Positive  valses  of  the  helidty  are  denoted  by  pins 
signs,  negative  by  diamonds.  There  b  no  dear  inertial  range 
owing  to  the  relatively  low  Remolds  number.  (See  TABLE  1.1 
In  case  (a)  RH  almost  all  H(k)  lie  within  qoasi-Gacssian,  whose 
figure  b  omitted,  while  dear  deviation  b  observed  in  case  (b) 
MH. 


FIG.  1.  Helidty  cospectrum  and  energy  spectrum  in  log-log 
sca’e  for  case  (b)  MH. 

We  recall  both  balance  equations  for  the  energy  and  helidly 
spectra 

^P-  =  TB(l:)-1vk’E(t)  (I) 

and 

^=r,(i)-2^;/(i),  (2) 
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wi*rr  we  osaS  Ik  term  due  to  tie  forasg  (3  <  i:  <  4).  Tie 
Uuiicr  hno  ud  vc  ddbtd  tt  tki^rtn^d 
oC 

Trff)  =  Mi(f)-i-(‘)!  (j) 

ud 

3V(i)=2t-fa[r(f)xi(t)j.  o) 

»kre  A(i)  a  a  Fourier  tUMfcfo  of  tie  Lteb  rector,  A(£)  = 
(5x  J)(z)  *Se  asterisk  dexotes  tie  coaplec  cosjtgile.  Xi* 

that  tie  reshuffling*  keep  £(f)  let  destroy  Tivf)  axd  X*  (4)  axd 
tie  integrab  of  Te(k)  aad  Ta[t)  met  all  modes  are  alio  lero 
after  traxsforaadoa.  Eq.  (4)  fcfls  tkat  7}/ (4)  b  also  defined  as 
tie  projection  of  1(4)  in  tkb  plane  as  wtfl  as  T/(4).  Hence  tie 
cortteir  redact  jo  a  medanbaa  observed  numerically  by  SktO- 
man  aad  PolifteJS),  tke  leadeacy  of  ike  alignment  of  A (4)  aad 
k,  is  also  essential  to  tke  keh'citj  transfer. 

We  tosch  os  tie  fora  of  tke  trass  for  spectra  ia  tke  dtsspia- 
rion  range.  Is  tke  steady  state  we  obtain  tke  following  bequaJ- 
ity: 

Pjf(k)l<2  kTe(k),  (5) 

wiere  tke  equality  kolds  for  maximum  kebdty  modes  irrespec¬ 
tive  of  tie  steady  condition.  Note  tkal  tke  inequality  does  sol 
bold  get r rally  Is  oir  fields  eq.  (S)  b  satisfied  with  botk  cases 
(j7if(fc)l/(2fc7i(fi))  b  lie  order  of  .01  for  (a)  and  0.1  for  (b)) 
although  tke  steady  condition  b  sot  satisfied. 

B.  Energy  and  helidty  transfer  spectra 

All  cases  ia  Hg.  2  wkere  we  treat  tke  eaergy  traasfor  for  case 
(a)  RH  skow  tkat  the  HC  method  b  tke  most  effective  cf  three 
aad  tie  carve  S  b  very  dose  to  tie  curve  N  aad  tkat  tke  HC 
method  b  almost  tke  same  as  tke  filtered  case  (carve  F).  In  Fig.  2 
(15  <  4  <  32)  tke  reshuffled  casee  (tke  curves  R.C  and  S)  aad 
tie  filtered  case  (tie  carve  7)  have  a  peak  (lie  eaergy  barrier)  at 
tie  boaidary  between  tie  reshuffled  region  aad  the  unresbuffled 
cse.  Tie  reshuffled  regioa  (4  >  15)  of  the  HC  is  destroyed 
completely  as  tkat  of  ike  filler*  c*'.-.  Note  that  the  definition 
of  A(i)  depends  on  the  correlations  between  all  scales.  Oc  the 
other  hand,  fire  carves  are  closer  to  each  other  ia  the  larger 
scales  (4  <  10).  The  small  scale  reshuffling  does  not  affect  the 
larger  scales  noticeably.  In  the  case  (6  <  k  <  11),  whose  figure 
b  omitted,  in  tie  small  scales  next  to  the  reshuffled  region  of  the 
curves  C  aad  F  we  observe  the  hollows  in  contrast  tke  curves  N 
and  S.  The  HC  works  as  if  it  cat  the  largei  scales,  0  <  k  <  5. 
We  sttess  that  the  results  for  case  (b)  iin  is  almost  the  same 
as  the  above;  the  helidty  does  no:  affect  basic  mechanism  of  the 
correlations  which  we  treat  here. 


FIG.  2.  Energy  transfer  spectra  in  lia  lia  scale  for  (a)  RH.  The 
symbol  K  indicates  the  nondistorbed  case;  R t  HR,  C;  HC,S:$A 
and  F.  Filtered  case.  These  notations  are  used  throughout  ihu 
piper.  N:  Solid  line,  R:  Dotted  line,  C:  Dashed  line,  S;  Chain- 
dotted  line  and  F;  Solid  line.  15  <  k  <  32, 


Fig.  3  shows  tke  helidty  transfer  spectra  for  (a)  RH  (IS  < 
k  <  32).  We  cannot  see  a  dear  peak  at  ike  boundary  between 
the  reshuffled  region  and  tke  cnzesicSed  one.  In  the  reshuffled 
regioa  (4  >  15)  ve  cannot  find  a  dear  tendency  about  five  curves 
as  is  tke  energy  Iran- for.  five  curves  are  doser  to  each  other 
in  tke  larger  scales  (/  <  10).  Tke  curves  F  and  C  are  not  so 
dcse  as  those  ia  the  energy  transfer  spectra.  Tke  small  scale 
reshuffling  does  net  affect  the  larger  scales  noticeably. 


FIG.  3.  Helidty  transfer  spectra  in  lia-lin  scale  for  (a)  RH.  for 
15  <  4  <  32. 

Finally  we  consider  the  helidty  transfer  spectra  for  (b)  MH 
whose  figure  b  also  omitted.  We  point  out  one  thing,  that  b, 
that  the  tendency  b  very  similar  to  the  energy  transfer  spectra 
although  tke  maximum  helidty  condition:  T//(4)  =  2kTe(k)  b 
not  satbfied. 

IV.  CONCLUDING  REMARKS 

We  have  compared  tke  reshuffled  fields,  HR.HC  aad  $A  from 
tke  undisturbed  one  N  aad  tke  filtered  one  F  about  both  energy 
and  helidty  transfer  spectra.  It  b  found  tkat  in  all  cases  the  HC 
method  b  the  most  effective  ia  destroying  the  ordering  of  the 
transfer  of  three  and  it  works  as  if  it  were  mode-filtering  except 
Ike  helidty  transfer  in  random  hdidty  forcing  case.  By  keeping 
the  correlation  betneen  the  imaginary  parts  of  all  modes  the 
energy  b  transferred  in  the  HR.  The  correlation  between  modes 
b  more  important  than  that  within  each  mode  (i*.  the  helidty 
assodated  phase)  The  SA  method  gives  the  results  similar  to 
tke  undisturbed  cases  Tkb  suggests  (hat  ike  amplitude  cor¬ 
relation  between  tke  real  patt  and  tke  imaginary  one  may  not 
be  $0  essential  to  the  nonl.near  transfer  as  (be  phase  relations. 
When  we  applied  all  three  methods  to  the  range,  15  <  4  <  32 
simultaneously,  wc  obtained  almost  tke  same  result  as  the  case 
HC.  Hence  keeping  the  helidty  spectrum  has  no  spccnl  mean¬ 
ing.  The  helidty  transfer  in  maximum  helidty  forring  case  has 
characters  similar  to  the  energy  transfer. 

REFERENCES 

[1]  E<  Levich,  Y.  Murakami  and  L.  Shtilnun,  submitted  to 
Phys.  Lett.  A. 

[2]  Y.  Murakami,  L.  Shtilman  and  E.  Levich,  submitted  to  Phys.  Flu¬ 
ids  A. 

[3]  W.  Polifke  and  L.  Shtilman,  Phys.  Fluids,  A1  (1939)  2025. 

||J  W  Polifke,  Thesis,  City  College  of  CUNY,  19 DO. 

(5j  L  Shtilman  and  W.  Polilkc,  Phys  Fluids  Al,  (1939)  778. 


1995 


INFRARED  AND  ULTRAVIOLET  BEHAVIOR  OF  TURBULENCE 
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Abstract  la  lie  pitted  paper  we  discus  two  ways  of  renormalization 
groip  zaetlod  ia  turbulence:  oae  by  eHmixaring  shorter  wavelength* 
asd  tie  oiler  by  eEminatiag  longer  wavelengths.  Tie  former  proce- 
d*rt  gives  the  eddy  tiscosity  and  the  additional  conelatioa  of  external 
Ibices  as  a  fanclioa  of  tie  cutoff  wavenumber,  predicting  a  reasonable 
value  of  Kolmogorov  coastast.  Tie  latter  way  helps  us  baOd  a  model, 
wlicl  may  interpret  tie  iatemitteat  effects  La  f«Uy  developed  turbu¬ 
lence. 

I.  INTRODUCTION 

Since  a  renormalization  group  (RN’G)  method  was  successfnlly  applied 
to  critical  phenomena  by  Wflsoa  (1],  toibaleace  las  beea  oae  of  the 
most  attractive  objects  for  the  application  of  the  method.  The  diffi- 
tally  in  tie  application  to  tnrbaleace,  however,  seems  to  stem  from 
the  fact  that  the  fluctuation  soarce  is  located  ia  low  waveaamber  re¬ 
gion,  while  the  sink  is  ia  high  waveaamber  region.  Under  this  dream- 
stance  the  simple  elimination  of  high  wavenumber  components  cannot 
renormalize  lie  fluctuation  soiree  term  appropriately  and  the  elimina¬ 
tion  of  low  wavenumbers,  on  the  other  land,  fails  to  express  the  silk 
in  a  renormalized  way.  Hence,  the  simultaneous  elimination  of  low 
and  high  wavelengths  seems  to  be  needed;  every  eddy  receives  energy 
mainly  from  larger  eddies  and  transfer  it  to  smaller  eddies  at  the  same 
time,  which  must  be  an  essential  cascade  unit.  Such  a  anil  might  not 
be  established  by  the  RN’G  method  carried  ont  in  oae  direction. 

In  SecII  we  ask  the  infrared  behavior  of  tarbnlence  by  eliminating 
shorter  wavelengths,  in  which  external  driving  forces  are  presumed. 
The  estimated  value  of  Kolmogorov  constant  and  the  coefficient  of  the 
eddy  viscosity  axe  in  reasonable  agreement  with  the  current  values.  In 
OCC.H1  the  ultraviolet  behavior  is  studied  by  eliminating  longer  wave¬ 
lengths.  In  the  procedure  the  utermittency  effects  are  examined  in 
the  framewoik  of  RNG. 

II.  INFRARED  BEHAVIOR  OF  TURBULENCE 

Ia  tarbnlence  the  most  contributions  come  from  near  eddies  11  wave¬ 
number  space.  If  we  eliminate  eddies  with  q  >  A,  therefore,  the  eddy 
viscosity  must  be  dependent  on  the  cotoff  A.  MS  (2J  and  ZVH  (3) 
obtained  such  a  cotoff  dependent  viscosity  without  relying  on  the  dis¬ 
tant  interaction  approximation  in  contrast  to  Yakhol  and  Orszag  (4), 
However,  the  additional  stochastic  forces  cannot  be  forgotten  in  such 
a  treatment.  In  the  present  paper,  we  numerically  calculate  the  eddy 
viscosity  v(L,A)  and  correlation  D(k,A)  of  external  forces  as  a  funo 
tiou  of  A.  To  this  end,  wc  first  assume  that  the  bare  external  driving 
forces  described  by  the  correlation 

<  /.(k  ,w)/,(kV)  >=  2Dok'*(2xy^,J-ktkJfki)5{k+l')S(wWl 

as  done  by  YO  (4).  Eliminate  the  components  with  A/s  <  q  <  A  and 
rescale  the  velocity  components  to  compare  the  original  form.  Several 


1.6  -1.2  -0.8  -0.4  0 

tog  t 


Fig.J.  The  eddy  v»cos;iy  scaling  function  /  for  various  values  of  s  as 
a  function  of  1  =  kjA. 


values  arc  chosen  for  s,  ranging  from  1.$  to  2  2  The  obtained  resells 
slightly  depend  on  the  value.  The  recursion  equations  for  the  eddy 
viscosity  and  the  correlation  of  forces  arc  derived. 


A  Eddy  Viscosity  and  Correlation  of  External  Forces 


express  the  viscosity  and  correlation  in  terms  of  scaling  functions 
as  v\JttA)  =  h*-am(k/A>  and  D(fc,A)  =  i-,n(fc/A).  Substitution  of 
these  expressions  into  thv  recursion  equations,  combined  with  the  en 
ergy  consideration,  yields  y  —  d  and  z  ~  2/3  in  accordance  with  the 
Kolmogorov  scaling.  Then,  the  viscosity  and  correlation  take  the  forms 
v(k,  A)  =  A-«'3/(*/A)  and  D(k,  A)  =  D0tw(l  +  ( Wy(*/A)).  For 
the  numerical  calculation  Do  =  has  been  employed.  The  scaling 
function  /( t )  with  I  =  Jfc/A  is  depicted  for  various  s  in  Fig.l,  where 
a  cusp-like  structure  is  sees  near  the  cutoff  for  large  s,  in  agreement 
with  the  observation  by  Kraichnan  (5].  This  is  reasonable  because  he 
sets  s  equal  to  infinity,  in  principle,  in  the  test-field  model  calculation 
For  small  s  such  as  1.5  a  cusp  is  not  found,  in  accordance  with  MS  (2) 
employing  s  =  1/0.7  =14  ZVH[3]  obtained  the  small  cusp  for  the 
same  value  of  s  by  introducing  the  triple  nonlinear  terms.  Unfortu¬ 
nately,  however,  their  triple  terms  are  not  Galilean-invariant  In  Fig  2 
«e  depicted  /(f)  and  y(f)  only  for  J  =  2.  We  notice  that  the  additional 
correlation  of  forces  contributes  to  the  original  one  by  50%  near  the 
cutoff,  implying  that  quantitative  collections  are  significant  as  com¬ 
pared  with  the  case  ia  the  absence  of  the  additional  force..  Another 
important  point  is  that  g  acts  as  the  backscatler  from  me  subgrid  scale 
in  LES,  reducing  a  true  eddy  viscosity  considerably  near  the  cutoff. 
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Fig-3.  Kolmogorov  couiui  c*  and  tie  coefficient  of  the  eddy  tmcos- 
ity  Cy  vs  s. 

B.  Kolmogorov  Constant 

Pot  E(k)  =  CKiW\rW  and  v{k)  =  Cv71,3k-*li.  The  above  nu- 
meiical  molts,  combined  with  the  requirement  Cy/C \  —  0.1904  by 
the  energy  conservation,  estimate  Ck  and  Cv  «  s,  as  shown  in  Fig X 
Both  numbers  are  quite  reasonable  although  they  w-cally  depend  on 
j.  The  details  in  this  section  will  be  reported  elsewhere  [6] 

Ill.  ULTRAVIOLET  BEHAVIOR  OF  TURBULENCE 

The  purpose  in  this  section  is  to  investigate  how  the  intermit tencj  ef¬ 
fects  are  appreciated  in  the  framework  of  RNG.  Since  those  effects  are 
enhanced  with  decreasing  scale,  the  scccessive  elimination  of  longer 
wavelengths  must  reveal  the  existence  of  the  intermittent  effects.  In 
order  to  proceed  in  that  direction,  we  had  better  recall  the  experimen¬ 
tal  situation,  in  which  pth  order  moments  of  velocity  t(r)  on  scale  r 
are  observed  to  be  <  t(r)>  >~  r^i(L/r)T'  ~  r*'.  (,  =  p/3  -  ji>  is  not 
simply  proportional  to  p.  For  small  p  (,  is  almost  p/3,  but  for  large 
p  it  is  far  smaller  than  p/3.  Thb  fact  means  that  a  single  scaling  ex¬ 
ponent  for  the  velocity  field  cannot  interpret  the  exponents  associated 
with  structure  functions  of  all  orders-  It  is  hardly  believed,  therefore, 
that  any  simple  modification  of  the  exponent  y  in  the  correlation  of 
external  forces  will  make  satisfactory  prediction  on  the  internnttency 
effects-  We  need  two  scaling  factors  at  least.  A  two-scalmg  model  was 
proposed  previously  (7),  in  which  (,  reads 

(  =  Jp/3,  ifp<*S3*/(J*-2); 

\(l-;)p  +  z,  if  p  >  p<, 

where  z  is  a  fitting  parameter.  If  z  =  0  84  is  chosen  (pe  s=  4  85),  the 
predicted  values  agree  with  the  experimental  data  [8)  quite  well.  Of 
interest  is  (hat  the  lower  order  moments  obey  the  Kolmogorov  scaling 
characterized  by  the  exponent  1/3,  while  the  higher  order  ones  do  the 
intermittent  scaling  characterized  by  the  exponent  1  -  z. 

The  essentia]  idea  behind  the  RNG  of  elimination  of  longer  wave¬ 
lengths  is  as  follows.  To  begin  with,  we  have  to  presume  a  fictitious 
eddy  viscosity,  which  drains  energy  from  the  system;  molecular  vis¬ 
cosity  does  not  suffices.  This  fictitious  viscosity  is  a  counterpart  of 
the  presumed  external  forces  in  the  process  of  elimination  of  shorter 
wavelengths.  Consider  the  behavior  of  an  eddy  with  wavenumber  k, 
which  interacts  with  eddies  with  q  <  A  and  with  p  ~  |k  -  qj.  The 
contributions  from  the  interaction  are  different,  depending  on  whether 
p  is  smaller  than  A  or  not.  (1)  If  p  <  A,  the  interaction  terras  act  on 


the  eddy  k  as  stochastic  forces.  Thu  case  is  possible  only  for  k  <  2A. 
The  correlation  of  these  forces  is  described  by  a  type  of  the  correlation 
assumed  in  Sec.II,  although  it  is  limited  to  the  range  k  <  2A.  The 
obtained  correlation,  combined  with  the  presumed  viscosity,  derives 
the  Kolmogorov  scaling.  (2)  If  p  >  A;  the  velocity  components  p  must 
not  be  eliminated.  The  condition  b  realized  for  any  value  of  k.  The 
coEliibnlioas  in  thb  case  are  conveniently  decomposed  into  two  parts: 

(а)  the  convection  and  (b)  the  distortion  of  the  eddy  b  by  larger  eddies 
q.  The  convection  velocity,  which  b  predominantly  determined  by  the 
largest  eddies,  carries  all  eddies  together  with  no  change  in  geometrical 
configuration;  its  effect  b  readily  removed  in  the  frame  moving  at  the 
convection  velocity.  The  dbtortion  b,  on  the  other  hand,  provided  by 
the  velocity  gradient  of  larger  eddies  (mostly  near  eddies);  the  eddy 
is  stretched  in  a  certain  direction  (or  directions)  and  shrinks  it  ta  an¬ 
other  direction  (or  directions).  The  term  completely  different  from  the 
stochastic  forces  b  expected. 

The  dutortional  contribution  was  investigated  in  the  Fourier  anali- 
zed  Xavier-  Stokes  equation  in  ReL(9],  where  we  have  shown  that  the 
strain  rales  due  to  eddies  of  a  certain  scale  stretch  and  shrink  the  vor- 
tidty  field  of  smaller  scales,  whose  strain  rates  excite  the  vortidty  of 
even  smaller  scales.  If  a  chain  of  the  exdtation  occurs  randomly,  the 
process  may  be  treated  as  stochastic  forces.  If  the  exdtation  takes 
place  systematically,  however,  a  land  of  coherent  self-similar  structure 
will  be  built  ia  wavenumber  space.  A  pair  of  oppositely-oriented  vor¬ 
tex  sheets  are  a  candidate  for  such  a  self-similar  structure;  the  two 
sheets  approach  each  other  and  the  vortiatj  b  enhanced  during  the 
pxocos.  The  time  evolution  of  the  vortidty,  then,  can  be  related  to 
the  dynamical  scaling  form  proposed  in  ReL[7].  Hence,  the  derived 
iortical  structure  is  expected  to  be  responsible  fox  the  intermittent 
eff<x».  This  picture  will  be  developed  ia  the  framework  of  RNG 
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ABSTRACT  A  Finite  Element  model  for  predicting  the  motion  of 
axially-symmetnc  loudspeaker  transducers  using  thin  shell  of 
revolution  elements  has  been  developed.  The  displacement  of  the 
diaphragm  is  calculated  for  a  constant  force,  and  the  post  processing 
required  to  correct  for  the  actual  force  is  described.  The  effects  of  the 
electrical  impedance  of  the  voice  coil  and  the  mechanical  impedance 
due  to  the  motion  of  the  coce  2re  Included. 

1.  INTRODUCTION 

Although  the  ultimate  evaluation  of  a  loudspeaker  depends  on 
measuring  the  sound  radiated  by  it,  this  radiated  sound  is  not  simply 
related  t*o  the  vibration  of  the  diaphragm  caused  by  the  signal  fed  to  it 
In  order  to  apply  the  techniques  of  computer  aided  engineering  to 
ioudspeakar  design,  it  therefore  seems  necessary  to  break  the 
modelling  process  into  stages. 

The  work  presented  here  shows  how  a  Finite  Element  (FE)  model  can 
be  used  to  predict  the  vibration  of  a  drive  unit  when  it  is  driven  by  a 
constam  force.  Vvc  then  describe  a  method  of  correcting  for  the  real 
force  provided  by  an  amplifier.  This  approach  allows  the  independent 
modelling  of  the  electromagnetic  parts  of  transduction.  We  present 
results  for  a  model  of  a  low  frequency  conical  drive  unit  which  can  be 
compared  with  measured  properties.  Having  gained  confidence  in  the 
FE  model,  it  is  then  possible  to  go  on  to  predict  the  radiated  sound 
pressure  using  methods  such  as  the  Boundary  Element  technique. 

2.  THE  FINITE  ELEMENT  MODEL 

The  fust  stage  in  the  calculation  of  the  sound  pressure  produced  by  an 
arbitrary  loudspeaker  drive  unit  is  to  calculate  the  displacement  of  the 
diaphragm  when  it  is  driven  by  an  arbitrary  force.  The  motion  of  the 
diaphragm  is  governed  by  a  partial  differential  equation  and  is 
therefore  ideally  suited  to  solution  by  the  FE  method. 

Previous  work  (sec  for  example  Jones  er  al  [1]  and  references 
contained  therein)  has  assumed  that  the  drive  unit  is  axially 
symmetric.  This  assumption  is  reasonable  both  because  the  use  of 
homogeneous,  isotropic  materials  results  in  symmetric  vibrations  of 
the  diaphragm,  and  because  asymmetric  modes  are  only  weakly 
excited  by  the  predominantly  axially-directed  force.  The  present 
computer  code  allows  for  the  inclusion  of  asymmetries,  thus  making 
the  model  truly  three-dimensional,  but  only  at  the  cost  of  large  losses 
tn  computational  efficiency.  In  the  following  we  shall  restrict 
ourselves  to  describing  the  method  used  to  model  a  conical  drive  uim 
ior  low  frequency  sound  reproduction.  The  method  is  however 
applicable  to  other  axially-symmetnc  structures  and  has  been  used 
successfully  in  the  design  of  dome  tweet cts. 


Figure  1  Schematic  picture  of  a  bass  drive  unit  (left  diagram)  and 
the  components  of  the  idealisation  used  to  model  it 


A  typical  dme  unit  consists  of  the  components  shown  ir.  figure  1. 
The  FE  method  requires  values  of  Young’s  modulus,  density, 
damping  and  Poisson’s  ratio  for  all  the  materials  used  in  the  structure 
in  order  to  predict  its  displacement.  A  database  has  been  set  up  which 
contains  models  of  components  already  in  construction  and  improved 
versions  of  these  components.  The  accessing  program  uses  a  WIMP 
environment  with  one  window  leading  on  to  another  or  to  a  specific 
I/O  screen.  This  database  easily  permits  alterations  to  components  of 
the  drive  unit  and  the  testing  of  novel  design  features. 

The  vibration  of  the  structure  is  governed  by  the  linear  elasticity 
equation.  Damping  can  be  included  in  this  equauon  by  the  use  of  a 
complex  Young’s  modulus,  (l+jqJE,  in  the  stiffness  matrix  The 
material  loss  factor  q  has  been  measured  at  3&W  for  paricular 
materials  (11.  It  has  been  found  that  the  drive  unit  can  be  discretised 
economically  by  the  use  of  a  three  noded  thin  shell  of  revolution 
element  as  described  in  reference  ( 1  J.  This  particular  element  allows 
the  displacement  normal  to  the  shell  to  vary  as  a  fifth  order 
polynomial  instead  of  the  more  usual  cubic.  The  element  thus  allows 
for  internal  bending  to  some  extent 

If  the  force  depends  sinusoidally  on  time  (i.c  F  =  f  eJ®*)  then  the 
displacements  are  similarly  sinusoidal.  When  the  tructure  is 
discretised  the  displacement  of  the  system  is  defined  by  the  solution 
of  the  set  of  linear  equations 

(-Mk>2  +  K)  u  =  f  (1) 

where  M  and  K  are  the  global  mass  and  stiffness  matrices  of  the 
system,  u  is  the  complex  vector  of  nodal  displacements  and  f  is  the 
vector  of  applied  forces. 

A  finite  element  program  has  been  developed  to  form  and  solve  the 
set  of  equations  1.  The  equations  are  solved  using  an  efficient 
Gaussian  elimination  alg/nthm.  An  additional  option  is  the  use  of 
modal  analysis,  giving  information  on  the  manner  in  which  the  cone 
breaks  up  at  greatly  reduced  computational  cos* 

3.  POST-PROCESSING 

The  Finite  Element  calculations  described  above  eventually  give  the 
displacement  vector  of  the  cone  for  the  constant  force  applied  at  the 
voice  coil.  The  displacements  calculated  using  this  idealised  force 
must  be  corrected  for  the  effect  of  the  real,  frequency  dependent 
force.  In  reality  the  current  flowing  in  the  coil  is  dependent  both  on 
the  electrical  circuit  of  which  the  coil  forms  a  part  and  the  equivalent 
electrical  impedance  of  the  mechanical  parts,  such  as  the  cone  itself 
The  force  on  the  coil  depends  linearly  on  the  current  Since  the 
amplitude  of  the  cone  (and  therefore  its  velocity)  varies  with 
frequency,  the  effect  of  the  motion  of  the  diaphragm  must  be  included 
in  the  post-processing.  The  way  in  which  this  should  be  done  was 
indicated  by  Shepherd  and  Alfredson  (2J. 

Since  the  elasticity  equation,  eq.  1,  assumes  that  the  displacement  of 
the  diaphragm  for  a  given  frequency  depends  linearly  on  the  force, 
the  correction  to  the  displacement  is  given  by 

UccoC®)  =  UCJ;(0)) .  (2) 

Shepherd  and  Alfredson  define  a  common  point,  called  the  driving 
point,  between  the  electrical  and  the  mechanical  parts  of  the 
transducer.  This  is  the  point  at  which  the  voice  coil  former  joins  the 
cone.  Our  finite  clement  results  calculate  the  frequency  dependent 
velocity  at  this  point,  vd(©)  (*s  jwuo),  for  a  given  constant  force  F. 
The  total  lumped  mechanical  impedance  of  the  system  at  the  driving 
point  is  defined  by 
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By  calculating  the  current  flowing  in  the  voice  coil  due  to  the 
(constant)  emf  E  supplied  by  the  amplifier,  we  find  by  simple  algebra 
that  the  force?  is  given  by 

where  BS  is  the  magnetic  coupling  constant  of  the  coil  and  Z£(co)  is 
the  impedance  of  the  electrical  circuit  which  can  be  modelled 
theoretically  or  empirically  13). 

The  correction  to  the  calculated  displacements,  F(co)/F,  adjusts  both 
the  amplitude  and  the  phase  of  the  results.  The  electrical  circuit  means 
that  at  high  frequency  the  amp  brad e  at  a  given  point  decreases  linearly 
with  0).  The  mechanical  parts  damp  the  velocity  when  it  is  large, 
particularly  at  cone  resonances. 

4.  COMPARISON  OF  RESULTS 

The  drive  unit  analysed  here  had  a  13  cm  diameter  cone  made  of 
cobcx  and  a  1.3  cm  surro.md  made  of  PVC.  In  the  interests  of  clarity, 
results  are  shown  for  a  unir  a  dust  cap. 

The  cone,  surround  and  voice  coil  were  discrctiscd  into  25  finite 
elements  of  the  type  described  in  section  Z  The  FE  program  was  ran 
and  solutions  obtained  for  the  complex  displacement  of  the  diaphragm 
for  a  constant  driving  force  applied  at  the  voice  coil  along  the  axis  of 
symmetry.  The  other  boundary  condition  was  that  the  surround  was 
stationary  at  its  edge.  The  FE  results  were  then  post-processed  using 
the  theory  described  in  section  3,  in  order  to  correct  for  the  real 
driving  force  supplied  by  the  electrical  circuit  of  the  voice  coil  and  the 
equivalent  electrical  circuit  due  to  the  finite  impedance  presented  by 
the  cone. 

Figure  2  shows  the  FE  results  after  post-processing.  It  can  be  seen 
that  at  low  frequency  the  cone  exhibits  piston  like  behaviour,  with  a 
maximum  in  the  velocity  occurring  towards  the  outer  edge  of  the 
cone.  The  flat  portion  at  the  centre  cf  the  diagram  corresponds  to  the 
zero  velocity  of  the  magnet  pole  piece.  As  the  frequency  increases  the 
position  of  maximum  velocity  amplitude  moves  towards  the  centre, 
and  beyond  this  point  the  cone  starts  to  move  out  of  phase  with 
respect  to  its  centre.  We  see  a  number  of  resonance  peaks  in  the 
velocity  of  the  whole  cone,  for  example  at  frequencies  of  1080  Hz, 
2040  Hz,  2650  Hz,  3440  Hz  and  4320  Hz.  At  about  1 100  Hz  the 
first  breakup  mode  of  the  cone  is  seen,  and  above  this  frequency  the 
number  of  separate  parts  of  the  cone  vibrating  out  of  phase  with  the 
voice  coil  increases. 

The  main  differences  between  the  prc-  and  post-proccsred  data  are 
found  to  be  the  magnitude  of  the  maxima  tn  the  velocity,  and  the 
velocity  of  the  voice  coil  at  high  frequency. 

Velocity  amplitude 
(arbitrary  units) 


Meridional  Coordinate  0 
(x  10-1  cm) 


Frequency  (kHz) 


velocity  occurring  at  the  same  frequencies  to  within  10%.  The 
measurements,  show  a  moderate  amount  of  asymmetry  in  the 
behaviour  of  the  drive  unit  which  is  not  allowed  for  in  the  calculated 
rcsults.The  other  main  discrepancy  is  the  velocity  at  the  voice  coil  at 
high  frequencies.  This  appears  to  be  negligible  in  the  measurements, 
but  is  seen  to  have  a  finite  component  from  the  calculations. 
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Figure  2  Finite  Element  prediction  of  the  velocity  of  a  drive  unit 
along  a  diameter  (meridian)  of  the  diaphragm. 

These  results  should  be  compared  with  laser  Doppler  interferometry 
measurements  made  at  B&W  Loudspeakers  (4),  shown  in  figure  3. 
The  agreement  between  these  sets  of  results  appears  to  be  good,  the 
qualitative  behaviour  being  similar  and  the  maxima  and  minima  in  the 


Figure  3  Measured  velocity  of  a  corresponding  dnve  unit  using 
laser  Doppler  interferometry. 


5.  CONCLUSION 

We  have  described  a  method  of  predicung  the  vibration  of  axially- 
symmetric  loudspeaker  transducers  using  the  FE  method.  The 
predictions  have  been  compared  with  the  velocities  of  existing  dnve 
units  measured  by  a  laser  interferometer,  and  the  model  found  to  give 
substantially  accurate  results  The  importance  of  this  verification  of 
the  FE  method  is  that  if  there  arc  discrepancies  between  measured  and 
predicted  radiated  sound  pressures,  then  the  errors  must  have  been 
introduced  by  the  method  used  to  calculate  the  pressure. 

Improvements  to  the  calculation  of  the  mechanical  vibrauon  of  the 
diaphragm  should  take  account  of  the  interaction  of  the  structure  with 
the  surrounding  air.  The  construction  of  fully  coupled  structural- 
acoustic  models  is  possible  but  at  present  they  represent  a  significant 
increase  in  expense  given  the  relatively  small  increase  in  accuracy 
gained.  The  development  of  faster  algorithms  for  this  modelling 
process  is  an  area  of  current  interest. 

Areas  requiring  further  work  for  a  complete  loudspeaker  to  be 
modelled  accurately  include  modelling  the  magnetic  field  which 
governs  the  force  on  the  diaphragm,  modelling  the  interaction  of  the 
transducer  with  the  cabinet  and  calculating  the  sound  field  created  in  a 
listening  room. 
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Abstract  -  This  work  presents  the  algorithms  that  can  be 
usefully  implemented  in  molecular  dynamics  simulations 
and  reports  ah  analysis  of  the  accuracy  of  such  methods. 


L  INTRODUCTION 


Molecular  dynamic  simulation  methods  are 
currently  used  in  condensed  and  solid  state  physics  to 
evaluate  many  properties  of  liquids  and  crystalline 
targets. 

The  standard  procedure  upon  which  a  molecular 
dynamic  simulation  is  constructed  is  to  consider  a 
crystallite  containing  a  number  N  of  atoms  interacting 
with  realistic  forces.  The  trajectories  of  the  atoms, 
considered  as  point  charges,  are  described  by  classical 
mechanics  and  arc  determined  by  solving  the  second 
order  differential  equation 

r  =  a(r) 
a(r)=  V  V(r) 

where  r(t),  v(t)  and  a(t)  arc  vectors  with  3N  components, 
representing  the  x,  y  and  z  components  of  the  position, 
velocity  and  acceleration  of  each  of  the  N  atoms  in  the 
computational  cell.  V(r)  is  the  potential  describing  the 
interatomic  forces. 

Generally  large  N  values,  in  the  range  1000-10000, 
are  used  either  to  describe  lattice  properties  in  solid  state 
theory  or  for  applications  in  material  processing,  like  ion 
implantation.  Furthermore  the  characteristic  times  of  the 
processes  require  from  10*  up  105  integration  steps.  An 
additional  difficulty  arises  from  the  strong  non-linear 
character  of  the  potential  V(r). 

This  work  revises  the  methods  which  can  be 
advantageously  used  m  molecular  dynamics  simulations 
and  reports  an  analysis  of  the  errors  for  various  time- 
steps  and  cell  dimensions. 


II.  METHODS 


Owing  to  the  duration  of  the  simulations,  multi- 
step  methods  which  make  use  of  tho  evaluation  of  a(r)  at 
previous  steps  are  to  be  preferred  over  those  methods  that 
make  successive  approximations  to  a(r)  or  that  require 
the  calculation  of  the  derivative  of  a(r).  On  the  other  side 
the  size  of  the  vectors  r,  v  and  a  render  difficult  the 
retention  of  information  from  many  previous  steps.  A 
simple  approach  is  the  midpoint  predictor 

ra*i  =  ra.i  +  2 Ava 

and  the  second-order  Moulton  corrector 

va»j  =  vE  +  (aa*i  +  anW2 
rB.i  =  ra  +  (Vq.j  +  vtOh/2 


where  A  represents  the  time  step  A  =  t  ,  -  to.  This  method 
is  referred  in  the  Tables  below  as  P(EC).  This  indicates 
that  a  predictor  is  used  to  calculate  r  and  there  are 
iterations  of  the  sequence  evaluation  of  a,  correction  of  v 
and  r.  Also  a  third-order  Adams-Bashforth  predictor 

rB*x  =  ra  +  (23va  -  16Vd  +  5va^)M2 

and  Adams-Moulton  corrector  can  be  used  in  this 
sequence 

va.i  =  va  +  (5a0»i  +  8aa  -aa-i)A/12 

ra*i  =  ra  +  (5v0.i  +  8va  -  va.i)A/12 

These  methods  will  be  indicated  as  P(EC)AB-AM. 

Alternatively  Runge-Kutta  (RK)  methods  can  be 
applied  which  in  the  lowest  order  take  the  form 

v(t+h)  =  v(t)  +  [2a(t+h)  +  5a(t)  -  a(t-h)]  A/6 

x(t+h)  =x(t)  +  v(t)h  +  (4a(t)-a(t-h)lA2/6 


III.  RESULTS 


A  senes  of  tests  has  been  performed  in  order  to 
compare  the  vanous  methods  described  above.  The 
simulation  cell  represents  a  silicon  lattice  with  N  in  the 
range  1000-5000.  The  interatomic  forces  are  desenbed  by 
a  simple  Morse-type  potential  of  the  form 

V(r)  =  A  lcxp-2a(r*r0)-2exp-a(r-r0)J 

Tho  atomic  coordinates  and  velocities  at  the 
beginning  of  the  simulations  are  adjusted  to  correspond  to 
the  ones  of  a  lattice  at  room  temperature.  As  a  test  for 
numcncal  accuracy  the  energy  conservation  is  used.  In 
Table  I  we  report  the  average  energy  change  per  step 
<dE/step>  ((meV))  taken  on  1000  steps  for  h  =  10*  H  s  In 
Tablo  If  a  time-step  h  =  10*13  s  one  order  of  magnitude 
larger  has  been  used.  For  the  iterative  methods  the 
number  of  iterations  is  equal  to  2,  as  this  value 
represents  a  good  compromise  between  accuracy  and 
computer  times. 


TABLE  I 

h  a 

10'14  s 

N  =  1000 

N  =  5000 

method 

vdE/stcp> 

«iEVstcp> 

P(EC) 

-  10'7 

-  10-7 

P(EC)AM-AB 

-10-5 

-  5x10-5 

RK 

-10-3 

-8x10-3 

2000 


TABLE  II 

h  = 

10'13  s 

N  =  1000 

N  =  5000 

method 

<dE/steo> 

<dEystep> 

P(EC) 

-10-7 

».  ~  5xl0"6 

PCEOAM-AB 

- 10'5 

-  10-3 

RK 

-10-2 

-5xl02 

It  is  seen  that 

i)  the  simple  P(EC)  offers  a  stable  and  highly 
accurate  solution. 

ii)  the  use  of  P(EC)AM-AB  leads  to  errors  two  orders 
of  magnitude  larger 

iii)  the  worse  results  are  obtained  with  RK.  It  has 
been  found  that  the  increase  of  thev  order  of  the  RK 
does  not  lead  to  significative  decrease  of  the  errors. 

In  conclusion  low  order,  iterative  methods  seems  to 
be  preferred  for  applications  in  molecular  dynamics 
simulations. 
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A  FAST  METHOD  FOR  APPROXIMATE  SEISMIC  RAY-TRACING. 
W.F.D.  Theron 

Department  of  Applied  Mathematics 
University  of  Stellenbosch,  South  Africa. 


THE  MATHEMATICAL  MODEL. 


This  paper  describes  a  fast  method  for  approximate  seismic 
ray-tracing  in  a  two-dimensional  field,  based  on  a  generalisation 
of  the  method  described  by  Vidale  (1988)  for  finding  the  travel¬ 
time  field  from  the  finite  difference  approximation  of  the  eikonal 
equation. 

The  method  described  here  is  eminently  suitable  for 
implementation  on  a  local-memory  message-passing  multi¬ 
processor. 

INTRODUCTION. 

The  work  which  is  described  here  is  the  first  stage  of  a 
program  for  seismic  tomography,  in  which  a  large  existing  data¬ 
base  of  natural  seismic  events  will  be  used  to  gain  insight  into  the 
state  of  the  rock  mass  This  software  will  be  incorporated  into 
the  Integrated  Seismic  System  (I]. 

The  physical  properties  of  the  rock  mass  are  specified  by  its 
slowness  field,  which  is  a  scalar  field  w;'h  the  slowness  at  a  point 
defined  as  the  inverse  of  the  speed -with  which -a  P-wave 
propagates  at  that  point.  This  slowness  field  is  discretised,  and 
the  values  at  each  grid  point  are  the  basic  unknowns  in  our 
problem. 

The  tomographic  method  is  described  by  Kissling  (2J,  and  is 
based  on  the  idea  that  the  travel-time  for  a  very  large  number  of 
seismic  rays  is  known,  where  each  ray  travels  along  a  path  from 
the  source  of  the  event,  through  the  slowness  field,  to  the 
receiving' station.  An  initial  slowness  field  is  assumed,  and  an 
approximate  ray-tracing  method  is  used  to  calculate  the  travel¬ 
time  through  this  field  for  each  ray.  The  difference  between  the 
measured  and  calculated  travel-times  is  minimised  by  iteratively 
improving  the  slowness  field.  The  coefficients  of  the  equations 
used  in  this  minimising  procedure  arc,  for  each  ray,  the  partial 
derivatives  of  the  travel-time  with  respect  to  the  slowness  at  each 
point. 

Various  methods  have  been  described  for  carrying  out 
approximate  seismic  ray-tracing.  The  method  we  have 
implemented  makes  use  of  the  travel-time  field,  which  is 
calculated  as  described  by  Vidale  (3J.  At  this  stage  only  the  two- 
dimensional  case  has  been  implemented;  we  are  busy  extending 
to  the  three-dimensional  case  as  described  by  Vidale  [4], 

Two  schemes  for  tracing  the  rays  through  this  travel-time  field 
have  been  implemented. 

Motivation. 

We  envisage  applying  this  tomography  program  using  the  data 
of  say_onc  thousand  events,  measured  at  say  25  receiving  stations, 
requiring  the  tracing  of  twenty-five  thousand  rays  for  each 
iteration  of  the  method  described  above.  The  need  for  a  very  fast 
ray-tracer  is  self  evident ! 


The  method  described  here  is  eminently  suitable  for 
implementation  on  a  local-memory  message-passing  multi¬ 
processor,  as  the  rays  corresponding  to  any  given  event  are 
completely  independent  of  those  for  the  other  events,  so  that  the 
ray-tracing  phase  of  the  program  is  embarassingly  parallel. 

This  is  a  typical  "farm"  type  ol  parallel  program,  in  which  a 
driving  program  controls  the  execution,  and  the  different  worker 
tasks  calculate  the  rays  for  each  event  and  send  the  required 
results  to  the  driver. 

Furthermore,  this  is  a  very  coarse  grained  algorithm,  with  far 
more  calculation  than  communication,  so  that  a  high  efficiency 
can  be  expected. 


Given  a  rectangular  region  in  the  vertical  x-z  plane,  we  define 
the  slowness  at  a  point  as  the  inverse  of  the  speed  of  propagation 
of  a  P-wave  at  the  point :  s(x,z)  =  I/v(x,z). 

The  travel-time  field  for  a  particular  event  is  denoted  by  t(x,z), 
which  is  a  scalar  field  of  values  of  the  time  required  for  the  wave 
to  travel  to  the  point  (x,z). 

Based  on  certain  assumptions  which  will  not  be  dealt  with 
here,  the  relationship  between  the  travel-time  t  and  slowness  s  is 
given  by  the  eikonal  equation  (I) : 

(3,/3x)2  +  =  S2  (1) 

The  travel-time  for  ray  number  r  is  the  lime  required  to  travel 
from  the  source  to  the  station,  and  is  denoted  by  Tr,  where 

Tr=  J  s(x,z)  dr,  (2) 

taking  the  line  integral  along  the  ray  path. 


We  discretize  the  region  under  consideration  with  horizontal 
dimension  H*Nx,  where  Nx  is  the  number  of  elements  along  the 
x-axis,  and  vertical  dimension  fH^Nz,  with  f  a  dimensionless 
ratio  of  the  sides  of  the  rectangular  elements.  This  is  a  slight 
generalisation  of  Vidale's  approach,  which  is  based  on  square 
elements. 

S(ij)  and  T(ij)  denote  the  slowness  and  travel-time 
respectively  at  the  gridpomt  positioned  at  depth  (row)  i  and 
column  j  The  line  integral  (2)  will  be  approximated  by  taking 
the  sum  of  travel-times  over  short  straight  segments  of  the  ray 
path. 

The  final  results  required  of  this  program  are  the  partial 
derivatives  of  the  travel-time  with  respect  to  the  slowness  at  each 
point ; 

wifcr  (3) 


The  first  step  is  to  obtain  values  for  the  travel-time  to  each 
point,  using  Vidale's  method  for  two  dimensions  (3J.  This 
method  can  be  summarised  as  follows : 

Assume  given  values  for  S(i,))  and  the  location  of  the  source  of 
the  event.  The  values  of  T(ij)  for  the  whole  field  are  then 
calculated  by  starting  at  the  source  and  extending  the  solution  of 
T(i  j)  in  rings  by  using  various  finite  difference  approximations  of 
the  eikonal  equation. 

The  detailed  equations  for  the  various  cases  are  given  in 
Appendix  A.  These  reduce  to  the  equations  given  by  Vidale  for 
the  case  I. 


The  next  step  is  to  use  this  travel-time  field  to  trace  the  ray 
from  the  event  to  the  receiving  station,  and  to  calculate  the  partial 
derivatives  of  travel  time  with  respect  to  siowness  at  each  point. 

At  any  particular  point  on  the  ray,  the  gradient  of  the  travel¬ 
time  field  can  be  determined,  again  using  finite  difference 
approximations.  This  gives  a  vector 

grad(t)  =  i  t  =  (  3!/jx  ;  3,/a*)( 
the  negative  of  which  gives  the  direction  from  which  the  ray 
came. 

Starting  at  the  receiving  station,  we  move  backwards  step  by 
step  to  the  point  where  the  event  occurred.  The  travel-time  of  a 
ray  is  found  by  accumulating  aT,,  the  increments  for  each 
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segment  of  the  ray.  The  partial  derivatives  (3)  can  also  be 
accumulated  for  those  points  affected  by  each  segment  of  the  ray. 

In  the  initial  implementation 
two  schemes  have  been 
investigated.  In  the  3-point 
scheme  the  ray  is  forced  to  travel 
either  horizontally,  vertically  or 
diagonally  through  the  comers  of 
a  particular  element,  as  follows : 

Move  to  point  1  if  0  <  tan  0 

<ar 

ATf  =  H«ri(S  0  +  S,). 

^r/as,  =  m,  i=  o,i. 

Move  to  point  2  if  2f  <  tan  0  <  lhr 
Tr  =  f  H  >ri(S0  +  S?). 

^r/OS;  =  «}H,  i=  0,2. 

Move  to  point  3  if  xhX  <  tan  0  <  2f 

Tr  =  H-/1+J-2  «(S„  +  S,  +  S2  +  S3). 

^tfdSi  ■=  KHl/l+f2,  i=  0,1, 2, 3. 

The  more  accurate  5-point  scheme  also  allows  the  ray  to  go 
through  the  midpoints  of  the  sides  of  the  element.  Equations 
similar  to  the  above  can  easily  be  derived. 

RESULTS. 

The  various  algorithms  have  been  implemented  in  a  program 
written  in  3L-FORTRAN  and  executing  on  a  T800  transputer 
with  1  Mbyte  of  external  memory.  We  intend  implementing  them 
in  a  parallel  program. 

Two  results  are  of  interest  at  this  stage,  namely  results  relating 
to  accuracy  of  the  calculated  travel-time,  and  speed  of  execution. 
Both  results  are  dependent  on  the  shape  of  the  region,  the  position 
of  the  source  and  the  positions  of  the  receivers,  and  the  results 
given  below  for  the  specific  example  are  preliminary  results  given 
solely  for  the  purpose  of  getting  an  indication  of  the  order  of 
magnitude. 

As  our  typical  example  we  use  the  case  of  Nx  *>  2  Nz  and  f  « 
0.5,  with  the  source  in  the  bottom  left  comer  and  25  stations 
round  the  edges  opposite  the  source,  for  large  grids  (Nz  >  50), 
The  total  time  to  calculate  the  travel-time  field  for  a  grid  with 
N  points,  N  «  Nx  x  Nz,  plus  25  rays  with  their  associated  partial 
derivatives,  was  found  to  be  of  the  order  of  325  N  /is  using  the  3- 
point  scheme,  and  362  N  /xs  using  the  3-point  scheme. 

Using  a  constant  slowness  value  to  obtain  exact  values  for  the 
travel-time,  the  maximum  errors  found  for  the  calculated  travel¬ 
time  to  the  different  stations  were : 
using  the  3-point  scheme :  2.8  % 
using  the  5-point  scheme :  0.8  %. 

The  reduction  in  calculation  time  for  obtaining  the  travel-time 
field  when  using  the  simplified  equations  for  f  *■  1  was  found  to 
be  1%  of  the  total  lime. 

I:  is  hoped  to  present  detailed  results  of  the  parallel 
implementation  at  the  conference. 

REFERENCES. 

(1]  Mcndccki,  AJ.  The  Integrated  Seismic  System,  Paper  to 
be  submitted. 

[2]  Kissling,  E.  (1988),  Geotomography  with  local  earthquake 
data,  Reviews  of  Geophysics  26  659-698. 

(3]  Vidale,  J.E.  (1988).  Finite-difference  calculation  of  travel 
times.  Bull,  Seism.  Soc .  Am.  78, 2062-2076, 

[4]  Vidale,  J.E.  (1990).  Finite-difference  calculation  of 
traveltimcs  in  three  dimensions.  Geophysics  55,  521-526. 

The  work  described  herein  wa$  timed  out  as  a  contract  between  the  Bureau 
of  Industrial  Mathematics  of  the  University  of  Stellenbosch,  and  Advanced 
Minin*  Software. 


APPENDIX  A  :  FINITE  DIFFERENCE  EXPRESSIONS 
USED  IN  THE  EHCON4L  EQUATION. 

An  abreviated  notation  is  used  here.  We  write  T,  for  the 
travel-time  to  point  i,  i  =  0,1, 2,3,4  and  S  for  the  corresponding 
slowness  values,  with  the  points  defined  in  the  diagrams. 

Three  cases  can  be  identified.  In  all  cases,  only  the  points  in 
the  first  quadrant  are  shown  in  the  diagrams.  Identical  aquations 
are  found  for  the  remaining  three  quadrants. 

LStafltiEauhs,. source* 

To  start  the  solution,  we  take  TQ  =  0  at  the  source,  and  obtain 
the  travel-time  to  a  neighbouring  point  1-on  the  x-axis  by  taking 
the  average  value  of  the  slowness  between  the  two  points : 


T,  =H»A(S0  +  S,). 
Similarly,  for  point  2  In  the  z- 
direction 

T2=fH«6  (S0+S2). 


H 

m 

z 

o  l  * 

2 

2*.CQmapQia&. 

With  the  travel-time  known  at  three  points  (points  1,2,3)  of  a 
rectangular  element,  the  time  at  point  4  can  be  found  by  writing 
the  finite  difference  form  of  (1)  for  the  center  of  the  rectangle 
(point  0),  as  follows : 

Take  the  average  slowness  at  the  centre 

S0«  W(S,f  s2  +  s3  +  s4). 

Define  the  parameters 

a  =  mr2  +  1) 

and 

Use  central  difference  formulas  for 
the  centre  of  the  rectangle,  to  obtain : 


as 


H  2 

0. 

4 

«  (>/„)  (Wcr4  +  Tj>-wcr3  +  T,)] 
»  O/j,,)  (CVTV  +  OrTj)) 


(CVT,)-(VT3)]. 

Using  these  expressions  in  the  eikonal  equation  (1)  we  obtain  a 
quadratic  in  ( T4  -  Tj),  from  which  we  can  solve  for  T4 : 

T4  «  T,  *  6<T3  -  T2)  ♦  Va(2H  SJ*  -  <l-fl2)<T3  -  T2)* 

For  the  special  case  wi‘h  £  =  1,  a  =  1/2,  6=0,  and  this 
reduces  to  the  equation  derived  by  Vidale  (1988). 

3.  Midpoints, 

With  Ti  known  at  3  points  (0,1,2) 
on  a  column,  the  travel-time  to  the 
mid-point  on  the  next  column  can  be 
found  by  using  a  forward  difference 
formula  m  the  x-direction,  and  central 
differences  for  the  z-dircetion,  at  point 
0,  to  obtain : 


t3 =t0 

Similarly,  with  3  points  known  on  a  row . 

T3  =  T0  +  fV(H  Sp)2  -  U  (Tj  -  T,)?}. 
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Abstract 

A  Moote  Carlo  tcheme  for  calculating  nonlinear  radiation  tran* 
port  problem*  i*  developed  on  the  at*umption  that  the  thermal  radia¬ 
tion  emission  is  governed  by  local  thermtldynamic*  equilibria  (LTE). 
Some  numerical  example  it  Riven  and  *  conclusion  i*  drawn  from  the 
numerical  retails. 

1.  INTRODUCTION* 

That  thermal  radiation  interact!  with  matter  in  procettet  of  it* 
transfer  form*  radiation  hydrodynamic*  problem*.  Thermal  radiation 
here  U  taken  to  mean  electromagnetic  radiation  of  atomic  origin,  ob¬ 
tained  from  the  procetse*  of  tcattering,  abtorption,  and  thermal  emit, 
lion,  the  matter  U  generally  modeled  a*  a  fluid  of  electron*  and  Ion* 
who»e  motion  i*  govern'd  by  the  equation*  of  hydrodynamic*  The  ra¬ 
diation  Held  it  ditcribcd  by  a  Boltrman  transport  equation  for  photon* 
Tht»e  equation*  are  »o  complex  that  It  it  necessary  to  limphfy  the  gc* 
ometry  or  take  *ome  approximation*  of  equation*  for  tolving  them. 
The  mo*t  widely  u*ed  approximation  to  Boltunan  equation  i»  one 
croup  diffusion  theory.  But  in  tome  tituation*  difTuttioa  theory  I* 
midequite,  we  need  to  tolve  directly  the  traetport  equation.  Some 
available  »eb*me»  hat  been  given  (1,2).  Wc  here  diicutt  a  tcheme  for 
solving  radiation  trantfer  problem*  Involving  tcattering  proctu. 

H.  MATHEMATICAL  DESCRIPTION  OF 
THE  PROBLEM 

For  timplify,  tome  aitumption*  arc  taken  at  follow*: 

••The  electron  temperature  equilibrates  with  the  Ion,  we  call  them  ma< 
te till  temperature, 
b.  The  nutter  it  stationary. 

e.  The  thermal  radiative  emiiiion  i*  governed  by  LTE. 
d.  Thermal  conduction  in  (he  matter  it  ignored. 

With  above  awomption*.  the  radiation  transfer  problem*  it 
ditenbed  a*  following  equation*  ()-. 

J  o  f  is  •?/<,-,,  n ,) + .<(,>,r>o/(-.v,fi,o 

-;;».('.,,r)S(.,r)ar'<r,0 

m  - «,if  •  in,  a, 

V 

^  -  j* /fcn&iwo 
-  ta  f  (r.Oi* *' (r ,v,77^<v,7 ><fv 

“  .7 *J„ J*  i„ JU V,v.  .  o> 
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<r,T ,T>  +  r  .O  •  (T ) 

+  f (r,r,v-.v'kS-lT> 

2 hv  v  1 

r 

Here  )  i*  the  tpeafic  intensity  of  radiation,  T(r,r>  It  the 

metttial  ttmptratutt,  b(v,T>  i»  the  normtlutd  Plank  tptettum,  C,  i» 
the  tpecifie  heat,  e  it  the  light  tpeed  and  the  other*  have  the  phyiie* 
meaning*  aiutual. 

BI.  SEVfHMPLICIT SCHEME 

There  are  tome  coupling*  in  equation*  <I>  and  (2)  The  Plank  func¬ 
tion  emlttion  tource  and  crow  tection*  are  function*  of  material  tern 
perature.  In  addition,  the  total  tection  o,  depend*  on  the  intensity  of 
radiation.  Thete  coupling*  ean  be  divided  into  itrong  and  weak  cate¬ 
gories  (4l  Strong  coupling  refer*  to  the  temperature  dependence  of  the 
emiition  tource  in  EQ  (1)  and  the  radiation  field  dependence  of  the 
temperature  tn  EQ  (2);  weak  coupling  refer*  to  the  temperature  and  in 
tenuty  dependence  of  the  erott  tection. 

Wc  divide  time  interval  into  *ome  time  steps:  4t*t,/l« 
0«*t*<t*<—  <0<*“.  and  then  tolve  EQS  (1)  end  (2)  in  each  cycle 
At  with  ttrategy  that  treat*  ttrong  coupling  implicitly  and  weak  coop 
ling  explicitly.  Let 

E<?,t)moT'(r,t),  (4) 

Substituting  (4)  for  EQ  (2)  and  oting  a  backward  Euler  time  differ* 
eneing  of  EQ  <2),  we  obtain 

c"‘-&i"h(.r.T"w.T") 

-  (**(__  joJ  **  jaty.T" o’  •  o) 

W‘('.1,U)\±W)£\  (5) 

where 

f(f,D-4#rV,0/ri, 

Xf,n*  1/  (1  4- 

<r,<F,r>-  9 t(r 

We  ute  again  a  backward  Euler  time  differencing  of  EQ  (1)  and 
tobttitute  (5)  tot  the  term  of  emission  tource  to  obtain 

*  *it  **  »'>  9 ft* 
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Here  E*.T*  aad  I*  caa  S>r  obtained  from  last  cydr.  The  vaisrs  of 
r'itd  l''«r  to  be  obtained  frea  Ur.ce  T**  — 1*4**  —I*  or  from 
wiubls  cxtropolxuoa. 

13  every  cycle t*<t  <  t”1.  Ecjv.  (l),(2)  diseiiNet  radiation  trassfer 
pcoblrcs  bow  torn  into  E(Jt.  0).C)  and  0) 

r'"-<~’s  (11 

We  cote  (bat  there  are  eo  stroat  cocplitt*  it  Eqi  0>.(5).<!»  and  tbey 
aa  be  solved  explicitly,  ix-,  if  Eq.  (7>  it  solved  for  I**1,  then  Eq.  (5)  ex 
ffiatly  tire*  E*’,4sd  liea  Eq.  0)  explicitly  fcivrt  T‘*‘.  So  the  key  solv- 
nt  rail  alio*  transfer  problem  is  solving  Eq  (7).  However.  Eq.  (7)  ha* 
bees  (sell  a  lierar  transport  rqcatioa  is  each  cycle  At  that  can  be 
solved  easily  by  Monte  Carlo  method  il-2  or  descrite  ordisate*  (Sa) 
method  (it 

IV  NUMERICAL  EXAMPLES 

We  codtidtr  one  dimensional  slab  8  an  a  thickness,  bested  by  *  1 
kev  bUckbody  soorce  at  x-0.  The  slab  i*  divided  Into  three  zone*. 

D>-C*,.i,>.  D}*(i}.S)),  *,-0.  x,-4.  *,-5.  x,-$(c<n». 
Macroscopic  cross  section  of  each  rone  are  tives  by 

«>’.  -  — <■  h  (« 

V 

*.-0.  J- 1,2.3 

o,-o, -0  I,  Oj-10,  C,,-C,,-|00,  C.j—IWM  At- 1 x  i»rVs 
(lnt-|0*\eeond).  N’orotncal  rmlts  are  shown  in  figl.  which  shows 
the  material  trmperater;  distribotioss  at  difficrrct  moaeal  aad  com 


parrs  the  Moete  C»A»  teeperrere  wfet  this  by  ddh;=  Casr; 
orfearalcahad 

The  aoadswos  to  be  draws  £oa  these  numerical  reswts  si  that 
She  Mos»  Carlo  saecerxcal  re*si*  ss&rstse  as  that  by  tie  dnere 
or  dilates  method,  the  statistic al  Ecsdoa  wf  At  ndisuoa  mtmwty  sad 
oe^rul  IncKTiu;:  is  mil  axd  appear*  not  to  be  ptwsi'jcz  **  ey- 
*  t> r*  03- Tie  scheme  rtsmtabi  Ear  pM^rrdistba  hyfrotfynac*?* 
tetsuoiw 


Fr*  1.  Ttopctxiu  e  tncikiins  rates  b  W  by  MC  aid  So. 
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Abstract-Tie  orographic  effect  ca  atmospheric 
gravity  ware  i*  investigated  with  two-dimensional, 
coallsear,  tire  dependent  numerical  simulation.  The 
model  of  anelastic  equations  with  belgth  dependent 
basic  potential  temperature  is  applied  to  study- the 
problem.  Tie  crcgrapbically  generated  gravity  waves 
propagated  upward  to  the  alddle  atcospbere.  Tie 
computation  results  show  that  ware  properties  agree 
with  the  existing  solution. 

X.  INT20DXTI0S 

Gravity  waves  play  aa  extresely  important  role 
in  affecting  the  density  perturbation  In  the  slddle 
and  upper  atsosphere  (Hung  and  Lee,  1990).  The 
investigation  on  the  slddle  and  upper  atmospheric 
density  changes  is  not  only  need  for  space  vehicle 
design  but  also  for  numerical  weather  prediction. 

The  projects  such  as  the  Space  Shuttle,  National 
Aerospace  Plane,  Space  Telescope  and  Tethered 
Satellite  beniflt  iron  such  studies. 

Numerous  research  efforts  over  the  last  decades 
have  shown  gravity  wave  motions  play  a  sign i 1 leant - 
role  in  determining  the  circulation  and  structure 
of  the  middle  atmosphere.  Theoretical  and 
observational  studies  deronstrate  that  gravity  waves 
are  able  to  transport  momentum  and  energy  over 
considerable  distance.  Lilly  and  Klerp  (1979) 
investigated  the  orographic  effect  on  the  gravity 
wave  transportation  and  discovered  that  the  induced 
gravity  wave  transport  the  momentum  and  energy  to 
ceso-sphere.  The  energy  observed  and  dissipated  in 
the  middle  atmosphere.  The  gravity  waves  give  rise 
to  wave  drage. 

Smith  (1979),  Pitt  and  Lyons  (1989)  discovered 
that  «h.i  the  width  of  orography  is  greater  than 
the  Induced  gravity  wave  length,  the  gravity  wave 
propagated  upward  and  tilted  upwlndward.  There 
exists  a  maximum  wind  velocity  region  in  the 
orographic  leeward.  The  linear  theory  explains  that 
the  gravity  wave  transmits  upward,  reflects  from 
traupopause  and  transport  the  energy  to  accelerate 
the  air  at  lee  side. 

Klemp  and  Lilly  (1975),  Peltles  and  Clark  (1979) 
applied  the  non-linear  theory  to  investigate  the 
orographic  effect  on  gravity  wave.  They  Indicated 
that  the  energy  accelerating  the  lee  side  air  comes 
from  the  dispersion  of  gravity  waves.  The  theories 
on  the  mechanism  between  the  gravity  wave  and  1 
atoosphic  motion  are  different  based  on  various 
assumption. 

The  study  of  uilly  and  Klemp  (1979)  Indicated 
that  the  orographic  effect  on  the  gravity  wave 
propagation  plays  an  important  role  in  atmospheric 
motion.  Especially  gravity  waves  induced  by  irregular 
orography,  propagates  upvard  high  enough.  Then  the 
wave  dissipates  and  the  energy  is  absorbed  by 
atmosphere.  Therefore,  the  mecnanlsm  between  gravity 
wave  and  seso-sphere  atmospheric  motion  is  widely 
investigated.  The  gravity  wave  saturation  is  the 
source  of  kinetic  energy  of  atmospheric  motion,  e.g. 
the  trauposphere  and  stratosphere  atmospheric 
motion. 

The  drag  have  parameterized  by  Canadian  Climate 
Center  (KcFarlance,  1987)  to  investigate  FCMWF 
numerical  weather  prediction  phenomena.  Wallace  ec. 
al.  (1983)  Incorporated  the  parameter  in  studying 
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North  Hemisphere  circulation  during  winter  and 
reduced  the  medium  range  forecast  error.  Alpert, 
et.  al.  (1988)  applied  the  balance  method,  and 
considered  the  orographl-  effect  induced  gravity 
ware  drag,  improved  the  medium  range  forecast 
accuracy  in  National  Meteorological  Center  operation. 
Iwasaki,  Tana da  and  Tada  (1988)  employed  the 
ncmhydrostatic  gravity  wave  mode  and  investigated  the 
gravity  wave  drag  parameter  for  medium  range 
forecast. 

In  this  study,  the  anelastic  model  with  beigth 
dependent  basic  potential  temperature  model 
developed  by  Bacnelster  and  Scboeberl  (1989)  Is 
applied  to  study  the  orographic  effect  on  the 
gravity  waves.  Applying  tfce  model  described  and 
initial-boundary  value  specified,  we  can  solve  the 
governing .equations  and  obtain  the  gravity  wave 
properties  in  the  atmosphere. 

II.  MODEL  DESCRIPTION 

In  this  paper,  the  generation,  development 
and  damping  of  gravity  waves  induced  by  Orography 
is  investigated.  The  model  equations  based  on  the 
modified  fora  of  the  anelastic  equations  with  height 
dependent  basic  potential  temperation  (Macseister 
and  Scboeberl,  1539)  is  incorporated.  Using  the 
primitive  variables  (u,  v,  p),  ve  have  the  two 
dimensional  momentum  equation  in  horizontal  (x)  and 
vertical  (z)  component 
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In  order  to  couple  density  and  velocity  and  to 
filter  the  acoustic  wave,  the  local  variation  of 
density  in  equation  of  continuity  must  be  neglected 
(Vllhelsmson  and  Ogura,  1972) 

3pu  >pv 

- + - -  0  (3) 

3x  3z 

And  the  energy  equation  Is  of  the  fora 
30'  30*  se*  30 

- +  U -  +  V—  +  w — -  F(e')  (4) 

3t  3x  3z  3z 

In  above  equations,  p(z>  is  the  density,  0,  potential 
temperature;  R,  gas  constant;  Pqo,  the  reference 
pressure. 

The  potential  temperature  can  be  split  into  the 
upwind  steady  potential  temperature  0(z)  and 
disturbance  f>'(x,  z,  t), 

6(x,  z,  t)  -  0(z)  +  6*(x,  z,  t)  (5) 

The  diffusion  damping  terms  in  momentum  equation 
%nd  (2)  are 
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undisturbed  wind  aad  potential  tesperature 
distribution* 
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+  k* - y(*.  *)«.  (2) 
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where  y(x,  z)  is  Rayleigh  damping  coefficient-  The 
diffusion  dzmpicg  term  in  energy  equation  (4)  is 
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The  turbulent  diffusion  k,  .  (Orlanski  and  Ross 
1973)  1.  tatb 


3z 

(9) 

In  equation  (9),  Lzq  is  1  ka  (Bachelster  and 
Schoeberl,  1989),  Ax  is  the  vertical  grid  distance. 
k2  and  ki,  in  above  equations  are  basic  diffusion 
coefficient  k2  -  10**'a2/sec  and  bihaxmenlc  diffusion 
coefficient  k^  -  10S/N,  where  N  Is  Brunt-Saeisaelae 
frequency 
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In  order  to  consider  the  orographic  effect, 
the  coordinate  transformation  is  applied 

z  -  h(x) 

x*  -  x,  z*  -  zo  1  (11) 

z0  -  h(x) 

where  h(x)  is  the  orographic  height  and  zq,  the 
characteristic  height  considered.  Applying  the 
coordinate  transforaation,  vecan  transform  the 
physical  plane  into  Cartesian  coordinate  in 
cosputation  plane. 

Solution  of  the  governing  equations  is  initial- 
boundary  value  problen.  The  initial  values  of 
the  atmospheric  variables  are  those  of  background 
undisturbed  data.  Incorporating  the  periodic  wave 
model  developed  by  Bachelster  and  Schoebcl  (1989), 
we  consider  the  wave  motion  is  periodic  and  assnae 
that  the  cosputation  dosaln  is  periodic.  The 
center  of  the  cosputation  domain  in  x-dlrection  is 
the  hightest  point  of  the  orography.  The  domain 
extends  it  range  of  length  v  in  both  directions. 

The  sponge  dasping  boundary  conditions  are 
applied  (Figure  1). 

The  boundary  condtions  for  the  upwind  are  the 


where  H.  is  scale  height  potential  tesperature.  In 
general  case,  Hg  -  100  ka  and  N  -  0.01/sec.  The 

upwind  wind  profile  and  tesperature  distribution 
are  the  sounding  data.  For  the  downstream,  the 
spononge  boundary  conditions  are  applied.  Because 
the  viscosity  effect  are  cot  considered,  the  free 
slip  boundary  condition  is  esployed. 

The  initial  conditions  for  the  field  considered 
are  those  of  undisturbed  conditions.  At  the  very 
beginning  ox  the  time,  the  flow  properties  over 
the  domain  are  uniform.  Then  the*  flow  field  is 
disturbed.  The  gravity  wave  is  Induced  by  the 
orographic  effect  and  propogates. 

III.  RESULTS  AND  DISCUSSION 

At  first,  the  physical  plane  is  mapped  into 
computation  place  by  equation  (11).  Casting  the 
governing  equations  into  finite  difference  form  and 
applying  the  alternating  direction  implicit 
algorithm,  ve  obtain  the  numerical  solution  for  the 
flow  properties. 

In  order  to  check  the  accuracy  of  the  model  and 
the  computer  coding,  we  test  the  case  of  flow_over 
a  mountain.  The  orography  is  of  bell  shape  with 
height  500  e,  width  24  ka.  Assume  the  background 
undisturbed  wind  velocity  is  15  n/scc.  The 
disturbance  appears  because  of  the  existence  of 
the  orography.  At  the  Initial  time,  t  -  40  minutes, 
the  disturbance  propagates  upward  to  z  -  4  ka 
(Figure  2).  Figure  J  shows  that  the  streamlines 
become  steeper  near  the  lee  side.  At  time  c  ■  80 
minutes,  there  appears  a  periodic  wave  at  height 
3  ka  (Figure  4).  At  time  t  -  100  minutes,  the 
flow  pattern  shows  that  the  notion  is  dominated  by 
the  orography  effect  for  the  lower  atmosphere, 
z  <  3  ka,  Whilst  at  upper  atmosphere,  it  is 
controlled  by  the  wave  motion.  Figure  5-  As  time 
goes  on,  t  -  120,  140  minutes,  because  of  the 
nonlinear  mechanism,  the  dispersive  phenomena 
appear  (Figure  6,  7).  The  physical  phenomena  agree 
with  these  of  investigation  of  Peltier  and  Clark 
(1979). 

Then,  wc  apply  the  program  developed,  to  study 
the  gravity  wave  motion  in  the  atmosphere.  Assume 
the  thickness  of  the  atmosphere  be  100  km,  and 
Initial  velocity  distribution  is  function  of  height, 
equation  (12a).  Frltts  and  Dunkerton  (1984) 
proposed  the  velocity  distribution  be  of  the  form 

u(z,  0)  •  -  U  tanh  l(z  -  zc> /d )  (13) 

where  U  •  30  m/scc,  z  -  60  km,  d  -  10  ka.  The 
horizontal  wavclengh  fs  50  ka.  The  vertical 
velocity  disturbance  is 

v(x,  t)  -  w0f(t)  sin  kx,  (14) 

the  amplitude  is  function  of  time. 
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where  t.  •  2,  3  or  4T,  T  Is  the  period.  The 
Investigation  data  of  Fritts  (1985}  is  applied  for 
w3  »  1.0,  1.4,  2.0  s/sec.  Acd  horizontal  wavelength* 
are  16.7,  25  ,  50  km. 

Similar  to  the  previous  case,  unifroa  flow  over 
orography,  the  disturbance  is  Induced  by  the  lower 
boundary  orography.  The  instigated  wave  propagated 
upward  as  tine  goes  on. 

At  first,  we  examine  the  tine  variation  of 
potential  temperature  distribution.  The  disturbance 
(14)  is  added  at  initial  tine,  t  <s  2T.  Then,  no 
additional  disturbance  further.  As  t  «  3.0T  and  the 
lover  atmosphere,  z  <  50  km,  the  potential 
temperature  distribution  is  alsost  donainate.  by 
the  background  unifora  flow.  Figure  8a.  The 
gravity  wave  spreads  upward  atlO  ka  <  z  <  60  ks, 
the  wave  patterns  are  discernible.  Vhen  t  -  3.5T, 
Figure  8b,  the  rear  wave  coves  faster  than  the  front 
one.  The  wave  crest  and  trough  locate  alcost  at 
vertical  line.  Figure  8c  shows  that  at  t  -  4.0T. 
above  40  km,  especially  z  <  60ka,  the  tendency  of 
dispersive  appears  for  the  gravity  wave  development. 

The  disturbed  horizontal  velocity  Is  also 
shown  in  Figure  9.  At  tine-3.5  period,  there  arises 
unsteady  phenomena.  The  disturbance  velocity  is 
greater  than  10  a/sec.  We  assume  that  the  unifbra 
background  velocity  is  of  magnitude  30  m/sec.  The 
disturbance  grows  up  to  one  third  of  the  background 
value.  Figure  9a.  For  tise  goes  to  4T,  the 
disturbance  increases  to  20  a/sec  at  height  40  ka. 
Figure  9b  and  9c. 

The  investigation  indicates  that  wave  of  X  •  50  ka 
under  the  disturbance  such  as  orography,  behaves 
dispersive  tendency.  The  phenomena  agree  with  those 
investigated  by  Holten  and  Wehrbein  (1980). 

IV.  CONCLUSION’  AND  RECOMMENDATION 

Gravity  waves  play*  an  Important  role  In 
affection  the  density  disturbance  In  the  aiddle  and 
upper  ataosphere.  In  this  investigation,  model  of 
codified  fora  of  the  anelastlc  equation  with  height 
dependent  basic  potential  tesperature  is  applied 
to  study  the  disturbance  generated  by  orography. 

The  perturbation  Induced  by  orography  begins  to 
display  the  dispersive  phenomena.  The  relationship 
between  the  nonlinear  effect  and  dispersion  Is 
obvious.  The  results  agree  with  the  study  of  Holten 
and  Webrbein  (1980) 

According  to  the  observation  of  VHF  radar,  the 
maxisua  amplitude  of  density  perturbation  caused  by 
gravity  waves  associated  with  Typhoon  at  Vest 
Pacific  Region  is  15Z.  While  for  tropical  atoras 
at  Northern  America,  r,he  amplitude  disturbance  is 
+  12Z  (Hung  and  Lee,  1990).  This  Is  an  lntrestlng 
problea.  Besides  the  density  variation  with  height, 
the  gravity  force  codification  oust  be  considered  in 
the  further  study. 
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Figure  1.  Computation  domain 
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Figure  3.  Flow  pattern  at  t  -  60  minutes  for  Figure  2. 


Figure  5.  Flow  pattern  at  t  -  100  minutes  for  Figure  2, 


Potential  temperature  disturbance  papagation 
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Abstract  -  In  this  work  a  system  for  calculate  the  solution 
of  biharmonic  equation  is  developed.  In  the  main  step  of 
calculation  by  numerical  method,  there  are  n  system 
equations  and  n  variables  being  generted.  The  step  of 
calculation  arc  very  cumbersome  and  tedious  jobs. 
Program  for  calculate  these  solutions  is  written  for  the 
purpose  of  handling  such  problems.  This  technique  is 
suitable  as  a  simple  and  time  saving.  This  program  is 
used  to  calculate  the  solutions  of  biharmonic  equation  and 
display  the  results  by  printer  and  plotter. 


I.  INTRODUCTION 


In  this  paper  cur  object  is  to  develop  a  finite- 
difference  method  to  solve  the  biharmonic  equation. 

4  4  4 


<>$,.,  d*i>  t  dt>  _ 

d*  3xJ3yJ  fy4 


0 


(l) 


This  equation  may  by  satisfied  by  taking  the  function  ♦  in 
the  following  form: 

(f  =  cos^pffy)  (2) 


where  $  is  the  stress  function,  m  is  an  integer.  1  is  the 
length  of  rectangular  beams..'f(y)  is  a  function  of  y  only. 
Substituting  equation  (2)  into  equation  (1)  and  using  the 
notation  mrt/1  =  a  ,  we  find  the  following  equation 
for  determining  f(y) : 

f'V(y)  •  2(2  f' (y)  +  cA(y)  =  0  (3) 


II.  METHODS 

Finite-Difference  .method. 

To  solve  a  boundary-value  problem  by  the  method 
of  finite  differ  nces,  every  derivative  appearing  in  the 
equation,  as  well  as  in  the  boundary  conditions,  is 
replaced  by  an  appropriate  difference  approximation. 
Central  differences  arc  usually  preferred  because  they 

lead  to  greater  accuracy.  Using  the  difference 
approximation  with  y0  =  0,  yN  =  L,  and  Nh  -  L, 

we  will  have 


fn2+(-4-2ahVni  +  (6+aV-*  4aV)f„  ()) 

+  (.  4  -  2a  hVn*ri-  fm2  =  0,  n  =  l,  2 . N-t 

For  y  =  0,  f0 -f(yo>  =  A 

For  y  =  N,  f[i=  f(yt3  =  A  (5) 

and  f.i  =  f  i,  fnsi  •“  -fun 


writing  out  (4)  for  n  =  I,  2,  ...,  N-l  and  using  the 
boundary  condition  (5),  wc  obtain  the  matrix  which 
nonzero  elements  appearing  only  along  the  principal  five 
diagonals.  An  algorithm  is  again  available  for  solving 
directly  such  systems. 


The  algorithm  consists  in  applying  the  following 

steps: 


The  equation  (3)  can  solve  by  finite-difference  method. 
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] .  Compute  the  initial  values 
W,  =  C, 

P,  =DIW„Po=0,  pM  =  0 
7,  =  EtW,,  70  =  0,  =  yN_2  = 

2.  Compute  recursively- 


Tnc-solution  of  bihanttonic  equation  was  programmed 
for  computer 


S„=  B.-A.ft* 

Wn  “  CD-  AnYfl.2-  8J50.j 

V  —  ^fl"  ^p7n-i 

Tn  "  W„ 

y  =  -2. 

W„ 

where  n  =  2, 3 . N-l 

3.  Compute 


I.  _  bn-Ajha.j-S^ln.j 
"  — - 

where  n  =  2, 3 . N-t 

4.  Compute  the  values  of  f  backward,  using 
fn-i  =  hH,i 

Pi/n*l  ’  Y/n*2 
where  n  =  N-2,  N-3, ....  1 

We  can  now  apply  these  algorithms  to  obtain  the  solutions 
of  bihatmonics  equation. 

CoillPAiter.tesults  for  biharmonic  solution. 

From  article  II,  we  obtain  the  solution  of 
biharmcnic  equation  by  substituting  the  following, 
conditions: 


ACKNOWLEDGMENT 

The  authors  thank  Dr.  Wudhibhan  Prachyabrucd 
for  his  suggestions  which  greatly  improved  the  paper. 

REFERENCES 

1.  J.P.  Coleman,  A  new  fourth-order  method  for  y"  = 
g(x)y  +  r(x).  Preprint,  Dept,  of  Mathematics,  Univ.  of 
Durham,  England,  March  1979. 

2.  V.I.  Krylov  and  L.T.  Shul’gina,  Handbook  of 
Numerical  Integration  [  in  Russian  ),  Nauka,  Moscow 
(1966),  p.370. 


yo  =  0  ,  y„  =  2.0 

N  =  20  ,  h  =0.1 
f0  =  150  ,  fN  =  150 


2012 


Vti  THE  NUMERICAL  SOLUTION  OF  A  MODEL  FOR 
A  SINGLE ' SPECIES  POPULATION  DYNAMICS 
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Abstract-  In  this  paper  we  present  a  numerical 
method  for  solving  an  integro-differential 
equation  which  is  a  model  for  age-dependent 
populations  with  spatial  diffusion. 'Our 
numerical  scheme  is  based  on  finite  difference 
method  and, discretization- of  the  model  yields 
a  linear  system  with  block  tridiagonal  matrix. 
The  stability. of  the  method  is  discussed  by 
considering  the  eigenvalues  of  this  matrix. 

1.  INTRODUCTION 


There  has  been  much  work  on  mathematical 
theories  of  biological  populations.  The  best 
known  model  of  age-dependent  population 
dynamics  was  first  introduced  by  Von  Foerster 
[1].  In  this  model  it  is  assumed  that  the 
death  and  birth  processes  depend  only  on  age. 
Several  authors  including  MacCamy  and  Gurtin 
12],  Hoppcnstcadt  (3],  Swick  (4],  drived  a 
model  based  on  the  Von  Foerster  model  in  which 
the  birth  and  death  processes  are  allowed  to 
depend,  in  addition  to  age,  on  total 
population  size. 

The  problem  of  spatially  nonhoaogeneous, 
age-dependent  population  dynamics,  i.e.  the 
case  where  population  density  can  vary  with 
space,  age  and  time  has  been  considered  by 
Zachmann.  and  Logan  (5]  and  MacCamy  [6]  and 
Gurtin  [7]. 

The  model  ini [6]  concerns,  a  single  species 
population  moving  in  a  limited  one-dimensional 
environment  and  avoiding  crowding.  The  model 
is  based  on  the  following  equations: 


*Vpa =-<lx(x,a,t)-X(a,p)p,  n.l) 

0<x<d ,  0<t,  0<a<L 


L 

P(x,t)=J  p(x,a, t)du 
o 

p(x,0,t)sb(x,t) 

p(x,a,0)=p(x,a) 

p(0,a,t)=p(d,a,t)=0 


0<x<d 

,  0<t 

(1.2) 

0<x<d 

,  0<t 

(1.3) 

0<x<d, 

0<a<L 

(1.4) 

0<t, 

0<a<L 

(1.5) 

where 

p(x,a, t)  Is  the  population  density,  that  is, 
the  number  of  individuals,  per  unit  volume  of 
ago  a  at  time  t  and  position  x,  whore  x  is 
restricted  to  an  interval  0<x<d. 

P(x,t)  is  the  total  population  density  and  L 
is  the  maximum  life  span  of  individuals. 

q(x,a,t)  represent  the  population  flux  and 
it  is  assumed  to  bo  proportional  to  the 
gradient  of  total  population: 


q(xfa,t)=-X(a)7P(x,t)  (1.6) 

where  &(a)>0  is  the  diffusivity.  The  negative 
sign  indicates  that  the  flow  of  population 
always  lies  in  the  direction  of  decreasing 
density. 

X(a,P)>0  is  the  death  rate  and  the  term 
X(a,P)p  in  Bqh.(l.l)  is  the  loss  of 
individuals  of  age  a  at  x,  due  to  deaths. 

b(x,t)  is  the  birth  process  and  is  given  by 
the  regeneration  rule  of  the  fora: 
x. 

b(x,t)=j0(a,P>p(x,a,t)da  (1.7) 

o 

where  /?(a,P)  is  a  non-negative  function  often 
called  the  birth-rate. 

p(x,a)  is  the  initial  age-space 
distribution. 

The  boundary  conditions  in  Eqn.(1.5), 
biologically,  assert  that  all  individuals 
reaching  a  boundary  leave  the  interval  (0,d). 

In  (6],  [7]  the  authors  assume  that  the 
individuals  can  live  to  an  arbitary  age,  so 
they  take  L=+to  in  Eqns.(1.2)  and  (1.7).  As  was 
asserted  in  [6]  and  [7],  finding  a- solution 
p(x,a,t)  of  Eqns. ( 1. 1 )-( 1 .5)  in  general,  is 
difficult  and  it  was  left  as  an  open  problem. 

The  purpose  of  this  paper  is  to  develop  a 
numerical  scheme  to  approximate  p(x,a,t)  by 
simplifying  assumptions  as  follows: 

It  will  be  assumed  that  k,  the  diffusivity, 
X,  the  death  rate  and  ft,  the  birth  rate  are 
constants.  Thus  the  mathematical  model  for 
p(x,a, t)  is: 

p(tPa=JtJpxx(x.a,t)da-Xp,  (1.8) 

°  0<x<d,  0<t,  a<L 

L 

P<x,t)rjp(x,a,t)da  0<x<d,  0<t  (1.9) 

o 

L 

p(x,0,  t)=b(x,  t)=Jjfy>(x,a,  t  )da  (1.10) 

“  0<x<d,  0<t 

p(x,a,0)=p(x,a)  0<x<d,  0<a<L  (1.11) 

p(0,a,t)Fp(d,a,t)=0  0<t,  0<a<L  (1.12) 


2.  IMPLICIT  FINITE  DIFFERENCE  SCHBMB 


Wc  approximate  Eqn.(1.8)  by 

kM  k  k  k 

u  -u  u  -u 
V/)  v.)  .  *>J  «.)•»  _ 
it  Aa 


(2.1) 


k£cx 


(Ax)4 


-Xu 
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where 


u  ~  p(x  ,a.,t.  )=p( i Ax, jAa,kAt) , 

\,i  \  j  k 

1=1,2,.. .  ,1,  j=l,2,...,a.  k=0, 1,2, .... 


A  <d'  A  1L 

•  ix=m’ 

The  sub  in  (2.1)  is  a  quadrature  formula 
approximating  the  integral  in  (1.8).  Assuming 
At=Aa,  Bqn. (2.1)  ay  be  written  in  the  fora 

, .  ,  bl  .  k*i'  _  Wi»  kti  , 

(lts)u  -r£«  (u  -2u  +u  )  = 
i.)  Z,  P  *-*.P  l«P  '**.P 
P-* 


for  each  i=l,2,.«.,  l,  j=l,2, • • ■ »a,  k:0(l(... 
where 


In  view  of  Bqns. ( 1. 10)-( 1. 12) ,  the  following 
conditions  arc  iaposed  on  the  difference 
Bqn. (2.2): 

uk  =b^=b( iAx,  jAa)r/3T' a  uV  ,  (2.4) 

t  ,o  v  p  i .  p 

pal  K  V 

u°  ,-p  =p(iAx,jAa), 

MS  *  »  S  r 


o.J  l+i 

1:1,2 . t,  j=l,2 . a,  k=0, 1,2 . 

The  sub  in  (2.4)  Is  again  a  quadrature  foraula 
approxiaatlng  the  integral  in  (1.10).  ^ 

By  Bqn. (2.2)  we  can  coapute  the  unknown  u  * 
k 

in  terms  of  knowa  u  •  Collecting  the  Ixa 
unknowns  u^1  and  tx»  knowns  in 

Bqn. (2.2)  into  coluan  vectors,  yialds 

_  f  k*l  k*i  k*i  k*i 
[  i.i  i,m  z,i  2.m 

kn  k»l'|^ 

mj  -  (Z-5) 

C=fuk  ...uk  ;  uk  ...uk  ; 

[  1.0  l,m-l  2,0  l.Tivt 

k  k  1T 

U,  ...u,  ,  (2.6) 

l,Q 

then  Bqn. (2.2)  is  equivalent  to  a  syatea  of 
linear  equations  of  the  fora 


Iia  the ^identity  matrix  of  order  a  and  R  is  a 
square  matrix -of,  order  a  which  has  rank  one 
and  is  given  by 


where  r=ra.,  j=l,2,...,a. 
)  J 


3.  STABILITY 

In  this  section  we-  exaain  the  stability* for 
(2,2)  or  equivalently "for  (2.7)  by  analizing 
.error- grjowth.  As suae  at  time  level  k^an  er’or 
vector  E  was  aade  in  representing  the  vector 
C.  -Then  an  error  A  B*'  would  de  propagated  in 
X;  since  by  Bqn. (2.7),  we  have 

X=a"‘  (CiEk)=A‘‘c+A"1Kk . 

In  the  next  section  we  wiil  show  that  A*1 
exists  by  showing, that  none  of  itsv eigenvalues 
is  zero. 

Let  ,  , 

„wi  . -l„k  ,  _  .. 

B  =A  B  ,  (3.1) 

then  we  have  a  relation  between  the  errors  at 
tiae  levels  k  and  k+1  and  the  aethod  defined 
by  Bqn. (2.7)  is  stable  if 


M  |^| . 


whore  Jj  (J  is  the  Buclidian  norm. 

To  find  conditions  under  which  (2.7)  is 
stable,  in  this  sense,  first  we  find  the 
eigenvalues  and  eigenvectors  of  A  in  (2.8). 

Since  R  has  rank  one,  N(R),  the  null  space 
of  R,  has  dimension  a-1.  Therefore  there 
exists  a-1  lir.sarly  independent  coluan  vectors 


w  €  (R  such  that  RwsO, 
m-i  j 

,a-l.  Now  consider  the  vectors 
in  the  fora 


j=l,2 . a-1, 

where  0  denotes  the  zero  coluan  vectoi  in  £R  . 
Lot  us  relabel  the  above  fx(a-l)  veotors  and 


where  A  is  an  txm  by  Ixa  matrix  given  by 
r'l*s)I+2R  -R  I 


l(a-l)  vectors  are  linearly  independent  and 
satisfy 

Av  s(Hs)v  ,  1=1,2,.  .. ,{(■-!).  (3.2 


-R  ( lfe) I+2R 


It  follows  from  Bqn. (3.2)  that  \=l+s  is  an 
eigenvalue  of  A  of  multiplicity  at  least 
«»-l). 
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Now  we  look  for  l  eigenvectors  in  the  for* 


N={w 

s={vv 


&n-l>  *  X ?  Z  *  *  *  * 


<3. 3) 


Lemma.  The  set  S  is  linearly  independent. 

The  proof  of  this  lemma  will  be  given  in  the 
next  section.  Let  us  use  the  letter  v  to 
denote  the  vectors  in  S  by  setting 

u  =v.  ,  u  =v,  , . . . , 

1  «m-mi  2 


eigenvalue.  Then  in  R  we  have 

Au=Xu.  (3 

m 

Let  y=£  r  •  then  it  follows  from  Bqn.(3.4) 

,=i  J 

that 

{Ua*2r)ri-y72  =Xyt 

-yy  4(l4s42y)y  -yy  =Xy 


~Y7 i- 1 + * 1 +B*2r )  yrXyf 

In  Matrix  for*  the  above  system  of  l  linear 
equations  aay  be  written  as 

BY=XY,  (3.5) 

where  the  aatrix  B  and  the  vector  y  have  the 
fora 


l4s+2y  -y 

y,' 

B= 

-y  lts42y  -y 

t  y= 

-y 

-y  I4s42y  • 

u. 

Note  that  B  is  a  square  aatrix  of  order  /. 
Bqn.(3.5)  shows  that  X  Is  also  an  eigenvalue 
of  B.  But  the  eigenvalues  of  B  are  given  by 


V1*8*^008  HTm?  i=I-2- 


and  hence  wo  have  obtained  the  l  other 
eigenvalues  of  A  which  (since  y*0)  are 
obviously  distinct. 

If  u(i  =  l, 2, are  the  eigenvectors 

corresponding  to  A^(i  =  l,2 . t ) ,  thon  they 

are  linearly  independent.  Note  that  each 

has  the  fora  (3.3).  Now  wo  have  the  sets  of 
eigenvalues  and  eigenvectors  of  A  which  we 
denote  thea  by  T  and  S  respectively 

T=MJN  (3.7) 

where 


H={\=V 


14s,  1=1,2,. 


3={vv* . v4 


Now  we  return  to  the  discussiojni  of  stability 
of  Eqn.(2.7).  The  error  vector  E  aay  be 
expressed  uniquely  as 


B  =  £  c  v 
u  \  \ 


and  using  Eqn.(3.1),  we  obtain 
(m  fm- 

B  =a'‘j  cvrj  oA‘‘v  =  £  cX.‘‘v  .  (3.10) 


Froa  Bqn.(3.9)  and  (3.10),  it  follows  that  the 
error  does  not  grow  if 


|X.|*|SI.  1=1,2 . (». 


This  condition  holds  by  (3.7)  and  therefore  we 
have  the  following  result: 

Theorem.  The  iaplicit  foraula  (2.2)  is 
unconditionally  stable. 

4.  PROOF  OF  THE  LEMMA 

Lot  C=<v  . . .  l  and 

(  i  2 

D={vv...u  .}•  Let  V  be  the  span  of  C  and  U 

be  the  span  of  D.  Then  U  and  V  are  subspaces 
of  R^Xm  and  since  v^ (1=1,2 . l(m-l))  are 

linearly  independent,  wo  have  dimV=f(m-l ) . 
Similarly  dinU={. 

Lot 

W=U4V={u4v:u€U,  veV) , 

then  W  also  is  a  subspace  of  and  ScW, 

where  S  is  defined  by  Eqn.(3.8).  Wo  assort 
that 

UiVs(O)  ,  (4.1) 

which  implies  that  W=U$V,  that  is,  W  is  the 
direct  sum  of  U  and  V. 

Let  u^ell'W  and  u%0,  then  u  can  be 
expressed  uniquely  as 
2015 


u%  £  b  u  (4.2) 

.  x  i 
ui 

and 

f'm-O 

u*=  £  ov  (4.3) 

i=l 

where  the  b  and  are  scalers.  Multiplying 

both  sides  of  Eqns.(4.2)  and  (4.3)  by  A,  we 
obtain 

.  t  t 

Au=£bAu  =  £b\u  (4.4) 

\  x  ‘'III 

*  =  »  X-i 

and 

Au  s  £  c  Av,:  £  c  ( l+s) v  =(  l+s)u  (4.5) 

i  i  “  i  i 

1=1  151 

where  A, (1=1 ,2, . . . ,()  are  defind  by  Eqn.(3.7). 
From  Eqn8.(4.4)  and  (4.5),  we  can  write 

u%e  fes\v  <«••> 

1=1 

Since  the  rcspresentation  of  u  in  Eqn.(4.2) 
is  unique,  we  reach  a  contradiction.  This 
proves  (4.1). 

Now  we  have 

dim  w=duv+duu=m<«-o=f»- 

Hence  W=tR^Xm,  and  Ixm  vectors  in  S  which  span 

Or  m  are  linearly  independent.  This  completes 
the  proof. 
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